THE JOURNAL OF ABNORMAL AND SOCIAL PSYCHOLOGY

Editor: J. McV. HUNT, University of Illinois
Associate Editor: ARTHUR JENNESS, Williams College
Book Review Editor: STANLEY G. ESTES, Harvard University
Managing Editor: LORRAINE BOUTHILET

CONSULTING EDITORS

GORDON W. ALLPORT, Harvard University
HADLEY CANTRIL, Princeton University
LEE J. CRONBACH, University of Illinois
JOHN DOLLARD, Yale University
LEONARD W. DOOB, Yale University
LEON FESTINGER, University of Minnesota
CHARLES M. HARSH, U. S. Navy [...], San Diego
WILLIAM A. HUNT, Northwestern University
ERNST KRIS, Yale University School of Medicine
DONALD W. MACKINNON, University of California
QUINN MCNEMAR, Stanford University
O. HOBART MOWRER, University of Illinois
LAWRENCE I. O'KELLY, University of Illinois
CHARLES E. OSGOOD, University of Illinois
SAUL ROSENZWEIG, Washington University
HAROLD SEASHORE, Psychological Corporation
DAVID SHAKOW, National Institute of Mental Health
RICHARD L. SOLOMON, Harvard University
ROSS STAGNER, University of Illinois
IVAN STEINER, University of Illinois
ELIZABETH COLLINS WESMAN, White Plains, N. Y.
ROBERT W. WHITE, Harvard University
CHARLES WRIGLEY, University of Illinois

VOLUME 49

1954

PUBLISHED BY
THE AMERICAN PSYCHOLOGICAL ASSOCIATION, INC.
1333 SIXTEENTH STREET N.W., WASHINGTON 6, D. C.
AND MT. ROYAL & GUILFORD AVES., BALTIMORE 2, MD.

Copyright 1954 by the American Psychological Association, Inc.


CONTENTS OF VOLUME 49

ARTICLES, CASE REPORTS, CRITIQUE AND NOTES

Allison, H. W., & Allison, S. G. Personality Changes following Transorbital Lobotomy
Applezweig, D. G. Some Determinants of Behavioral Rigidity
Atkinson, J. W., Heyns, R. W., & Veroff, J. The Effect of Experimental Arousal of the Affiliation Motive on Thematic Apperception
Becker, W. C. Perceptual Rigidity as Measured by Aniseikonic Lenses
Bell, G. B., & Hall, H. E., Jr. The Relationship between Leadership and Empathy
Bennett, E. M., & Johannsen, D. E. Some Psychodynamic Aspects of Felt Parental Alliance in Young Children
Bird, C., & Monachesi, E. D. Prejudice and Discontent
Blake, R. R., & Brehm, J. W. The Use of Tape Recording to Simulate a Group Atmosphere
Blake, R. R., Mouton, J. S., & Fruchter, B. The Consistency of Interpersonal Behavior Judgments on the Basis of Short-Term Interactions in Three-Man Groups
[...] An Experimental Reunion of Psychoanalytic Theory with Perceptual Vigilance and [...]
[...], D. R., [...]
Brown, R. W., & Lenneberg, E. H. A Study in Language and Cognition
Burdock, E. I. A Case of ESP: Critique of "Personal Values and ESP Scores" by Gertrude R. Schmeidler
Chodorkoff, B. Self-Perception, Perceptual Defense, and Adjustment
Cohen, L. D. Level-of-Aspiration Behavior and Feelings of Adequacy and Self-Acceptance
Cohn, B. N. Projective Methods and Verbal Learning
Cohn, T. S., & Carsch, H. Administration of the F Scale to a Sample of Germans
Cowen, E. L., & Beier, E. G. Threat-Expectancy, Word Frequencies, and Perceptual Prerecognition Hypotheses
Cox, F. N., & Sarason, S. B. Test Anxiety and Rorschach Performance
Dorris, R. J., Levinson, D. J., & Hanfmann, E. Authoritarian Personality Studied by a New Variation of the Sentence Completion Technique
Douglass, W. B., & Brown, K. B. Adequacy of the Reporting-Test Measures for [...] Reactions to Frustration
Eglash, A. Fixation and Inhibition
Eisen, N. H. The Influence of Set on Semantic Generalization
Eriksen, C. W. Psychological Defenses and "Ego Strength" in the Recall of Completed and Incompleted Tasks
Eriksen, C. W. Some Personality Correlates of Stimulus Generalization under Stress
Festinger, L., & Hutte, H. A. An Experimental Investigation of the Effect of Unstable Interpersonal Relations in a Group
Fiedler, F. E. Assumed Similarity Measures as Predictors of Team Effectiveness
Fisher, S. The Role of Expectancy in the Performance of Posthypnotic Behavior
Frenkel-Brunswik, E. Social Research and the Problem of Values: A Reply
Gilchrist, J. C., Ludeman, J. F., & Lysak, W. Values as Determinants of Word-Recognition Thresholds
Gilchrist, J. C., Shaw, M. E., & Walker, L. C. Some Effects of Unequal Distribution of Information in a Wheel Group Structure
Goldberg, S. C. Three Situational Determinants of Conformity to Social Norms
Goodstein, L. D. Affective Tone and Visual Recognition Thresholds
Greer, F. L., Galanter, E. H., & Nordlie, P. G. Interpersonal Knowledge and Individual and Group Effectiveness
Grossack, M. M. Some Effects of Cooperation and Competition upon Small Group Behavior
Halpin, A. W. The Leadership Behavior and Combat Performance of Airplane Commanders
Harrington, R., & Ehrmann, J. C. Complexity of Response as a Factor in the Vocabulary Performance of Schizophrenics
Hastorf, A. H., & Cantril, H. They Saw a Game: A Case Study
Hollander, E. P. Authoritarianism and Leadership Choice in a Military Setting
Holtzman, W. H., & Sells, S. B. Prediction of Flying Success by Clinical Analysis of Test Protocols
Horwitz, M., & Lee, F. J. Effects of Decision Making by Group Members on Recall of Finished and Unfinished Tasks
Howard, A. R., & Kelly, G. A. A Theoretical Approach to Psychological Movement
Irvine, R. P. Critical Flicker Frequency for Paretics and Schizophrenics
Jahoda, G. Political Attitudes and Judgments of Other People
Janis, I. L., & King, B. T. The Influence of Role Playing on Opinion Change
Jessor, R. The Generalization of Expectancies
Jourard, S. M. Ego Strength and the Recall of Tasks
Keehn, J. D. The Color-Form Responses [...] Neurotic Subjects
Kornetsky, C., & Gerard, D. L. Effect of Increasing the Number of Rorschach Responses on [...]
Kurland, S. H. The Lack of Generality [...]
Landfield, A. W. A Movement Interpretation [...]
Lanzetta, J. T., Haefner, D., Langham, [...]
Lazarus, R. S. Is There a Mechanism of Perceptual Defense? A Reply to Postman, Bronson [...]
Levine, J., Laffal, J., Berkowitz, M., Lindemann, [...] Psychiatric and Medical Patients
Linton, H. B. Rorschach Correlates of Response to Suggestion
Linton, H. B. Autokinetic Judgments as a Measure of Influence
McBain, W. N. Imagery and Suggestibility: A Test of the Arnold Hypothesis
McConnell, J. Abstract Behavior among the Tepehuan
McGaughran, L. S. Predicting Language Behavior from Object Sorting
McKeachie, W. J. Individual Conformity to Attitudes of Classroom Groups
MacRae, D., Jr. A Test of Piaget's Theories of Moral Development
[...]
Mausner, [...]
Melikian, L., & Prothro, [...]
Meresko, R., Rubin, [...] Habits and Its [...]
Mohsin, S. M. Effect of Frustration on Problem-Solving Behavior
Murray, E. J. A Case Study in a Behavioral Analysis [...]
Mussen, P. H., & Naylor, H. K. The Relationship [...]
O'Reilly, C. T., & O'Reilly, E. J. Religious Beliefs [...]
Robins, E., & Mensh, I. N. Predic[...] Psychiatry, and Psychology
Rohrer, J. H., Baron, S. H., Hoffman, E. L., & Swander, [...]
Shaw, [...] Various Communication Nets
Silverman, R. E. Anxiety and the Mode of Response [...]
Solomon, R. L., & Coles, M. R. A Case of Failure of [...] across Situations
Sommer, R. On the Brown Adaptation [...]
Soskin, W. F. Bias in Postdiction [...]
Sullivan, P. L., & Adelson, J. Ethnocentrism and Misanthropy
Talland, G. A. The Assessment [...]
[...] Memory Distortion of Favorable and [...]
Taylor, J. A., & Spence, K. W. Conditioning Level in the [...]
Thigpen, C. H., & Cleckley, H. A Case [...]
Webb, S. C. Irregularities in Judgment Data Collected by the Method of Equal-Appearing Intervals
Wenar, C. Reaction Time as a Function of Manifest Anxiety and Stimulus Intensity
Whiteman, M. The Performance of Schizophrenics on Social Concepts
Wispé, L. G. Physiological Need, Verbal Frequency, and Word Association
Wright, G. O. Projection and Displacement: A Cross-Cultural Study of Folk-Tale Aggression
Yarnold, J. K., & Berkeley, M. H. An Analysis of the Cattell-Luborsky Humor Test into Homogeneous Scales
Zelen, S. L., & Levitt, E. E. Notes on the Wesley Rigidity Scale: The Development of a Short Form
Zubin, J., & Windle, C. Psychological Prognosis of Outcome in the Mental Disorders

BOOK REVIEWS

(The reviewer's name appears in parentheses after the title of the work.)

Cartwright, D., & Zander, A. (Eds.) Group Dynamics: Research and Theory (E. L. Hartley)
Christie, R., & Jahoda, M. Studies in the Scope and Method of "The Authoritarian Personality" (T. M. Newcomb)
Evans, J. Three Men: An Experiment in the Biography of Emotion (A. B. Wood)
Eysenck, H. J. The Structure of Personality (W. Stephenson)
Gorlow, L., Hoch, E. L., & Telschow, E. F. The Nature of Nondirective Group Psychotherapy: An Experimental Investigation (J. L. Singer)
Harding, D. W. Social Psychology and Individual Values (H. Bonner)
Jones, E. The Life and Work of Sigmund Freud. Volume I. The Formative Years and the Great Discoveries, 1856-1900 (R. R. Holt)
Jones, M., in collaboration with Baker, A., Freeman, T., Merry, J., Pomryn, B. A., Sandler, J., & Tuxford, J. The Therapeutic Community: A New Treatment Method in Psychiatry (J. L. Singer)
Kinsey, A. C., Pomeroy, W. B., Martin, C. E., & Gebhard, P. H. Sexual Behavior in the Human Female (G. V. Ramsey)
McClelland, D. C., Atkinson, J. W., Clark, R. A., & Lowell, E. L. The Achievement Motive (C. C. McArthur)
Moreno, J. L. Who Shall Survive? Foundations of Sociometry, Group Psychotherapy and Sociodrama (E. L. Hartley)
Roe, A. The Making of a Scientist (S. G. Estes)
Simpson, G. E., & Yinger, J. M. Racial and Cultural Minorities: An Analysis of Prejudice and Discrimination ([...] G. Smith)
U. S. Veterans Administration. Group Psychotherapy: Studies in Methodology of Research and Therapy (J. L. Singer)

PSYCHOLINGUISTICS

A SURVEY OF THEORY AND RESEARCH PROBLEMS

Report of the 1953 Summer Seminar
Sponsored by the Committee on Linguistics
and Psychology of the Social Science
Research Council

CHARLES E. OSGOOD, Editor
THOMAS A. SEBEOK, Associate Editor

With a Foreword by JOHN W. GARDNER

JOHN B. CARROLL
SUSAN M. ERVIN
JOSEPH H. GREENBERG
JAMES J. JENKINS
FLOYD G. LOUNSBURY
LEONARD D. NEWMARK
SOL SAPORTA
DONALD E. WALKER
KELLOGG WILSON

WAVERLY PRESS, INC.
Baltimore
1954

Copyright by
THE TRUSTEES OF INDIANA UNIVERSITY


CONTENTS

Foreword, by John W. Gardner
[...]
1.2. Disciplines Concerned with Human Communication
1.3. Plan of this Report
2. Three Approaches to Language Behavior
2.1. The Linguistic Approach
2.2. The Learning Theory Approach
2.3. The Information Theory Approach
3. Psycholinguistic Units
3.1. Psychological Bases of Unit Formation
3.2. Relations Between Psychological and Linguistic Units
3.3. The Word as a Linguistic Unit
3.4. Hierarchies of Psycholinguistic Units
4. Synchronic Psycholinguistics I: Microstructure
4.1. Within Band Organization
4.2. Between Band Organization
5. Sequential Psycholinguistics
5.1. Transitional Probability, Linguistic Structure, and Systems of Habit-family Hierarchies
5.2. Certain Characteristics of Phoneme Sequences
5.3. Applications of Entropy Measures to Problems of Sequence Structure
5.4. Transitional Organization: Association Techniques
5.5. Channel Capacity in Semantic Decoding
6. Diachronic Psycholinguistics
6.1. First Language [...]
[...]
7.2. Meaning
7.3. Information Transmission by Language Messages
7.4. Language, Cognition, and Culture


IS EINSTELLUNG RIGIDITY A GENERAL TRAIT?

BARBARA PITCHER¹ AND CHALMERS L. STACEY
Syracuse University

¹ Now at the University of Michigan.

It has been suggested that the "rigidity" shown by the Einstellung method, as exemplified in Luchins' (11, 12, 13, 14) water-jar problems, is a "general response characteristic that pervades all aspects of an individual's behavior" (3, p. 165). If this were so, a person who behaves rigidly when trying to solve water-jar problems should also behave rigidly when trying to solve other types of problems, and thus his adjustment to his environment would be affected. Rigid and nonrigid people should differ, then, in the kind of personality traits they develop. Rigidity, defined as the resistance of the Einstellung [...] than the one brought about by the Einstellung, was studied in the present experiment by means of a test of similarities of words, in order to investigate rigidity in solving problems of a verbal, rather than numerical, nature. Ten personality traits, general activity, restraint, ascendance, sociability, emotional stability, objectivity, friendliness, thoughtfulness, personal relations, and masculinity, were measured by the Guilford-Zimmerman Temperament Survey, a standardized personality test (6).

Results from previous studies have not given a clear-cut answer to the problem of the generality of rigidity. In the first place, varying, even contradictory, concepts, such as fixation of response (5), lack of variability (15), perseveration (1, 2), degree of impermeability of boundaries (8, 9, 10), as well as Einstellung, have been equated with rigidity. Some individuals working in this area, for example, Goldstein (5), Werner (20, 21), and Cattell and Tiner (2), also distinguish more than one kind of rigidity.

In general, the various definitions of rigidity seem to be of two types: those that conceive of rigidity as evidenced by repetition within one situation (1, 12, 14, 16) and those that consider it to be a basic trait underlying many or all kinds of behavior (3, 9, 15, 17). As a result of the use of these two concepts, there is the suggestion that the type of behavior classed as rigid in one situation may be due to a general rigidity factor which influences personality development. Investigators testing this hypothesis directly have found contradictory results. Rokeach (17, 18) and Cowen and Thompson (3) conclude that the relationships they found between rigidity and prejudice and between rigidity and personality factors on the Rorschach support the hypothesis of a generalized rigidity factor. In contrast, Kleemeier and Dudek (7) found no generalized rigidity factor by factorial analysis, and Notcutt (16), using a variety of tests purporting to measure rigidity, found no high correlations among them.

The present study attempts to test the hypothesis of the generality of rigidity by examining differences in personality of groups differing in amount of rigidity.


METHOD 


Subjects. About four hundred students in general introductory psychology classes at Syracuse University answered both the rigidity test and the personality test. Three hundred and seventy-three, or those who fell into groups large enough to be studied statistically, were actually used for the final analysis of the data.

Administration and scoring of tests. Both tests were administered to the students by their instructors during their regular class hours.

The rigidity test consisted of 13 groups of three words, with instructions designed to establish a set to answer by stating how the three words were alike functionally.² Items eight and nine were thought to call more obviously for an abstract answer, and thus they would break the set to respond functionally. If the person returned to the functional method for the last four items, he would be considered to have responded rigidly. A copy of this test follows:

"Look at the three 'examples' below. You will see that each example is made up of three words. After each group of three words is a statement which tells you in what way the three words are 'alike' or the 'same.'

(X) Cigar-Cigarette-Pipe of tobacco. 'You can smoke all of them.'
(Y) Tricycle-Bicycle-Motorcycle. 'You can ride on all of them.'
(Z) Chair-Cushion-Berth. 'You can sit on all of them.'

"Below are thirteen groups of words. Each group is made up of three words. After each group write out your statement of how you think the three words in each group are 'alike' or the 'same.'

"Remember you are to state how they are 'alike or the same' in only one way!

(1) Dress-Shoes-Coat
(2) Carrot-Apple-Tomato
(3) Coal-Paper-Wood
(4) Saw-Scissors-Knife
(5) Newspaper-Poem-Novel
(6) Piano-Flute-Violin
(7) Horse-Elephant-Mule
(8) Mars-Jupiter-Earth
(9) Hydrogen-Ammonia-Oxygen
(10) Pencil-Chalk-Pen
(11) Wine-Water-Coca-Cola
(12) Sofa-Bed-Couch
(13) Microscope-Eyeglasses-Telescope"

Three hundred and seventy-three students were thus divided, on the basis of their answers to this verbal similarities test, into seven groups varying from most rigid, Group I, with the greatest number of functional answers, to least rigid, Group VII, with the most abstract answers. The composition of the seven groups can be seen in Table 1.

² According to Gerstein (4) and Stacey and Markin (19), there are three possible ways of responding to such questions, e.g., abstract: an automobile is a vehicle; descriptive: an automobile has wheels; functional: you can ride in an automobile.


TABLE 1
COMPOSITION OF SEVEN GROUPS BASED ON TYPE OF ANSWER TO RIGIDITY TEST

Group   Type of answer                                                        Mean age   Sex and N (Male, Female, N)
I       1-13 functional                                                       19.72      23, 25, 48
II      1-7 functional; 8 or 9 abstract; 10-13 functional                     20.14      8, 13, 21
III     1-7 functional; 8-9 abstract; 10-13 functional                        19.78      52, 83, 135
IV      1-6 functional; 7-9 abstract; 10-13 functional                        19.29      16, 36 [...]
V       1-5 functional; 6-9 abstract; 10-13 functional                        19.15      4 [...]
VI      1-7 either; 8-9 abstract; 10-13 include any one or two as abstract    20.42      36 [...]
VII     1-7 either; 8-9 abstract; 10-13 include any three or four as abstract 20.24      19 [...]
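
The grouping rule in Table 1 can be written out as a small decision procedure. The following is only a sketch under stated assumptions, not the authors' scoring procedure: the one-letter answer codes and the function name `classify` are hypothetical, the Group II row is read as "exactly one of items 8 and 9 abstract," and "either" is read as "functional or abstract." Any pattern the table does not cover returns `None`, which corresponds to the students who fell outside the seven groups.

```python
from typing import Optional, Sequence

FUNCTIONAL, ABSTRACT = "F", "A"  # hypothetical one-letter answer codes


def classify(answers: Sequence[str]) -> Optional[str]:
    """Map 13 coded answers to a group label "I".."VII", or None if no rule fits."""
    if len(answers) != 13:
        raise ValueError("expected one coded answer per test item (13 items)")

    def all_are(code: str, items: Sequence[str]) -> bool:
        return all(a == code for a in items)

    breakers = answers[7:9]  # items 8-9, intended to break the functional set
    last4 = answers[9:]      # items 10-13
    abstract_breakers = sum(a == ABSTRACT for a in breakers)
    abstract_last4 = sum(a == ABSTRACT for a in last4)

    if all_are(FUNCTIONAL, answers):
        return "I"

    if all_are(FUNCTIONAL, last4):
        # Groups II-V: the subject returns to functional answers on items 10-13.
        if all_are(FUNCTIONAL, answers[:7]) and set(breakers) <= {FUNCTIONAL, ABSTRACT}:
            # Assumed reading of Group II: exactly one of items 8 and 9 is abstract.
            return "II" if abstract_breakers == 1 else "III"
        if all_are(FUNCTIONAL, answers[:6]) and all_are(ABSTRACT, answers[6:9]):
            return "IV"
        if all_are(FUNCTIONAL, answers[:5]) and all_are(ABSTRACT, answers[5:9]):
            return "V"
        return None

    # Groups VI-VII: items 8-9 abstract and some abstract answers among items 10-13.
    first7_ok = set(answers[:7]) <= {FUNCTIONAL, ABSTRACT}
    if first7_ok and abstract_breakers == 2:
        if abstract_last4 in (1, 2):
            return "VI"
        if abstract_last4 in (3, 4):
            return "VII"
    return None
```

For example, a student who answers functionally on items 1-7 and 10-13 but abstractly on items 8 and 9 would be coded `["F"] * 7 + ["A", "A"] + ["F"] * 4` and fall in Group III, the largest group in Table 1.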




TABLE 2
MEAN SCORE OF EACH GROUP ON EACH TRAIT AND RESULTS OF ANALYSIS OF VARIANCE

Trait   Group
G       17.41   17.04   17.14
R       17.68   18.10   18.24
A       17.20   17.67   17.19
S       21.85   22.60   21.86
E       17.93   18.13   17.67
O       19.11   19.27   18.38
F       15.58   17.02   15.86
T       19.07   19.06   20.38
P       18.85   18.23   18.29
M       14.97   14.21   13.29

* Significant at .05 level.
** Significant at .01 level.


The Guilford-Zimmerman Temperament Survey was scored by means of the key and manual (6) furnished with the test.

Analysis of personality trait scores. When groups differing in rigidity according to the verbal similarities test had been found, their scores on each of the ten personality traits of the Guilford-Zimmerman Temperament Survey were compared by means of an analysis of variance to determine whether there were differences among the groups varying in rigidity that were significantly greater than the differences within any group.


RESULTS


The results of the analysis of variance can be seen in Table 2.
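
The comparison of between-group and within-group differences described above is what a one-way analysis of variance computes for each trait. The article does not print its computations, so the following is a minimal sketch assuming the conventional one-way F ratio; the function name `one_way_f` and the input layout (one list of scores per group) are illustrative assumptions.

```python
from statistics import mean
from typing import List, Sequence, Tuple


def one_way_f(groups: Sequence[List[float]]) -> Tuple[float, int, int]:
    """Return (F, df_between, df_within) for one trait scored in several groups."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total

    # Variation of the group means around the grand mean, weighted by group size.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of individual scores around their own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

    df_between = k - 1
    df_within = n_total - k
    f_ratio = (ss_between / df_between) / (ss_within / df_within)
    return f_ratio, df_between, df_within
```

With seven groups and 373 students, such an analysis would have 6 and 366 degrees of freedom; a large F indicates that the group means differ by more than the spread of scores within groups would lead one to expect.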

Only two significant differences were found among groups, on trait A (ascendance) and on trait M (masculinity). The difference on trait M can be explained by the fact that the number of males and females in each group was not equated. For trait A, critical ratios were computed for all possible combinations of pairs of groups. The results of this computation showed that a significant difference resulted whenever Group VII was compared with any other group.
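
The pairwise follow-up on trait A can be sketched in the same way. The article does not state the formula it used, so this example assumes the conventional critical ratio: the difference between two group means divided by the standard error of that difference. The names `critical_ratio` and `all_pairs` are hypothetical.

```python
from itertools import combinations
from math import sqrt
from statistics import mean, variance
from typing import Dict, List, Tuple


def critical_ratio(a: List[float], b: List[float]) -> float:
    """Difference of means over the standard error of the difference (assumed form)."""
    standard_error = sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(a) - mean(b)) / standard_error


def all_pairs(groups: Dict[str, List[float]]) -> Dict[Tuple[str, str], float]:
    """Critical ratio for every unordered pair of groups, keyed by the two labels."""
    return {
        (g1, g2): critical_ratio(groups[g1], groups[g2])
        for g1, g2 in combinations(groups, 2)
    }
```

Seven groups give 21 pairs, six of which involve Group VII; those six are the comparisons reported as significant.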

Thus, it was found that, in general, the seven groups did not differ significantly on the personality traits. The one difference, where the more rigid seem to be the more submissive, agrees with the results of Cattell (1), who found that high perseveration, or rigidity, was correlated with the reverse of dominance, or submission.

It was suggested that rigidity in [...] situations may depend on a generalized [...] of submissiveness, rather than a general rigidity factor. Submissiveness itself may further depend on a more fundamental need for approval. The person confronted with an Einstellung form of test may experience a threat to the satisfaction of this need when he suddenly finds a problem that cannot easily be answered by the method he has just previously been using; he tries to continue to use it because it is perceived as "what the experimenter wants," even though it does not readily give a solution to the problem. The giving of oneself over to the instructions of the task so that a strong set to respond in the instructed way is established may be what some psychologists call "ego involvement," as differentiated from "task involvement." If the person is ego involved in the situation, the meaning of that task to him is quite different from the meaning to the task-involved person. The test situation would be more likely to constitute a threat to the ego-involved person, who would then respond rigidly. This would not necessarily mean that such a person would always respond rigidly in all types of situations. The result would depend on the reasons for his becoming ego involved.

Percentages of each group falling in the Guilford-Zimmerman "least favorable" and "most favorable" score categories (6) were also computed. There was no evidence of any tendency for the most rigid group to be the poorly adjusted one.


SUMMARY AND CONCLUSIONS 


The purpose of this*research' was tọ study * 
the relation between the Einsteħurfg type of 
rigidity and certain personality traits.“ The 
hypothesis that this kind of rigidity is an 
aspect of a generalized tendency to respond 
rigidly that would be reflected in other phases 
of personality structure was a basis for this 
study. 

To examine this hypothesis, a test of rigidity, 
consisting of verbal similarities with instruc- 
tions designed to establish a set to respond 
functionally, was answered by 373 students 
in introductory psychology classes at Syracuse 
University. On the basis of their answers to 
this test, these students were divided into 
Seven groups varying in amount of rigidity. 

These same students also answered the 
Guilford-Zimmerman Temperament Survey,
scored for 10 personality traits. A statistical
analysis was then made for each of the 10
traits to determine whether there were differences
among the seven groups. In general, there were
no differences, except on the trait of
ascendance-submission, where rigidity seemed to be
related to submission.

It was concluded that the results obtained 
in the present study did not support the 
hypothesis of a generalized rigidity factor. 
This conclusion, although it is contradictory 
to Cowen and Thompson's (3), nevertheless 
is related to their discussion of results. They 
suggest that rigidity may, to some extent, 
“be deduced from a higher order construct 
in personality organization" and it may, to 
some extent, “be a culturally induced factor" 
(3, p. 175). Ascendance-submission may be a 
higher order construct to which rigidity is 
related in a way that brings about what seem 
to be generalized results in one case, specific 
results in another. Ego involvement, possibly 
related to submissiveness, may be the deter- 
mining factor in whether or not rigid behavior 
is exhibited, when the factors of age and
intelligence are held relatively constant.


REFERENCES 


1. Cattell, R. B. The riddle of perseveration: I.
“Creative effort” and disposition rigidity. II.
Solution in terms of personality structure. J.
Pers., 1946, 14, 229-238; 239-267.

2. Cattell, R. B., & Tiner, L. G. The varieties of
structural rigidity. J. Pers., 1949, 17, 321-341.

3. Cowen, E. L., & Thompson, G. G. Problem solving
rigidity and personality structure. J. abnorm.
soc. Psychol., 1951, 46, 165-176.

4. Gerstein, Reva A. A suggested method for
analyzing and extending the use of Bellevue-Wechsler
vocabulary responses. J. consult.
Psychol., 1949, 13, 366-370.

5. Goldstein, K. Concerning rigidity. Charact. &
Pers., 1943, 11, 209-226.

6. Guilford, J. P., & Zimmerman, W. S. The Guilford-Zimmerman
Temperament Survey, manual of
instructions and interpretations. Beverly Hills,
Calif.: Sheridan Supply Co., 1949.

7. Kleemeier, R. W., & Dudek, F. J. A factorial
investigation of flexibility. Educ. psychol.
Measmt, 1950, 10, 107-118.

8. Kounin, J. S. Experimental studies of rigidity:
I. Measurement of rigidity in normal and feeble-minded
persons. II. Explanatory power of the
concept of rigidity as applied to feeble-mindedness.
Charact. & Pers., 1941, 9, 251-272; 273-282.

9. Kounin, J. S. The meaning of rigidity: a reply to
Heinz Werner. Psychol. Rev., 1948, 55, 157-166.


BARBARA PITCHER AND CHALMERS L. STACEY

10. Lewin, K. A dynamic theory of personality. New
York: McGraw-Hill, 1935.

11. Luchins, A. S. Mechanization in problem solving:
the effect of Einstellung. Psychol. Monogr.,
1942, 54, No. 6 (Whole No. 248).

12. Luchins, A. S. Rigidity and ethnocentrism: a
critique. J. Pers., 1949, 17, 449-466.

13. Luchins, A. S. The Einstellung test of rigidity:
its relation to concreteness of thinking. J. consult.
Psychol., 1951, 15, 303-310.

14. Luchins, A. S. On recent usage of the Einstellung-effect
as a test of rigidity. J. consult. Psychol.,
1951, 15, 89-94.

15. McAndrew, H. Rigidity and isolation: a study of
the deaf and the blind. J. abnorm. soc. Psychol.,
1948, 43, 476-494.

Rigidity and ethnocentrism:
rejoinder. J. Pers., 1949, 17, 467-474.

19. Stacey, C. L., & Marx, K. E. A study of the
differential responses among three groups of
subnormals on the similarities sub-test of the
Wechsler Intelligence Scale. Amer. J. ment.
Def., 1951, 56, 424-428.

20. Werner, H. Abnormal and subnormal rigidity?

21. Werner,
evaluation. Psychol. Rev., 1946, 53, 43-52.
Received September 4, 1952. 




A CASE OF FAILURE OF GENERALIZATION OF IMITATION ACROSS
DRIVES AND ACROSS SITUATIONS¹


RICHARD L. SOLOMON AND MARTIN R. COLES


Harvard University


In the experiment below, white rats were
trained to imitate white rats in a T maze. 
The imitators were motivated by food 
deprivation, and they were rewarded by food 
presentation only when they followed the 
leader. After intensive training, the imitators 
were transferred to a new experimental situa- 
tion where they were required to learn a 
jumping habit in the presence of a new leader. 
In the new situation the imitators were moti- 
vated by pain and fear, and they could solve 
the new imitation problem immediately if 
they followed the new leader. Thus, the experi- 
ment represents an attempt to produce a 
generalized imitative tendency in rats, a 
tendency which will generalize across drives
and across situations. 
In many ways our problem, the transfer
of strongly reinforced imitative tendencies,
is like the task of ascertaining whether or not
the functional autonomy of any motive exists.
We can never approximate the experimental
conditions of completely changed environment
which are required to test for the existence of
such an unconditional acquired motive (imitation,
for example); nor, in fact, can we ever
experimentally approximate a complete change
in the internal environment, a primary drive
shift of dramatic proportions, which would
be required in a test for an autonomous
acquired motive. The experimenter, then,
must always work at the negative approach.
If he can demonstrate the existence of severe
decrements (in some specified type of performance)
that are wrought by changed conditions,
the conception of a self-perpetuating,
unconditional, and acquired tendency becomes
relatively fallible in practice. Our problem,
the limits of generalization of a tendency to
imitate, is typical of many recurring problems
in the study of personality where the question
asked is: how unconditional, how situation-free
is some general behavioral trend?

¹ This research was supported by the Laboratory of
Social Relations, Harvard University.

Miller and Dollard (4), in a series of pioneering
experiments, have very convincingly demonstrated
that the tendency to imitate is acquired
in rats and in children. They have further 
shown that the common operations of rein- 
forcement, such as reward and nonreward, 
are necessary for the establishment of strong 
imitative or nonimitative tendencies. They 
presented their follower rats with a food reward 
only if they turned in the same direction in 
a T maze as the leader rats. These follower 
rats gradually acquired a tendency to turn 
in the same direction as the leader rats. After 
84 trials of training the follower rats were 
imitating the turns of the leader rats on ap- 
proximately 90 per cent of the trials. 

The leader rats during the 84 training trials 
had been albinos. Miller and Dollard then 
tested the followers with black rat leaders, 
and on a series of seven test trials they found 
that the followers imitated the leaders on 
87 per cent of the trials. Thus, generalization 
to the black rats was almost perfect, since the 
level of imitation of the white leaders at the 
end of training was at approximately 90 per 
cent rather than imitative perfection. 

The follower rats had been motivated 
through food deprivation, and the reward for 
imitating the leaders had been food presentation.
Miller and Dollard, wishing to measure
the extent of generalization of imitation to a 
new drive state, then tested the followers 
after they had been deprived of water for 24 
hours. The thirsty follower rats imitated the 
leaders on 92 per cent of these test trials, 
thus exhibiting perfect generalization from 
food deprivation to water deprivation. 

FIG. 1. T-MAZE APPARATUS FOR TRAINING LEADER
AND FOLLOWER RATS

The follower rats were then tested for
imitation in a different experimental environment.
Using an arrangement of platforms,
with leader rats who had been trained to jump
from one platform to another, the follower
rats jumped to the same platforms as did the
leaders in only 39 per cent of the test trials.
However, there were four possible response
alternatives in this test situation; and thus
the tendency to imitate the leader rats was
above a chance level. The imitative tendency
was not as strong as that obtained in the other
types of generalization tests.
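
The comparison with chance in the platform test can be made explicit. The sketch below is illustrative only and is not part of Miller and Dollard's analysis: it assumes the four response alternatives were equally likely under chance, and the number of test trials (`n_trials`) is a hypothetical value, since only the percentage is reported here.

```python
from math import comb


def chance_level(n_alternatives: int) -> float:
    """Expected proportion of matches if the follower chose at random
    among equally likely alternatives (an assumption of this sketch)."""
    return 1.0 / n_alternatives


def binomial_upper_tail(successes: int, n_trials: int, p: float) -> float:
    """P(X >= successes) for X ~ Binomial(n_trials, p)."""
    return sum(
        comb(n_trials, k) * p**k * (1 - p) ** (n_trials - k)
        for k in range(successes, n_trials + 1)
    )


if __name__ == "__main__":
    p0 = chance_level(4)      # four platforms to choose from
    observed = 0.39           # imitation rate reported in the text
    n_trials = 100            # HYPOTHETICAL: trial count is not given
    successes = round(observed * n_trials)
    tail = binomial_upper_tail(successes, n_trials, p0)
    print(f"chance level: {p0:.2f}, observed: {observed:.2f}")
    print(f"P(X >= {successes}) under chance with n={n_trials}: {tail:.4f}")
```

With four alternatives the chance level is 25 per cent, so the reported 39 per cent exceeds chance by 14 percentage points; how reliable that excess is depends on the actual number of test trials.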

There does not seem to be convincing evidence
against the interpretation of results
used by Miller and Dollard to explain gen- 
eralization of imitation from the white leader 
rats to the black ones. They maintain, in
harmony with S-R reinforcement theories, 
that generalization across types of leaders is 
based upon stimulus similarity. There is, 
however, a good reason for questioning the 
interpretation of their data on generalization 
of imitative tendencies across drives. While 
the work of Hull (1), Leeper (2), and, more
recently, Thistlethwaite (6) indicates that
drive discrimination is acquirable in rats with
special training, the work of Verplanck (7),
Stellar (5), and McCleary (3) indicates that 
it is extremely difficult, if not impossible, to 
deprive rats of food without inducing water 
deprivation, and even more difficult to deprive 
rats of water without inducing food depriva- 
tion. So it is possible that the drive states of 
the follower rats were not radically different 
during the thirst generalization test from what 
they had been during the training sessions; 
whether there is a strong tendency in well-trained
imitative rats to transfer imitation
across drives that are quite discriminably
different is possibly an open question at
present.

METHOD 


In the experiment below we shall measure the trans- 
fer of acquired imitative tendencies in rats across 
changed drives and changed situations, with the type 
of leader held constant. We shall alter the drives and 
situations as radically as we can, in order to meet as 
closely as possible the ideal requirements for demon- 
strating the presence of a generalized, acquired tend-
ency to imitate.

Apparatus. The apparatus for the initial imitation
section of the experiment was an elevated T maze, 32
in. high (see Fig. 1). The stem of the T (A), from the
foot to the choice point, was a path 49 in. long and 1.5
in. wide, with a wire mesh covering the wood of the
maze. The two arms of the T (B, B') were a wire-mesh
path similar to that of the foot, each 54.5 in. long from
the choice point to the end, and 1.5 in. wide. At a distance
of 37.5 in. from the choice point on each arm was
the first sunken food cup (C, C'), about 1.5 in. deep;
it was covered by a metal lid to which a string was
attached, thus enabling the leader rat to run directly
over to the next cup, whereas the cover could be lifted
at the followers' approach. The leaders' food cup (D,
D'), identical to the followers', was situated 13.75 in.
from the followers' on each arm. Just in front of the
followers' cup, and likewise in front of the leaders' cup,
there were affixed, perpendicular to the maze and on
both sides, wooden slits into which a metal door could
fall in such a way that the leader and follower rats were
separated while eating, and the follower rat was prevented
from retracing on the maze (E, E', F, F').
These doors, like the lid covers of both follower and
leader cups, were operated by a pulley system.

The apparatus for the second phase of the experiment,
testing generalization of imitative responses,
was a wooden box (Fig. 2) 14 in. wide, 16.5 in. long, and
78.5 in. tall, divided into four equal sections (A, B, C,
D) by glass partitions longitudinally (E, E'), and laterally
by two wooden partitions which could be lifted
up by the experimenter (F, F'). The bottom of the box
was an electric grid (G) of 28 wires, each .5 in. apart,
running lengthwise from end to end of the box. Half
of each grid was controlled by a separate switch, thus
enabling the experimenter (E) to shock the animal on
one side, without giving a similar shock should the
animal jump to the other side of the grid. Separating
and delineating the two sides was a thin wooden barrier
(H, H') about 3$ in. wide and 2.75 in. tall over
which the animals had to jump to reach the area of
nonshock. Thus two animals could be put in two chambers
beside each other and separated by glass on one
half of the grid; and, when shocked, and the two wooden
doors opened, both animals could jump over the
barrier to the two separate and similar chambers on the
other side of the grid. Then, after a stated interval of
time, they could be shocked again and forced to jump
back to the original side. The entire box rested on an
elevated table, 24 in. wide and 20 in. long. On the table,
below the grid floor and directly underneath it, was an
electric buzzer which emitted a suitable sound and
could be controlled by a switch. The ceilings of the
compartments were of glass, so that the animals could
be put into either side by pulling out
the respective sliding glass top and reinserting it for
the experimental period.

Procedure. Twenty female albino rats, 90 days old, 
were gentled. Each rat was kept in a separate cage with
a constant supply of water and was fed once a day to
satiation with wet mash. After five days of such a rou- 
tine, each rat was tested on the maze for any initial 
preference for turning left or right. Two trials per day 
for four days were given to each rat. Six rats were 
selected as leader trainees, three left-turning and three 
right-turning. 

The six animals selected as leaders were kept on a 
24-hour feeding schedule, while the others were now 
allowed a nearly full whiskey glass of wet mash in their 
cages. The potential leaders were run three times 
a day at first, then five, and finally eight times per day. 
If a future left-turning leader turned correctly, the lid
of the leader's food cup was lifted and the animal was 
allowed to eat for 20 seconds, and then was returned to 
its cage. If the animal turned to the right, it received 
no food, and was removed from the maze. As soon as 
the animal arrived at the leader's food cup, on either 
side, the gate behind him was closed to prevent re- 
tracing. They were always started in the same spot, a 
few inches from the base of the stem, facing away from 
the choice point, and their first response usually was 
to turn directly about and go down the path to the 
choice point. The rats were run seven days per week, 
at the same time each day until they had achieved 40 
successive errorless trials, so they were reliable leaders. 

Five more rats were gentled, and were habituated to 
an elevated runway on which they were fed. They were
always accompanied by another rat during the habitua- 
tion. The imitation training procedure was next started. 
The hungry leader and follower were put on the T maze 
at the same time, the leader nearer the choice point 
than the follower. The gate separating the leader from 
the follower’s cup was kept slightly open, so that if and
when the follower rat reached the food cup it might
see and smell the leader. The leaders were allowed to
eat until the followers had arrived at their food cup
and had started eating; and later on, as the followers
grew more proficient, both were allowed to eat for the
same 20 seconds. The follower rats that turned in the
same direction as the leader were rewarded with food 
and allowed in this initial phase to eat for about one 
to two minutes. This time limit was slowly reduced,
over a period of a week, to 20 seconds. When the 
imitators reached the food cups, the second door was 
lowered to prevent retracing. If the rats turned in the 
wrong direction, they received no food, but they were 
allowed to wander along the maze until they came to 
the other side, whereupon, as soon as they reached the 
cup, the door was let down, and the food-cup cover 
pulled up. If the rats, in the very beginning, did not 
come to the choice point within two minutes after the 
leader had run to its food, the leader was brought back 
and the trial attempted again. If the imitator rat remained
at the choice point for more than three minutes,
which occasionally happened at first, it was likewise
brought back and run again with a leader rat. 

FIG. 2. AVOIDANCE CONDITIONING APPARATUS FOR
THE TESTING OF GENERALIZATION
OF IMITATION

The number of trials per day ranged from one, at
the beginning, to as many as ten. They were given at
the same time every day, seven days a week for eight
successive weeks. During the last few weeks, when
imitation was up to 95 to 100 per cent, the rats were 
kept on a five-trial-per-day training regimen. The 
leaders, either left- or right-turning, were chosen each 
day in a carefully systematized manner so as to elimi- 
nate completely any chance that the follower rats 
could learn any given order of turning. Every possible 
combination of left- and right-turning leaders was used. 
When the imitators had reached a level of over 95 
per cent correct responses and were dependable, the 
third and crucial phase of the experiment was begun. 
Two new rats were selected for training as leaders in 
a new situation. The aim was to teach them shock avoid- 
ance to a one-second latency criterion. The rat was 
put in one of the sections of the gridbox, and, after 
two minutes, the buzzer was sounded and simultaneously
the wooden partition was lifted so as to allow
the rat to jump over the barrier into the opposite 
chamber. If the rat did not jump within three seconds, 
it was shocked by a standardized electric current. If it 
jumped to the other side, it received no shock. If it 
remained on the barrier, the door was pushed upon it, 
forcing it to jump off to either side. The buzzer was 
stopped when the rat had jumped to the other side. 
After 60 seconds, the buzzer was started again and the 
shock applied three seconds later to the side where the 
rat was now situated; the buzzer was stopped when the 
rat, usually after considerable jumping around the grid 
for varying periods of time, reached the other side. As 
soon as the rat reached the other side, the door was 
lowered. Each of these two rats was given 15 trials per 
day, followed by another two-minute period in the box 
before being removed to its cage. As shock-avoidance
learning progressed, the time between buzzer and shock
was cut to two seconds, and finally, to 1.5 seconds. The 
rats were thus at a level of shock avoidance charac- 
terized by almost “immediate” jumping to the other 
side of the grid at the sound of the buzzer.

FIG. 3. LEARNING CURVE FOR THE ACQUISITION OF
IMITATION BY FOLLOWERS IN THE T MAZE
(vertical axis: per cent imitation; horizontal axis: trials of training)

At this stage, one of the five imitator rats, which
had been fed to satiation, was put on the same side of
the grid as a shock-avoidance leader animal, separated
from it by the glass partition. The imitator was given 
every opportunity to see the shock-avoidance animal.
There was a small space between the bottom of the 
glass partition and the grid, so that the two rats often 
sniffed at each other. After two minutes the buzzer 
was sounded, and three seconds later the shock was 
put on. The shock-avoidance animal jumped over 
within one second, and was visible to the imitator 
through the glass section. The buzzer and shock continued
until the imitator rat jumped to the other,
"safe" side of the grid. After a 60-second interval, the
same procedure began again. Each of the five T-maze
imitator rats, and four T-maze leader rats,² were given
10 trials per day, for 12 successive days. The time from 
the initial sounding of the buzzer to the moment when 
the follower or leader rat jumped completely over to 
the other side of the grid was recorded at each trial by
a stop watch. The doors were lowered successively and
immediately as the shock-avoidance and imitator or 
leader rats jumped over. Since the shock-avoidance 
leader animals often heard the buzzer for a consider- 
able period of time after they had jumped, there was a 
tendency after 10 or 11 trials toward a progressive 
lengthening of their latency period in jumping. This was 
counteracted by "refreshing" the shock-avoidance 
leader animals, before a leader was put in the box. The 
fact that there were two shock-avoidance leader animals 
also enabled E to switch leader animals when they were 
fatigued. The T-maze imitators and T-maze leaders 
were both run as shock-avoidance followers. The T- 
maze imitator rats were kept running on the T maze, 
five trials per day, during the 12-day period of shock- 
avoidance imitation training. 
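
The scoring implied by this procedure can be sketched as follows. This is a minimal illustration, not the authors' scoring routine: the function names are hypothetical, a trial is assumed to count as an avoidance response when the jump latency is shorter than the buzzer-to-shock interval (three seconds for the followers), and the trial on which the tenth consecutive avoidance occurs is assumed to be the one counted as meeting the criterion.

```python
from typing import Optional, Sequence


def is_avoidance(latency_s: float, buzzer_to_shock_s: float = 3.0) -> bool:
    """A jump made before shock onset counts as avoidance (assumption)."""
    return latency_s < buzzer_to_shock_s


def first_avoidance_trial(
    latencies: Sequence[float], buzzer_to_shock_s: float = 3.0
) -> Optional[int]:
    """1-based index of the first avoidance trial, or None if there is none."""
    for trial, latency in enumerate(latencies, start=1):
        if is_avoidance(latency, buzzer_to_shock_s):
            return trial
    return None


def trials_to_criterion(
    latencies: Sequence[float],
    run_length: int = 10,
    buzzer_to_shock_s: float = 3.0,
) -> Optional[int]:
    """1-based index of the trial completing `run_length` consecutive
    avoidance responses, or None if the criterion is never met."""
    run = 0
    for trial, latency in enumerate(latencies, start=1):
        run = run + 1 if is_avoidance(latency, buzzer_to_shock_s) else 0
        if run == run_length:
            return trial
    return None


if __name__ == "__main__":
    # HYPOTHETICAL latencies in seconds, invented for illustration only.
    latencies = [5.0, 4.2, 2.5] + [1.0] * 10
    print(first_avoidance_trial(latencies))
    print(trials_to_criterion(latencies))
```

Whether the criterion run itself is included in a "trials to criterion" count is a scoring convention that the text does not state; the sketch includes it.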

At the conclusion of this period, at which time all
animals were avoiding shock from 9 to 10
times out of 10 trials, seven control animals were run
as shock-avoidance followers. The same procedure was
followed with this group for a similar period of 12
successive days. These rats had been handled very infrequently,
and had no previous T-maze experience.

² Two T-maze leaders were dropped from the experiment
for reasons of poor health.

On the supposition that latency figures in an extinction
process would yield some pertinent data, extinction
was also started for all shock-avoidance follower
rats. The procedure was the same as before,
only there was no shock at all. The buzzer would start
and continue until the rat jumped to the other side.
The latency was recorded each time. The same 10
trials per day were given. The shock-avoidance leader
animals were given “refresher” shock periods occasionally
to ensure their continued immediate jumping.
Extinction of shock-avoidance learning is always extremely
difficult and requires many weeks; since inspection
of the data showed no difference between
leader-follower latency during the period, and since
extinction itself would have required many more
weeks of experimentation, the procedure was terminated
after eight days.

In summary, then, five rats were trained to imitate
leader rats on an elevated T maze. Then, after achieving
a high degree of proficiency in imitation, the imitators
(followers) were transferred to a new situation in
which they were trained in shock avoidance. In the second
situation the imitators were paired with new leader
rats who had already been trained to avoid shock by
responding to a buzzer in less than 1.5 seconds. The new
leader rats were visible to the follower rats, and if the
follower rats had imitated them in the new situation,
the followers would have been able to avoid shock at
the outset. As controls for experimental handling and
for the effects of being run with another rat, two other
groups were trained in the shock-avoidance situation
in the same manner as were the T-maze followers. One
group consisted of the T-maze leaders, the other was
composed of experimentally naive rats.


RESULTS


T-maze imitation. It is interesting to note
that the followers started out with a marked
nonimitative bias. During the first 20 imitation
training trials, the followers imitated the
leaders on only 40 per cent of the trials. The
course of acquisition of imitation in the T
maze is plotted in Fig. 3. The final level of
imitative proficiency was approximately 97
per cent. Our rats acquired the habit more
slowly than did those of Miller and Dollard.
When their rats had received 84 trials, they
were approximately 90 per cent proficient;
our rats were only about 80 per cent proficient
at this stage of training. Our rats were given
about four times as many training trials as
those of Miller and Dollard, and they achieved
a higher steady level of imitative excellence.
It is interesting to note that the leader rats,
who were trained in the same maze but did
not have to learn to imitate, acquired a stable
position response (100 per cent responses to
the reinforced side) in approximately 50
trials. Apparently it is easier for rats to learn
an absolute discriminative response than an
imitative discrimination. The latter is a
conditional discrimination rather than an
absolute one, and it is therefore more difficult.
The establishment of imitation confirms
Miller and Dollard’s findings.


TABLE 1

TRIALS REQUIRED TO PRODUCE THE FIRST AVOIDANCE RESPONSE AND THE NUMBER OF TRIALS
REQUIRED TO MEET A CRITERION OF TEN AVOIDANCE RESPONSES IN TEN TRIALS, FOR THE T-MAZE IMITATORS
AND THE CONTROLS

(FIRST = trials to first avoidance response; CRITERION = trials to reach the 10/10 criterion)

T-MAZE IMITATORS              T-MAZE LEADERS (CONTROLS)     EXPERIMENTALLY NAIVE CONTROLS
RAT      FIRST   CRITERION    RAT      FIRST   CRITERION    RAT      FIRST   CRITERION
#1          15          78    #10          7         110    #20         15          37
#2          15          64    #11          3          70    #21          3         103
#3           9          82    #12          3          53    #22         12          83
#4           2          75    #13         18          51    #23          5          50
#5          15          45                                  #24         21         120
                                                            #25          9         109
                                                            #26         10         120
Mean      11.2        68.8               7.8        71.0              10.7        88.9

Shock-avoidance imitation. Table 1 reveals
clearly that cross-drive, cross-situation generalization
of imitation did not occur to a
significant extent. In trials to achieve the first
avoidance response, the T-maze imitators
did not significantly benefit from their prior
imitation training. The same is true when we
examine the number of trials required to reach
a criterion of 10 avoidance responses in 10
trials. An analysis of latency of response to the
buzzer was made, and there was no significant
difference between the T-maze imitators and
the two control groups.
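
The group means in Table 1 can be recomputed from the individual scores. The sketch below is illustrative and makes two assumptions that are not in the source: the criterion score for imitator #1, which is illegible in this copy, is taken as 78 (the value implied by the printed mean of 68.8), and the exact permutation test is simply one convenient way to compare small groups, not the analysis the authors report.

```python
from itertools import combinations

# Scores transcribed from Table 1 (trials to first avoidance response).
FIRST_AVOIDANCE = {
    "imitators": [15, 15, 9, 2, 15],
    "leaders": [7, 3, 3, 18],
    "naive": [15, 3, 12, 5, 21, 9, 10],
}

# Trials to reach the 10/10 criterion. The first imitator value (78) is an
# ASSUMPTION inferred from the printed group mean of 68.8.
CRITERION = {
    "imitators": [78, 64, 82, 75, 45],
    "leaders": [110, 70, 53, 51],
    "naive": [37, 103, 83, 50, 120, 109, 120],
}


def group_means(scores):
    """Mean of each group, rounded to one decimal as in the table."""
    return {group: round(sum(v) / len(v), 1) for group, v in scores.items()}


def permutation_p_value(treated, controls):
    """Two-sided exact permutation p-value for a difference in means."""
    pooled = list(treated) + list(controls)
    n_treated, n_controls = len(treated), len(controls)
    total = sum(pooled)
    observed = abs(sum(treated) / n_treated - sum(controls) / n_controls)
    hits = count = 0
    for idx in combinations(range(len(pooled)), n_treated):
        s = sum(pooled[i] for i in idx)
        diff = abs(s / n_treated - (total - s) / n_controls)
        count += 1
        if diff >= observed - 1e-12:  # tolerance for float comparison
            hits += 1
    return hits / count


if __name__ == "__main__":
    print(group_means(FIRST_AVOIDANCE))
    print(group_means(CRITERION))
    controls = CRITERION["leaders"] + CRITERION["naive"]
    print(permutation_p_value(CRITERION["imitators"], controls))
```

Pooling the two control groups is a simplification made here for brevity; each control group could equally be compared with the imitators separately.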

T-maze imitation on the days of shock-avoidance
training. It will be remembered that
the shock-avoidance training was introduced
after the T-maze imitators or followers had
reached a stable and reliable level of imitative
performance. The T-maze training was continued
during the days when shock-avoidance
training was taking place. It is noteworthy
that the level of imitation on the T maze
did not change during the 12 days of shock-avoidance
training. Had there been any
generalization of a tendency to imitate, one
might have expected that in the initial “escape”
phase of avoidance training, when any generalized
tendency to imitate would have been
associated with shock on every escape trial,
there might have been a subsequent decrease
in the level of imitation in the T-maze situation.
This was not the case.

Extinction of shock-avoidance responses.
The latencies of response to the buzzer during 
the extinction trials revealed no difference 
between the three groups of rats. There was 
no change in the level of imitation on the 
T maze during the eight days of extinction 
trials in the shock-avoidance situation. 


DISCUSSION 


With regard to the demonstration of a 
relatively autonomous, acquired tendency to 
imitate, the results in this experiment were 
clearly negative. There was no transfer across 
very different drives and situations, keeping 
the characteristics of the leaders constant. 
This experiment suggests that a systematic 
variation of drives, situations, and attributes 
of leaders is now required in order to find 
out the conditions under which imitative 
tendencies generalize most easily, least easily, 
or never in white rats. Such a systematic 
study would probably reveal some significant
dimensions of variation that we are now over-
looking.

The failure to Obtain generalization o 
imitation in this experiment can be attributed 
to many different hypothetical events. In 
the first place, it could be argued that the 


imitator rats in the shock-avoidance situation « 


e 


9 ! iJ 
j ) 


> ; g 
12 RICHARD L. SOLOMON AND Martin R. Cones 


» J 2 
did not see, hear, or smell the activity of the different attributes of hunger and fear would 
leader. This is unlikely, since the small opening enter in to determine the results of such an 
under the glass panel facilitated contact be- experiment. Such an analysis predicts a high 
tween follower and leader; then, too, the ex-, level of generalization of imitation when the 
perimenters on several occasions noticed the initial imitation training is accomplished on 
follower rat quickly orienting in the direction the basis of fear. When the sequence used is 


of the leader when the latter was in the process 
of jumping to the safe compartment. In other 
words, the followers seemed to “pay attention 
to" the leaders in the shock-avoidance situa- 
tion. 

It is interesting to analyze the possible 
important factors which probably enter into 
the generalization of imitation, in order to 
account for our negative results in theoretical 
terms. The leader rats in the T-maze imitation 
situation most likely acquire secondary rein- 
forcing properties for the followers. But, it is 
also possible to conceive of the leaders as 
being stimuli which have acquired drive 
properties. It is possible that in the shock- 
avoidance situation the fear drive is incom- 
patible with drives like hunger or appetite, 
and since the imitative behavior of the fol- 


like the one in our experiment, it is possible 
that imitation is impossible to detect and 
measure; competing reactions, based on fear, 
make it impossible. 

It would seem that the problem of generali- 
zation of imitation is identical with the 
traditional problem of transfer of any training, 
just as it is also identical with the problem of 
functional autonomy of motives. Owing to 


the initial training procedure there will be a 


social stimulus (the leader), and this stimulus 
will come to control certain instrumental acts 
due to the effects of reward. This stimulus 
may also tend to acquire rewarding or driving 
properties of its own; but it must be admitted 


that these aspects of learning have been too - 
poorly worked out as yet to enable us to pre- - 


dict very well. In the training situation some 


lowers has been established in the presence of sort of motivation state will be present which 
hunger and food, a sharply contrasting situa- «may be conceived to have stimulus properties 
tion is present. In terms of response tendencies, in addition to the features of the experimental 
it might be argued that the fear engendered environment. When the test for generalization 
in the shock-avoidance situation tends to of imitation is introduced (the test for transfer 


elicit reactions in the followers which are 
incompatible with following the leader. 
Freezing and crouching were sometimes 
observed in the followers. 

Then, too, the order of presentation of situa- 
tions may contribute to the outcome of this 
experiment. Consider the hypothetical case 
of establishing a tendency to imitate on the 
basis of shock avoidance and then testing for ` 
transfer of this training to a T-maze situation 
with hunger drive present and with a food 
incentive. In such a situation the followers 
would quickly learn that when they were near 
the leaders they were safe, and they might 
exhibit fear reactions when the leaders were 
at some distance. Thus, in the T maze, fol- 
lowers with such training might very well 
imitate the leaders, but tls would be con- 
trolled largely by the presence of the fear drive. 
Even though the experinzenter had ostensibly 
Set up a hunger situation, the fear drive might 
be elicited when the leader rat in the new 
T-maze situation increased its distance from 
^ <thé>follower. It is clear that the basically 


2 


of imitation training), some aspects of the 
original situation are changed,in a controlled 
fashion. 2 i 

If the attributes of the leader are changed in a situation such as that
described by Miller and Dollard (4), then we obtain very good transfer. But
one might guess that if the original imitation training had been
accomplished on the basis of fear, generalization might have been greater.
The followers might strive to stay near any object bearing resemblance to
the one which had been associated with safety in the initial training. When
the drive is changed in the transfer test, the outcome should depend on the
response hierarchy
which innately exists in the presence of the 
new drive. If the drive tends to elicit responses 
which are compatible with the old imitative 
responses, transfer should be good; but if the 
responses elicited by the drive in the new 


situation are incompatible with the old imitative responses, then transfer
should not be detectable. In general, when incompatibility exists there will
be a tendency in favor of the responses controlled by the stronger or more
prepotent drive. This might account for the 
negative results of our experiment, since the 
fear reactions in the transfer situation might 
have interfered with a learned tendency to 
imitate in the presence of a leader. When the 
features of the experimental environment are
changed, the follower will have problems of 
orientation, but if the leader is the same and 
the drive is the same, transfer should be good 
if the new environmental features allow the 
old imitative responses to occur. This was not 
the case in Miller and Dollard’s experiment,
where transfer across situations involved 
jumping responses in contrast to the running 
and walking responses which were required 
in the initial training situation. In our experi- 
ment the new situation also required a response 
pattern change, and therefore probably 
hindered good generalization of imitation. 


SUMMARY 


Albino rats were trained to imitate leaders
in a T-maze situation, and training was con- 
tinued until the followers were 97 per cent 
imitative. They were trained hungry, with 
food incentive. Then they were transferred 
to a shock-avoidance training situation, where 
imitation of leader rats was not only possible 
but also would have been a means to shock 
avoidance. However, the T-maze imitator rats did not show a course of
acquisition of shock-avoidance responses which differed significantly from
that of controls who had not had prior imitator training. The results are
discussed as an example of transfer of training in which the following
factors play a significant role: social stimulus pattern,
drive, environmental features, drive-related
response hierarchies, and instrumental re- 
sponses required by environmental features. 
The experiment served to highlight the diffi- 
culties in a practical test for the existence of 
any postulated functionally autonomous, ac- 
quired behavioral tendency. 


REFERENCES 


1. Hull, C. L. Differential habituation to internal
stimuli in the albino rat. J. comp. Psychol., 1933,
16, 255-273.

2. Leeper, R. The role of motivation in learning: a
study of the phenomenon of differential motivational
control of the utilization of habits. J.
genet. Psychol., 1935, 46, 3-40.

3. McCreary, R. A. Some physiological factors underlying
the specific hunger for sugar. Amer. Psychologist,
1951, 6, 288. (Abstract)

4. Miller, N. E., & Dollard, J. Social learning and
imitation. New Haven: Yale Univer. Press, 1947.

5. Stellar, E. Gastric factors controlling water and
salt solution drinking. Amer. Psychologist, 1951,
6, 288. (Abstract)

6. Thistlethwaite, D. Drive-discrimination and food
location as factors in irrelevant incentive learning.
Amer. Psychologist, 1951, 6, 287. (Abstract)

7. Verplanck, W. S., & Hayes, J. R. Some relationships
between “hunger” and “thirst,” and “eating”
and “drinking.” Amer. Psychologist, 1951,
6, 288. (Abstract)


Received September 11, 1952. 


A TEST OF PIAGET'S THEORIES OF MORAL DEVELOPMENT¹


DUNCAN MacRAE, Jr.
Department of Sociology and Social Institutions, University of California, Berkeley


The theories of children's moral development put forward by Piaget in The
Moral Judgment of the Child (8) occupy an important place in the literature
dealing with socialization. The idea that a child's morality changes as he
grows older, from strict and specific moral rules deriving force from
parental authority to more general principles supported by groups of equals,
has gained wide acceptance. The data that Piaget advanced, showing
systematic differences with age in interview responses, appeared to support
this idea. Lerner (4) showed that the same differences among age groups
existed in the United States. He also studied the relation between social
status and moral judgment, and connected the child's developing morality
with an increase in ability to see social situations from the viewpoints of
others (5). Both Piaget's theories and Lerner's may be subjected to
additional tests.

We shall treat two main hypotheses involved in Piaget's and Lerner's work:
(a) that the questions used involve a single underlying dimension or entity
other than the child's age; (b) that this dimension of moral judgment is
associated with the type of authority relations to which a child is
subjected.

¹ [...] was read at the meeting [...] Association at State College, Pa.,
September, 19[..]. Detailed information as to [...] may be found in the
original thesis (6). For a copy of the questionnaire used, order Document
3915 from ADI Auxiliary Publications Project, Photoduplication Service,
Library of Congress, Washington 25, D. C., remitting in advance $1.25 for
microfilm or $1.25 for photoprints. Make checks payable to Chief,
Photoduplication Service, Library of Congress.


PROCEDURE

Data on children's moral judgment were gathered by means of questions
similar to those of Piaget and Lerner. Each child was also asked questions
intended to reveal his authority relations with his parents and his
relations with his peers.

The sample consisted of 244 boys aged 5 to 14. Contacts were made in camps,
schools, and playgrounds in three Eastern states during the summer of 1949.
A questionnaire form was used as both an interview schedule for younger
children and a written questionnaire for those who could fill it out.
Al[...] half an hour was devoted to moral-judgment questions and half an
hour to questions about the child’s f[...] activities and peer-group
relations. The moral-judgment questions were stories like the following:

4. Here are stories of what two boys did. Which do
you think was worse?

A. Tom thought it would be fun to play with his
father’s inkwell while he was out. First he played with
the pen, and then he made a little blot on the
tablecloth.

B. Joe noticed his father's inkwell was empty. One
day when his father was away, he thought of filling the
inkwell to help his father, so that his father would find
it full when he came home. But while he was opening
the ink bottle he made a big blot on the tablecloth.

Which of the two things that the boys did do you
think was worse?

1 __ The first (A)

2 __ The second (B)

Why?

Most of these questions were taken from Piaget (8)
or Lerner (5), with minor changes.² Another set of
questions, added by the writer, will be discussed later.


HYPOTHESIS I: UNIDIMENSIONALITY


Within our sample the variation of answers with age was in the same
direction as found by Piaget and Lerner and was in general accord with the
comparable data they cited (8, pp. 251, 277; 5, pp. 255, 259, 265). This
indicates that the phenomena under observation are relatively stable. For
each question the answer given by older boys (the “mature” answer) was
considered positive for scoring purposes. If the answer to each question
varies systematically with the child's age, however, the answers to any pair
of questions can be expected to show association due to this fact alone. We
proceed, therefore, to control age statistically and examine the
associations that remain.

² The page references corresponding to the questions [...] are as follows
(reference 8): question 2, pp. 145-146, IV; 4, p. 118, II; 16, p. 118, III;
18, p. 276, IV; 20, p. 201, III; 12, p. 201, II; 22, p. 251, III; 13, p.
[...] (modified). Those in reference (5) are: questions 5 and 15, p. 259;
21, p. 262; 8, pp. 251-252.

The technique used is to compute the joint positive frequency that would
have obtained for each pair of questions had there been no association
between them within each two-year age interval. This is compared with the
observed joint frequency and the significance of the difference tested. The
"expected" joint positive frequency is given by


$$ n_{ij} = \sum_{k=1}^{5} n_k \, p_{ki} \, p_{kj}, $$

where $n_{ij}$ = expected number of boys in sample who gave "mature" answers
to both question i and question j; $n_k$ = number of boys in kth age group
(e.g., $n_1$ = 14 if there are 14 boys in the first age group, aged 5-6);
$p_{ki}$ = proportion of boys in kth age group giving “mature" answers to
question i; $p_{kj}$ = proportion of boys in kth age group giving "mature"
answers to question j.
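
As a rough illustration of the computation just defined, the following Python sketch evaluates the expected joint positive frequency from per-age-group counts and proportions. The function name and the sample figures are hypothetical and are not taken from the study; only the formula comes from the text.

```python
def expected_joint_positive(group_sizes, prop_i, prop_j):
    """Expected number of boys giving 'mature' answers to both questions
    if the two answers were independent within every age group."""
    if not (len(group_sizes) == len(prop_i) == len(prop_j)):
        raise ValueError("one entry per age group is required")
    # Sum over age groups of n_k * p_ki * p_kj.
    return sum(n * pi * pj for n, pi, pj in zip(group_sizes, prop_i, prop_j))


# Hypothetical figures for five two-year age groups (illustration only).
n_k = [14, 50, 80, 60, 40]          # boys per age group
p_ki = [0.2, 0.4, 0.5, 0.7, 0.9]    # proportion "mature" on question i
p_kj = [0.1, 0.3, 0.5, 0.6, 0.8]    # proportion "mature" on question j

expected = expected_joint_positive(n_k, p_ki, p_kj)
print(round(expected, 2))
```

Because the products are formed inside each age group before summing, any association that arises only because both answers change with age is already built into the expected value.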

The “expected” and observed frequencies may be compared by means of a t
test. The variance of their difference is the sum of the variances of the
differences within the five age groups. This sum can be approximated
conservatively by the expression

$$ \sigma^2 = N \, p_i \, q_i \, p_j \, q_j, $$

where $N$ = total number of cases in entire sample; $p_i$ = fraction of
"mature" answers to question i in entire sample; $p_j$ = fraction of
“mature” answers to question j in entire sample; $q_i = 1 - p_i$;
$q_j = 1 - p_j$.
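
The comparison described above can be sketched in Python as follows. This is a minimal sketch, assuming the test ratio is the observed minus the expected joint frequency divided by the square root of the conservative variance; the function names and the numbers are hypothetical and do not come from the study.

```python
import math


def conservative_variance(n_total, p_i, p_j):
    """Conservative variance approximation N * p_i * q_i * p_j * q_j."""
    return n_total * p_i * (1.0 - p_i) * p_j * (1.0 - p_j)


def association_ratio(observed, expected, n_total, p_i, p_j):
    """(observed - expected) joint frequency in standard-error units."""
    return (observed - expected) / math.sqrt(
        conservative_variance(n_total, p_i, p_j)
    )


# Hypothetical figures (illustration only).
variance = conservative_variance(244, 0.5, 0.5)
ratio = association_ratio(observed=70, expected=62.0,
                          n_total=244, p_i=0.5, p_j=0.5)
print(variance, round(ratio, 3))
```

A positive ratio corresponds to a positive association between "mature" answers to the two questions with age held constant, and a negative ratio to a negative one.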



The results of this test of significance are shown in Table 1. From the
table there appears to be no general factor underlying all the questions.
Rather, there appear to be several subclusters, each representing a separate
aspect of moral development. We conclude, therefore, that these data do not
support Piaget’s and Lerner’s implicit assumption that there is a factor
other than age common to all these questions.


CLUSTERS OF MORAL-JUDGMENT QUESTIONS


The subclusters of questions appearing in 
Table 1 may be interpreted as follows: 
Questions 2, 4, 16, 18. The first three of these 
questions concern the evaluation of acts in 
terms of either the actor’s intentions or the 
actual consequences of the act. The fourth is 
concerned with the rightness of parental commands that treat children
unequally. A possible common factor is aggression or disobedience by
children against adults.
Questions 20, 12, 22, 13. The first two of these questions concern the
attitude that strict punishment is “right.” The last two concern the
expectation of “immanent justice”: “the existence of automatic punishments
which emanate from things themselves” (8, p. 250).
Questions 5, 15, 21. These questions were used by Lerner in his study of
perspective (5). The chief factor common to all three may, however, be the
likelihood of saying, “I don’t know,” when limited information is given.


TABLE 1
ASSOCIATIONS BETWEEN PIAGET-TYPE AND LERNER-TYPE MORAL-JUDGMENT QUESTIONS,
WITH AGE CONTROLLED*

Entries are significance levels of association; rows and columns are
question numbers.† [?] = entry illegible.

Source and cluster     Question   2     4     16    18    20    12    22    13    5     15    21
Piaget:                2
Intentions-            4          .02+
Consequences           16         .20+  [?]
                       18         .01+  .05+  .05+
Piaget:                20         x     x     x     .10+
Punishment             12         x     x     x     x     .01+
                       22         x     x     x     x     .05+  .02+
                       13         .20+  x     .20−  x     x     .10+  [?]
Lerner:                5          x     x     x     x     x     .10+  x     .01+
Perspective            15         x     x     x     x     x     x     x     x     .01+
                       21         x     x     x     x     x     x     x     x     .02+  .02+
                       8          x     x     x     x     x     x     x     [?]   [?]   [?]   x

* Significance levels are indicated as follows: x indicates p > .20; .20,
.20 ≥ p > .10; .10, .10 ≥ p > .05; .05, .05 ≥ p > .02; .02, .02 ≥ p > .01;
.01, p < .01. Signs following significance levels indicate whether a
“mature” answer to one question is associated positively (+) or
negatively (−) with a “mature” answer to the other.
† See footnote 2 in text.

A fourth cluster of questions, to be discussed
below, is: 

Questions 3, 7, 10, 14. These questions 
are concerned with violation of norms such 
as those regarding lying or stealing, when they 
conflict with other influences such as that of 
friendship. 

Because of the separateness of these clusters, 
it seems hardly useful to construct an index 
from the whole set of questions, such as 
Lerner's “moral realism index" (4, ch. X). 
Rather, we shall construct separate indices 
for the four clusters and use these in further 
analysis. The resulting indices will be called, 
respectively, (a) Intentions-consequences, (b) 
Punishment, (c) Perspective, and (d) Violation 
of norms. The intercorrelations of these indices, 
their correlations with age of child, and their 
partial intercorrelations with age controlled, 
are shown in Table 2. 
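
For readers unfamiliar with partial correlation, the Python sketch below shows the usual first-order formula for the correlation between two indices with a third variable (here, age) held constant. The text does not state which formula was used, so this is an assumption; the function name and the input values are hypothetical and are not entries from Table 2.

```python
import math


def partial_corr(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y with z held constant."""
    denominator = math.sqrt((1.0 - r_xz ** 2) * (1.0 - r_yz ** 2))
    if denominator == 0.0:
        raise ValueError("x or y is perfectly correlated with z")
    return (r_xy - r_xz * r_yz) / denominator


# Hypothetical total correlations (illustration only):
# two indices correlate .30 with each other, and .50 and .40 with age.
r_partial = partial_corr(r_xy=0.30, r_xz=0.50, r_yz=0.40)
print(round(r_partial, 3))
```

When both indices rise with age, the partial correlation is smaller than the total correlation, which is why the two sets of values are reported separately.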


HYPOTHESIS II: PARENTAL AUTHORITY


Piaget and Lerner asserted that the changes 
with age in answers to their questions reflected 
a change from relations of authority to rela- 
tions of equality. We can test this assertion 
for each of the four indices if we have adequate 
measures of the child’s relations with parents 
and peers. Questions on these topics were 
included in the questionnaire. 

TABLE 2
CORRELATIONS OF MORAL-JUDGMENT INDICES WITH EACH OTHER AND WITH AGE

(Partial correlations appear above the diagonal, total correlations below.)
[?] = entry illegible.

Moral-judgment index       Intentions-    Punishment   Perspective   Violation
                           consequences                              of norms
Intentions-consequences                   [?]          [?]           [?]
Punishment                 [?]                         [?]           [?]
Perspective                [?]            [?]                        [?]
Violation of norms         [?]            [?]          [?]

Total correlations
Age                        +.53           .34          .50           +.38


Of six intended measures of various aspects of boys' authority relations
with parents and peers, only two showed both internal consistency and
significant association with some moral-judgment index. This negative
finding is of interest, but may reflect on the measure of authority rather
than on the Piaget-Lerner hypothesis. Therefore we shall discuss only those
measures of “authority” that showed significant association with
moral-judgment indices. These were:

1. Extent of parental discipline and control. This index was intended to
measure the current extent of control by parents over the boy's activities.
To the extent that such control is effective, this may be considered a
measure of the degree of “authority.” The seven questions composing this
index had to do with control over the boy’s homework for school, hour of
coming in at night, bedtime, chores, associates, etc.

2. Internalization of parental requirements. In distinction to current
controls, this index was intended as a measure of the cumulative effect of
past parental controls. In a sense it is another measure of the child's
superego, in terms of his reports of his own behavior in situations
involving parental controls. The five questions composing this index had to
do with such matters as whether the boy willingly came to supper on time,
did household chores, and avoided playmates disapproved by his parents.

The correlation between these two indices was +.07. This low degree of
association is not inconsistent with the relation between two similar
variables in the Fels Parent Behavior Scales, “restrictiveness of
regulations” and “effectiveness of policy of regulations and
enforcement” (9).

To conform with Piaget’s notion of “authority,” these indices should show a
steady decrease of authority as the child's age increases. They do not, as
their correlations with age are +.04 and −.08 respectively; some of the
questions that compose the indices show an increase of “authority” with age,
while others show a decrease.

Our measures of authority showed no significant association with any of the
Piaget-Lerner indices discussed so far, but they did show such association
with another index of moral judgment. The “violation of norms” index showed
associations significant at the [...] per cent level with both the “extent
of discipline” index (r = −.13) and the “internalization” index (r = −.16).
These associations are in the expected direction; boys subject to greater
authority tend to approve of acts that violate the norms in question less
than do other boys.
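
The size of these correlations relative to their sampling error can be gauged with the usual t ratio for a correlation coefficient. The Python sketch below is an illustration only: it assumes that all 244 boys contributed to both correlations, which the text does not state, and the function name is hypothetical.

```python
import math


def correlation_t(r, n):
    """t ratio for a product-moment correlation r based on n cases (n - 2 df)."""
    if n <= 2:
        raise ValueError("at least three cases are required")
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r ** 2)


# Reported correlations; n = 244 is an assumption (the full sample).
t_discipline = correlation_t(-0.13, 244)
t_internalization = correlation_t(-0.16, 244)
print(round(t_discipline, 2), round(t_internalization, 2))
```

Both ratios would be referred to the t distribution with n − 2 degrees of freedom; a smaller effective sample would give proportionally smaller ratios.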

Consideration of another aspect of Piaget's
theories led to the introduction of the questions 
in this index. There may be a change in chil- 
dren's morality with age, connected with 
changes in social relationships, yet not reflected 
in the first three indices. We might expect 
specific parentally-taught norms to be held 
less rigidly when there is less likelihood of 
effective sanctions by the parents. 

These additional questions involved the
willingness to violate in certain situations such
moral prescriptions as those favoring respect
for property, obedience to teachers, and 
veracity, when they conflict with obligations 
of friendship, group membership, or personal 
need. One example is the following: 


3. Bill was watching some boys that he knew playing 
ball in the street. One of them threw a ball at the wall 
on the other side of the street, and it broke a window. 
The boys all ran away. The man whose window was 
broken came running out and looked around. “If I 
find out who did that," he said, “I’ll turn him over to 
the police.” He asked Bill if he knew the boy.

What do you think Bill said?

1 __ Bill told him the boy's name

2 __ Bill said he didn't know the boy's name

Why? 


Questions similar to this group were also used 
by Piaget (8, pp. 286-289). 

A loose cluster of four questions appeared among this group. The "violation
of norms" index, based on them, is relatively independent of the previous
three (Table 2). Since this index is also the only one of the four that
shows significant associations with indices of authority, we conclude that
it comes closer to measuring an aspect of moral judgment connected with
parental authority than do the first three Piaget-Lerner indices.


DISCUSSION


The “violation of norms” index seems to fit our second Piaget hypothesis
better than do the other indices derived more directly from Piaget and
Lerner. This difference may perhaps be explained by postulating two distinct
processes of moral development: “cognitive” moral development, involving the
learning of what behavior patterns are approved and disapproved, and
"emotional" moral development, including the association of anxiety with
one’s own deviance and moral indignation with that of others. Perhaps the
Piaget-Lerner questions in the first three indices are concerned relatively
more with cognitive moral development and the “violation of norms" questions
more with emotional moral development.

The importance of cognitive factors in moral development has been amply
demonstrated, at least as far as the relation of moral judgment to
intelligence test scores is concerned. Piaget presented data illustrating
such a relation as regards “immanent justice" (8, pp. 251-252), though he
did not seem to distinguish the two aspects of moral development sharply.
Others have found similar relations in tests of both normal and mentally
retarded children. A small sample of gifted boys contacted in the present
study also tended to give more mature answers than did a comparison group,
matched on age and social status, though they tended to give less mature
answers to the “violation of norms" questions. The distinction between
"cognitive" and “emotional” aspects of moral development is supported by
some findings of the Character Education Inquiry (CEI) which agree with
those of the present study in distinguishing different types of
moral-judgment indices. Two of the CEI “moral knowledge" tests,
“cause-effect” and “provocations” (3, pp. 38, 50), resemble our “immanent
justice” questions and our “violation of norms” index, respectively. They
also resemble our indices in relation to age and intelligence. The
correlation of the "provocations" test with intelligence is opposite in sign
to its correlation with age (3, p. 111); hence, on this test the more
intelligent children tend to give less “mature” answers, as they do on our
“violation of norms" index. The “provocations” test also correlates more
closely than does the “cause-effect" test with behavioral tests of “honesty”
(3, p. 156), but distinctly less closely with tests of intelligence and
vocabulary (3, pp. 111, 392). Thus it seems that the “provocations” test
shows closer relation to behavioral moral development, and less to cognitive
development, than does the “cause-effect” test.

The distinction between cognitive and emotional aspects may also help to
clarify the relation of moral judgment to social status. Lerner (4) reported
that higher status children gave more “mature” answers to Piaget-type
questions than did lower status children, even after control on age and
intelligence. Data from the present study and another (2) confirm this
finding. A rating of father's occupational status, based on a national
survey (7), correlates +.15 with the "intentions-consequences” index and
+.23 with the “punishment” index, both significant at the .05 level. The
“violation of norms" index, however, shows a reversal of sign in this
association.

The differences in moral judgment with 
status observed by Lerner, together with a 
more authoritarian view of parents by lower 
status children, were taken by him to confirm 
Piaget’s hypothesis concerning parental au- 
thority. Such an interpretation leads to a 
paradox, however. A study by Davis and 
Havighurst (1) pictures lower status white 
parents as more permissive in early training 
than higher status parents. If this finding 
holds throughout the United States, and if 
answers to the Piaget questions reflect the 
strictness or permissiveness of early discipline, 
we should expect lower status children to show 
more rather than less “maturity” on the Piaget 
questions. 

This paradox can be resolved by reinterpret- 
ing Lerner’s findings. The difference in moral 
judgment between status groups that he ob- 
served may be due more to differences in the 
cognitive indoctrination process than to 
differences in authority. The apparently greater 
degree of parental authority that he observed 
in lower status families may correspond more 
to overt punitiveness than to effective inculca- 
tion of norms. Such a parent-child relation may 
be more frequent in immigrant families, repre- 
sented heavily in Lerner’s lower status sample, 
than in higher status families. In short, it may 
be that the higher status child matures more rapidly in his cognitive moral
development, while the lower status child matures more rapidly emotionally,
in the sense of emancipation from his parents.

This interpretation is, of course, limited in applicability. The phenomena
studied seem at least to be common to Western Europe and the Eastern United
States. They will not necessarily appear in primitive societies, as Thompson
(10) has shown, but common learning processes may underlie moral development
in diverse societies.


SUMMARY 


A study of the moral judgment of 244 boys aged 5 to 14 was conducted to test
two major hypotheses of Piaget and Lerner. Neither hypothesis was confirmed.

1. Moral-judgment questions used by Piaget and Lerner revealed on cluster
analysis not a single factor but three relatively independent clusters, when
age of child was controlled statistically.

2. Indices based on these clusters showed no significant association with
measures of parental authority constructed in this study.

3. Two of these same measures of parental authority did, however, show
significant associations with another moral-judgment index, relating to
violation of norms regarding lying and stealing.

4. It is hypothesized that the Piaget-Lerner
questions are more concerned with boys'
cognitive moral development, while the ques- 
tions relating to violation of norms are more 
closely related to their emotional moral de- 
velopment, and hence to parental authority. 

5. Lerner's findings with regard to moral 
judgment and social status are confirmed, but 
another explanation is suggested. 


REFERENCES


1. Davis, A., & Havighurst, R. J. Social class and
color differences in child-rearing. Amer. sociol.
Rev., 1946, 11, 698-710.

2. Harrower, Molly R. Social status and the moral
development of the child. Brit. J. educ. Psychol.,
1934, 4, 75-95.

3. Hartshorne, H., May, M. A., & Shuttleworth,
F. K. Studies in the nature of character. Vol. III.
Studies in the organization of character. New
York: Macmillan, 1930.

4. Lerner, E. Constraint areas and the moral judgment
of children. Menasha, Wis.: Banta, 1937.

5. Lerner, E. The problem of perspective in moral
reasoning. Amer. J. Sociol., 1937, 43, 249-269.

6. MacRae, D., Jr. The development of moral
judgment in children. Unpublished doctor's
dissertation, Harvard Univer., 1950.

7. North, C. C., & Hatt, P. K. Jobs and occupations:
a popular evaluation. In L. Wilson & W. L.
Kolb (Eds.), Sociological analysis. New York:
Harcourt, Brace, 1949. Pp. 461-473. (A more
detailed list of occupations is given in the
National Opinion Research Center's Opinion
News, Sept. 1947.)

8. Piaget, J. The moral judgment of the child. Glencoe,
Ill.: Free Press, 1948. (Translation by Marjorie
Gabain from Le jugement moral chez l'enfant.
Paris: Alcan, 1932.)

9. Roff, M. A factorial study of the Fels Parent
Behavior Scales. Child Develpm., 1949, 20, 29-45.

10. Thompson, Laura. Attitudes and acculturation.
Amer. Anthropologist, 1948, 50, 200-215.


Received September 15, 1957. 




THE LEADERSHIP BEHAVIOR AND COMBAT PERFORMANCE OF
AIRPLANE COMMANDERS¹


ANDREW W. HALPIN
Personnel Research Board, The Ohio State University


The fresh attention which social psychology has focused upon small group
behavior (4, 12) is reflected in current
research on military leadership. Stouffer (13) 
and others (3, 6, 8, 10) have discussed the 
military significance of primary groups, and 
have indicated the need for research on leader- 
ship within such face-to-face groups. The 11 
members of the B-29 crew constitute a group 
of this kind. The crew, however, is part of a 
larger hierarchical organization in which the 
airplane commander is responsible to squadron 
and wing superiors who evaluate his combat 
performance and that of his crew. The present 
study is an analysis of the relationship between 
the crew’s perception of the leadership be- 
havior of the airplane commander and (a) 
ratings of his combat performance made by his superiors, and (b) an index of
his crew’s satisfaction with him as a commander. Thus, the analysis involves
a comparison between the crew’s description of the commander’s behavior and
two independent evaluations of the effectiveness of that behavior: one by
his superiors and the other by his crew.

In ordinary parlance, the concept of leadership is used in an evaluative
sense. To say that a man displays leadership implies that this is “good” or
“effective” leadership. The evaluation of leadership, however, is only one
aspect of the problem. A more primary task is to describe the behavior of
the leader in terms of psychologically meaningful dimensions. If a
description of the leader with respect to specific dimensions of behavior
and an evaluation of the effectiveness of that behavior can be obtained
independently, then it becomes possible to ascertain how much each dimension
contributes to favorable evaluation. Furthermore, one also may determine
whether this contribution changes when the source of evaluation is changed.

¹ This study was sponsored jointly by the Human Resources Research
Laboratories, Department of the Air Force, and The Ohio State Leadership
Studies. It is one of a series of Studies in Aircrew Composition supervised
by Dr. John K. Hemphill. The author is indebted to Dr. Hemphill for his many
valuable suggestions, and to Mrs. Janet W. Bieri for her assistance in
analyzing the data. The full data upon which this partial report is based
are reported elsewhere (5). The opinions expressed are those of the author
and should not be regarded as having the endorsement of the Department of
the Air Force.

Evaluations of leadership may be obtained 
readily enough by means of various rating 
schedules. On the other hand, the measure- 
ment of a group's description of its leader’s 
behavior is a less commonly used procedure. 


Hemphill and Coons (7) have devised a technique for this purpose, a Leader
Behavior Description Questionnaire. This instrument was used originally with
nonmilitary groups. Later, two forms of the questionnaire (130 and 80 items
respectively) were designed specifically for the aircrew situation (5).
Both questionnaires are in multiple-choice 
format, and the frequency with which the 
leader engages in the behavior described is 
indicated by marking, for each item, a state- 
ment containing one of five adverbs: always, 
often, occasionally, seldom, or never. The 
following items are illustrative: 


1. He tries out his new ideas on the crew. 

2. He rules with an iron hand.

3. He gets crew approval on important matters 
before going ahead. 

4. He does personal favors for crew members. 


PROCEDURE 


In order to develop empirical keys for the Leader Behavior Description
Questionnaire, a factor analysis² was made of the responses obtained from a
sample of 300 questionnaires. The procedure was based upon an extension of
an iterative factor analysis technique (14, 15). Although seven factors were
extracted, after the completion of rotation it was found that the last three
accounted for only approximately 6 per cent of the variance common to the
seven. Because no meaningful interpretation could be made of these
relatively unrepresented factors, nothing more was done with them. The four
remaining factors have been designated as: Consideration, Initiating
Structure-in-Interaction, Production Emphasis,³ and Social Awareness. The
first two of these account for approximately 50 and 34 per cent respectively
of the common variance. These factors, Consideration and the Initiating
Structure-in-Interaction, are the only two pertinent to the present study.
Other investigators have noted the relevance of these two aspects of
leadership. Freud (2), for example, has discussed the importance of
consideration and libidinal ties in the relationship between the leader and
the group. Similarly, Homans’ concept of the leader “originating
interaction” (9, ch. 16) corresponds essentially to what is referred to here
as Initiating Structure-in-Interaction. Furthermore, these two dimensions of
leader behavior are analogous to constructs which have been developed in the
field of personality theory: for example, to Rank’s (11) principles of love
and force and to Bronfenbrenner's (1) dimensions of support and structure.

² Dr. B. James Winer, University of North Carolina, made this factor
analysis and constructed the dimension keys.

High positive loadings on the Consideration factor 
are associated with behavior indicative of friendship, 
mutual trust, respect, and a certain warmth in the rela- 
tionship between the airplane commander and his 
crew. High negative loadings appear on items which 
suggest that the commander is authoritarian and im- 
personal in his relations with members of the crew. 
Consideration thus refers to the extent to which the 
airplane commander, while carrying out his leadership 
functions, is considerate of the men in his crew. 

High positive loadings on the Initiating Structure 
factor occur on items which imply that the airplane 
commander organizes and defines the relationship 
between himself and the members of his crew. He tends 
to define the role which he expects each member of the 
crew to assume, and endeavors to establish well- 
defined patterns of organization, channels of communi- 
cation, and ways of getting jobs done. This factor 
probably represents a basic and unique function of 
leadership. It is possible that other factors, including 
Consideration, primarily represent facilitating means 
for accomplishing this end. 

On the basis of the factor analysis, keys were
constructed for these two dimensions of leadership
behavior. The Consideration key of 28 items⁴ has an
estimated reliability (corrected by the Spearman-Brown
formula) of .94. The corresponding estimate
for the 29-item Initiating Structure key is .76.
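
The Spearman-Brown correction mentioned above can be sketched as follows. This is a minimal illustration, assuming the usual split-half case in which the key is treated as twice the length of each half; the text does not say how the halves were formed, and the half-test correlation of .887 used here is hypothetical, chosen only because it steps up to roughly the reported .94.

```python
def spearman_brown(r_part: float, factor: float = 2.0) -> float:
    """Estimate the reliability of a test `factor` times as long as the
    part whose reliability (or split-half correlation) is `r_part`."""
    return factor * r_part / (1.0 + (factor - 1.0) * r_part)


# Hypothetical half-test correlation (not a value reported in the article).
r_half = 0.887
print(round(spearman_brown(r_half), 2))
```

The same function with a different `factor` estimates the effect of lengthening or shortening a key by any ratio.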


3 Production Emphasis refers to a manner of motivating
the crew to greater activity by emphasizing the
mission or job to be done. Social Awareness
(Sensitivity) appears to measure the airplane commander's
sensitivity to and awareness of social interrelationships
and pressures existing both inside and outside the
crew. It may represent activities on the part of the
airplane commander which often have been referred to as
“sizing up the situation.” This factor does not indicate
the extent to which the airplane commander acts in
accord with these insights, but merely indicates his
awareness of social pressures originating from either
within or outside the crew. There appears to be a
distinction between being aware of these social pressures
and taking appropriate action with respect to them.

4 In the 80-item form of the questionnaire, there are
only 15 items on each of the keys. The estimated
reliabilities are .93 and .86 respectively.


"ANDREW W. HALPIN 


The 130-item form of the Leader Behavior Description
Questionnaire was administered to 353 members
of 52 B-29 crews who attended the Combat Crew
Training School at MacDill Air Force Base during the
autumn of 1950. In order to determine whether the
crew members agreed on how they described their
leaders on the Consideration and the Initiating Structure
dimensions, an analysis was made of the dimension
score variance associated with the airplane commander
who was described. Specifically, the between-crew and
within-crew variances were compared for each dimension
separately. The F ratio, in the case of both dimensions,
was significant at the .01 level. The corresponding
unbiased correlation ratios are .54 and .36, thus
indicating a tendency for the crew members to agree in
their perception of their commander's leadership
behavior. Accordingly, crew mean Consideration and
crew mean Initiating Structure scores were used as
indices for describing the leader's behavior on these
dimensions. For 52 crews, the correlation between these
ascribed Consideration and Initiating Structure scores
is .45, which is significant at the .01 level.
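
The comparison of between-crew and within-crew variance can be illustrated with a small one-way analysis of variance. This is only a sketch under stated assumptions: the crew scores are invented, and the "unbiased correlation ratio" is taken here to be the epsilon-squared form, 1 - MS(within)/MS(total); the article does not give its formula, nor say whether .54 and .36 are the ratio or its square.

```python
from statistics import mean


def one_way_anova(groups):
    """Return (F ratio, epsilon-squared) for a list of score lists,
    one list per crew."""
    scores = [x for g in groups for x in g]
    grand_mean = mean(scores)
    k, n = len(groups), len(scores)
    # Sum of squares between crews and within crews.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (n - k)
    ms_total = (ss_between + ss_within) / (n - 1)
    f_ratio = ms_between / ms_within
    # Assumed form of the unbiased correlation ratio (squared).
    epsilon_sq = 1.0 - ms_within / ms_total
    return f_ratio, epsilon_sq


# Invented dimension scores for three hypothetical crews.
crews = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
f_ratio, epsilon_sq = one_way_anova(crews)
print(f_ratio, round(epsilon_sq, 3))
```

A large F ratio means crew members describing the same commander resemble one another more than members of different crews do, which is the agreement the crew-mean scores rely on.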

During the summer of 1951, data from three sources
were collected in Japan on 33 of the 52 airplane
commanders who had been described by their crews at
MacDill Air Force Base. First, 29 of the 33 commanders
were described again on the Leader Behavior
Description Questionnaire (80-item form). Second, the
squadron and wing superiors rated all 33 commanders
with respect to seven criteria of combat performance.
The rating for each criterion was made on a nine-point
scale. The commanders were evaluated by from one
to four raters, with 73 per cent of the ratings secured
from more than a single rater. Third, the members of
27 crews answered the question, “If you could make up
a crew from among the crew members in your squadron,
whom would you choose for each crew position?”
The ratio between the number of votes the incumbent
commander received and the number of votes cast
was used as an index of the crew's satisfaction with his
leadership. Accordingly, the data consist of two
descriptions of the leader's behavior (one in training and
one in combat) and two evaluations of his combat
performance, one by his superiors and the other by
his crew. The superiors' ratings and the Satisfaction
Index then were correlated with the Consideration and
the Initiating Structure scores ascribed to these
commanders by their crews, both in this country and in the
Far Eastern Air Force. In each case, partial as well as
zero-order correlations were computed, that is, for the
Consideration scores with the Initiating Structure
scores partialled out, and for the Initiating Structure
scores with the Consideration scores partialled out (see
Table 1).
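
The first-order partial correlations described above follow the standard formula, sketched below. The inputs are illustrative: -.20 and .22 are the zero-order training correlations for the rating of over-all effectiveness in Table 1, and .45 is the correlation between the two dimensions reported for all 52 crews. The partial coefficients in the table were presumably computed with the intercorrelation for the commanders actually rated, so this result need not match the printed value exactly.

```python
import math


def partial_r(r_xy: float, r_xz: float, r_yz: float) -> float:
    """Correlation between x and y with z partialled out."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1.0 - r_xz ** 2) * (1.0 - r_yz ** 2))


# x = superiors' rating, y = Consideration, z = Initiating Structure.
r_rating_consideration = -0.20
r_rating_structure = 0.22
r_consideration_structure = 0.45  # assumption: value for all 52 crews

print(partial_r(r_rating_consideration,
                r_rating_structure,
                r_consideration_structure))
```

Because the two dimensions are positively correlated with each other but related to the ratings in opposite directions, partialling either one out strengthens the coefficient for the other, which is the accentuation reported in the results.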


RESULTS 


In both the training and the combat situa- 
tion, there is a trend toward negative correla- 
tions between the superiors' ratings and the 
Consideration scores, and positive correla- 
tions between these ratings and the Initiating 
Structure scores. The partial correlations 
accentuate this trend, which is more marked 
in the combat situation than in the training 




TABLE 1

CORRELATIONS BETWEEN THE AIRPLANE COMMANDER'S CONSIDERATION AND INITIATING STRUCTURE SCORES AND
(A) SUPERIORS' RATINGS OF HIS COMBAT PERFORMANCE, AND (B) CREW'S SATISFACTION INDEX

                                                  IN TRAINING (N = 33†)                IN COMBAT (N = 29‡)
                                              ZERO-ORDER r       PARTIAL r        ZERO-ORDER r       PARTIAL r
SUPERIORS' RATINGS OF                        CONSID.  INIT.    CONSID.  INIT.    CONSID.  INIT.    CONSID.  INIT.
COMBAT PERFORMANCE                                    STRUC.            STRUC.            STRUC.            STRUC.
                                               (1)     (2)       (3)     (4)       (5)     (6)       (7)     (8)

A. Technical competence on arrival overseas   -.24  [illegible]  -.31    .23      -.02     .24      -.18     .29
B. Technical competence at present time       -.21     .10       -.27    .20      -.22     .19      -.38*    .36
C. Effectiveness of working with crew members -.14     .14       -.21    .21      -.13     .27      -.33     .40*
D. Conformity to SOP                           .03     .37*      -.13    .39*     -.25     .32      -.52**   .54**
E. Performance under stress                   -.22     .15       -.31    .26      -.12     .16      -.24     .26
F. Attitude and motivation to be effective    -.15     .20       -.25    .28      -.34     .16      -.50**   .42*
G. Over-all effectiveness as a combat crew    -.20     .22       -.32    .33      -.23     .28      -.46*    .48**
   member

Crew's Satisfaction Index                      .48*   -.17        .63** -.48*      .64**   .35       .57**  -.03

* Significant at the .05 level.
** Significant at the .01 level.
† For Ratings B and E, N = 32; for Satisfaction Index, 27.
‡ For Rating E, N = 28; for Satisfaction Index, 27.


situation. One notes particularly that in the
case of the rating on “over-all effectiveness
as a combat crew member,” which perhaps
best represents the way the superiors, in
day-to-day operation, evaluate the airplane
commander, both partial correlations based upon
the crews’ perception of the commander’s
behavior in combat are statistically significant.

On the other hand, the correlations between
the leadership dimensions and the Satisfaction
Index show a trend in the opposite direction.
The zero- and first-order correlations between
this index and the Consideration scores,
whether in training or combat, are positive
and statistically significant. Conversely, the
correlations with the Initiating Structure
scores, although not consistently significant,
tend to be negative.


The four entries in the lower right-hand 
corner of Table 1 succinctly illustrate the two 
trends. It will be recalled, however, that the 
correlation between the leadership dimensions, 
as described, is .45. Thus, although the rela- 


of the leader is evaluated both by superiors
and subordinates as antithetical. Furthermore,
the superiors and the subordinates
each view this antithesis in an opposite way.


SUMMARY 


1. In developing a Leader Behavior Description
Questionnaire on which the members of
52 B-29 crews described the behavior of their
airplane commander, two major dimensions
of leadership were identified: Consideration
and Initiating Structure-in-Interaction. The
correlation between these dimension scores is
.45.

2. While in training in this country 33 
airplane commanders were described by their 
crews on this questionnaire. Later, 29 of these 
commanders on combat assignment in the 
Far Eastern Air Force were again described 
by their crews. Ratings on seven aspects of the 
combat performance of these commanders 
were secured from their administrative
superiors. An index of each crew's satisfaction
with its commander was also computed.

3. A trend toward negative correlations
was found between the superiors' ratings and
the Consideration scores, and toward positive
correlations between these ratings and the
Initiating Structure scores. Conversely, the
correlations between the Satisfaction Index and the
dimension scores showed a trend in the opposite
direction. Thus the superiors and subordinates
were inclined to evaluate oppositely
the contribution of these dimensions to the
effectiveness of leadership. This difference in
evaluation confronts the leader with conflicting
role expectations. It is suggested that the role 
conflict so induced constitutes the core of 
“the dilemma of leadership." Further research 
is needed to determine the precise techniques 
which different leaders use in their attempt to 
resolve this conflict, and to analyze the con- 
sequences of these techniques for both the 
personal adjustment of the leader and the 
effectiveness of the group. 


REFERENCES 


1. BRONFENBRENNER, U. Toward an integrated theory
of personality. In R. R. Blake & G. V. Ramsey
(Eds.), Perception: an approach to personality.
New York: Ronald, 1951. Pp. 206-257.

2. FREUD, S. Group psychology and the analysis of the
ego. (4th Impr.) London: Hogarth, 1948.

3. GRINKER, R. R., & SPIEGEL, J. P. Men under stress.
Philadelphia: Blakiston, 1945.

4. HALPIN, A. W. Current conceptual trends in small
group study: social psychology. Autonomous
Groups Bull., 1952, 7 (2 & 3), 4-17.

5. HALPIN, A. W., & WINER, B. J. The leadership
behavior of the airplane commander. Washington,
D. C.: Human Resources Research Laboratories,
Dept. of the Air Force, 1952.

6. HAVRON, M. D., FAY, R. J., & GOODACRE, D. M.,
III. The effectiveness of small military units.
(Personnel Research Section, AGO, Report No.
885.) Washington, D. C.: Dept. of the Army,
1951.

7. HEMPHILL, J. K., & COONS, A. E. Leader behavior
description. Columbus, Ohio: Personnel Research
Board, The Ohio State Univer., 1950.

8. HOMANS, G. C. The small warship. Amer. sociol.
Rev., 1946, 11, 294-300.

9. HOMANS, G. C. The human group. New York:
Harcourt, Brace, 1950.

10. MANDELBAUM, D. G. Group dynamics of military
units. (Prepared for the Working Group on
Human Behavior Under Conditions of Military
Service.) Washington, D. C.: Dept. of Defense,
Research and Development Board, 1951.

11. RANK, O. Will therapy and truth and reality. New
York: Knopf, 1950.

12. SMITH, M. B. Social psychology and group
processes. Annu. Rev. Psychol., 1952, 3, 175-204.

13. STOUFFER, S. A., et al. The American soldier. Vol.
1. Adjustment during Army life. Princeton, N. J.:
Princeton Univer. Press, 1949.

14. WHERRY, R. J., & GAYLORD, R. H. The concept of
test and item reliability in relation to factor
pattern. Psychometrika, 1943, 8, 247-264.

15. WHERRY, R. J., CAMPBELL, J. T., & PERLOFF, R.
An empirical verification of the Wherry-Gaylord
iterative factor analysis procedure.
Psychometrika, 1951, 16, 67-74.

Received September 20, 1952.


SELECTIVE RECALL AND MEMORY DISTORTION OF FAVORABLE
AND UNFAVORABLE MATERIAL
RONALD TAFT
University of Western Australia


A RECENT article by Alper and Korchin
(1) describes a study of selective
memory among males and females
in the recall of “socially relevant material.”
The present article reports similar material
obtained from Negro and white delinquent
subjects (Ss) which partly supports and partly
contradicts Alper and Korchin's findings.
Alper and Korchin’s study was concerned
with the differential recall by male and female
students of a passage dealing in a controversial
manner with the problem of coeducation.
These writers assume that the general tone
of the passage was derogatory to women and
aroused the defenses of their female Ss. They
found that males consistently retained more
of the material than females, particularly the
pro-male, pro-female, and anti-female items.
The females’ protocols contained more
distortions, errors, and exaggerations, particularly
of the anti-female statements, than the males’
protocols. Alper and Korchin conclude that
“. . . female Ss accept the cultural stereotype
of their inferiority and reinforce it by keeping
it in the focus of their attention” (1, p. 36).


METHOD

The experiment is similar to Alper and Korchin's
although it differs in several important respects, which
will be referred to when we compare the findings.


Subjects 


Two groups of 30 Negro and white boys in the
New York State Training School for delinquents served
as Ss. The groups were matched on their immediate
recall of a control passage, and the mean MA’s were
12-5 and 12-7 respectively. None had been in the school
for more than seven months. It is relevant to our
experiment to note that all the Negroes were
unmistakably dark-skinned and that the E was a white
Australian whom they had known only superficially, if at
all, prior to the study.


Procedure

Each S was examined individually, and, to avoid
the boys’ discussing the content of the passages among
themselves, those who had taken the experiment were
kept apart for the day from those who had not.

1 The data were collected by the author when he was
a psychological intern at the New York State Training
School for Boys in 1941. The complete experiment is
reported in (11).

Immediate recall series. After a brief general
discussion, S was instructed that he was to have a memory
test. “You don't have to remember the exact words, but
just listen carefully and tell me as much about it as
you can.”

The E read aloud the passages from a specially 
prepared script in which the emphases and pauses were 
notated. The E practiced the reading prior to the experi- 
ment and found little difficulty in preserving approxi- 
mately uniform loudness and speed from reading to 
reading. 

Delayed recall series. Three days after the immediate 
recall the Ss were interviewed individually and in- 
structed as follows: “Remember that story about base- 
ball which I read to you the other day? Tell me as 
much of it as you can remember.” When S stopped, 
E read over as much as he had recalled and asked him 
if he could think of anything that he had left out. 

The fact that for all Ss very little material was
introduced in the delayed recall which was not included
in their immediate recall makes it likely that if the
boys did discuss the experiment among themselves,
they did not discuss the subject matter of the passages.

The experimental passage. The experimental passage 
reproduced below has been subdivided into items for 
scoring purposes. The passage was divided, as far as 
possible, so that each item would include one idea, but 
that the recall of one item would not automatically 
include the recall of another. There is no suggestion
that each item is equally difficult to recall.

“One of / the greatest / Negro / baseball / pitchers / 
of America / played ball on local / sandlots / for four- 
teen years / and then went to Brazil / to play in the 
important / games there. / He has never played major 
league games in the United States / and some people 
say / that this is because he is colored; / as a matter of 
fact / his mother is light-skinned / and his father is 
brown-skinned. / Joe Washington / has a very / good 
nature / and is not too / upset by his lack of success. /
However, he says, / ‘I am becoming / a little / lonesome /
for Harlem, / even though / it's half colored there. / 
The United States is home to me, / just as it is to the 
other Americans, / light or dark.’ /” 

In order to note differences in the recall of types of
items, each item was rated, as in the previous
experiment, as “favorable” (to the Negroes), “unfavorable,”
“neutral,” or “ambiguous” (i.e., hedonic tone likely to
vary from S to S). The “ambiguous” items are
probably comparable to Alper and Korchin’s “ambivalent”
items.

The ratings were made by three raters, all white;
disagreement was resolved by conference. Perhaps it
would have been more meaningful if the Ss themselves
had rated the hedonic tone of the items, but it is
doubtful whether, because of the type of Ss used, they could
have given very reliable ratings of the items. It is
hoped that the “ambiguous” category will take care of
the more controversial items.

TABLE 1
IMMEDIATE RECALL: TOTAL RECALLS
(N = 30 Negro Ss and 30 white Ss)

               NO. OF                          NEGRO
TYPE OF ITEM   RECALLS    NEGRO     WHITE      SUPERIORITY
               POSSIBLE   RECALLS   RECALLS    %
Favorable        480        181       140         29          2.36*
Unfavorable      240         84        68         24          1.21
Ambiguous        150         76        57         33          1.12
Neutral          120         28        27          4          0.39
Total            990        369       292         26          3.70**

* Significant at .05 level.
** Significant at .01 level.

There were 33 items altogether, 16 favorable, 8 un- 
favorable, 5 ambiguous, and 4 neutral. 

This story has a few similarities to the story of
Satchell Paige, a Negro baseball pitcher. However, in
a survey of Negroes at the school (not used as Ss in
the experiment), it was found that they did not know
anything about him. At the time, there were no Negroes
in major league teams, and the subject matter, that of a
suggested discrimination against a great Negro player,
could be presumed to touch on a topic in which the
Negro Ss had considerable emotional involvement as
compared with the white Ss.

Scoring. Each item recalled was assigned a weight 
of unity; an item was scored as a recall if the idea itself 
was reproduced accurately. An inaccurate reproduction 
of an item was scored as a distortion. 


RESULTS²


Immediate recall. The Negroes recalled
significantly more items than the whites,
particularly those items that were favorable
to them (Table 1). There was no difference
in their recall of neutral items. Both groups
recalled favorable items slightly better than
unfavorable ones, and ambiguous ones best
of all. Many fewer of the neutral items were
recalled.
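
The "Negro superiority" percentages in Table 1 appear to be the difference between the two groups' recall counts expressed as a percentage of the white count. That reading is an inference from the printed figures, not a formula stated in the text; the sketch below checks it against two rows of the table.

```python
def percent_superiority(negro_recalls: int, white_recalls: int) -> float:
    """Excess of Negro over white recalls, as a percentage of white recalls.
    Assumed definition, inferred from the figures in Table 1."""
    return 100.0 * (negro_recalls - white_recalls) / white_recalls


# Recall counts taken from Table 1 (immediate recall).
favorable = percent_superiority(181, 140)
total = percent_superiority(369, 292)
print(round(favorable), round(total))
```

A negative result under this definition would indicate that the white group recalled more items of that type.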

Confirmation of these results is provided
by Table 2, which shows the comparison for
the number of recalls plus distortions, since
distortions may be considered to be indicative
of partial recall as opposed to complete
omission. The results show a pattern similar to
the recalls alone, but the levels of significance
are reduced.

2 The results presented here differ in some respects
from those reported in (11); my recomputation of the
statistics for this paper revealed some minor and
insubstantial errors in the previous presentation. How
many theses submitted hurriedly to beat
university-dictated deadlines could emerge completely unscathed
from a careful recomputation?

TABLE 2
IMMEDIATE RECALL: TOTAL RECALLS PLUS
DISTORTIONS

               NO. OF                          NEGRO
TYPE OF ITEM   RECALLS    NEGRO     WHITE      SUPERIORITY
               POSSIBLE   RECALLS   RECALLS    %
Favorable        480        243       205         19          1.72
Unfavorable      240        113        97         17          1.49
Ambiguous        150         83        76          9          0.50
Neutral          120         31        33         -6          0.50
Total            990        470       411         14          2.13*

* Significant at .05 level.

TABLE 3
DELAYED RECALL: TOTAL RECALLS

               NO. OF                          NEGRO
TYPE OF ITEM   RECALLS    NEGRO     WHITE      SUPERIORITY
               POSSIBLE   RECALLS   RECALLS    %
Favorable        480        176       106         66          3.17**
Unfavorable      240         60        62         -3          0.28
Ambiguous        150         44        33         33          1.78
Neutral          120         16        12         33          0.65
Total            990        296       213         38          3.4**

** Significant at .01 level.

Delayed recall. The delayed recall series
reproduces the same superiority for the Negro
Ss in the total number of items recalled, in
the items favorable to Negroes, and in the
ambiguous items (Table 3). There is no longer
any difference, however, in the recall of items
unfavorable to Negroes. The Negroes
recalled best the favorable items, while the
whites recalled best the unfavorable items.
Both groups recalled comparatively few of the
neutral items.

Is there any differential forgetting of items
from the immediate to the delayed recall
series? Table 4 expresses the number of delayed
recalls as a percentage of the immediate
recalls plus distortions (subtracted from 100
to provide a percentage of forgetting). The
comparison is made with recalls plus
distortions rather than with just recalls, as the
delayed recalls have been scored liberally.
Table 4 suggests that the Negroes’ superiority
of recall increased in the delayed series,
particularly for the favorable items. The neutral
items were also surprisingly well retained by
the Negroes in comparison to the whites.
On the other hand, the percentage of
unfavorable items recalled was lower for the Negroes
than for the whites.
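
The forgetting measure just described can be written out directly. This sketch uses counts from Tables 2 and 3; the function name is illustrative. The values it yields agree with the printed Table 4 entries to within about a tenth of a percentage point, the small differences presumably reflecting rounding in the original computation.

```python
def percent_forgetting(delayed_recalls: int, immediate_plus_distortions: int) -> float:
    """100 minus delayed recalls as a percentage of immediate recalls
    plus immediate distortions; 100 means complete forgetting."""
    return 100.0 * (1.0 - delayed_recalls / immediate_plus_distortions)


# (delayed recalls from Table 3, immediate recalls plus distortions from Table 2)
negro_unfavorable = percent_forgetting(60, 113)
white_unfavorable = percent_forgetting(62, 97)
negro_favorable = percent_forgetting(176, 243)

print(round(negro_unfavorable, 1),
      round(white_unfavorable, 1),
      round(negro_favorable, 1))
```

On this measure a lower percentage means better retention over the three days, so the group with the lower figure is the one listed as superior.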

This measure provides us with an indication
of the forgetting that has occurred between
the original perception (as measured by
immediate recall) and the recall test three days
later (delayed recall). It is interesting to note
that in the Negro group this forgetting process
occurred for the unfavorable items to the
same extent as for the neutral and ambiguous
items, whereas many fewer of the favorable
items were forgotten. In the white group,
the unfavorable items were forgotten least,
with the favorable items next. These results
suggest some degree of repression for the
unfavorable items in the case of the Negro Ss
whereas their involvement in the other items
led to good retention of these over a period
of time.

The findings for the immediate and delayed 
recalls are given more meaning if we consider 
the results for individual items. 

The items on which the Negroes exceeded
the whites in the immediate recalls consisted
of various types: fact (“he was a pitcher”),
ethnic discrimination (“this is because he is
colored”), and escape from discrimination
(“and then went to Brazil”). On items
referring to self-restraint the results are mixed:
Negroes excelled in the recall of “he has a
good nature,” whites on “he was not too upset
by his lack of success,” and (he played on
sandlots) “for fourteen years.”

In the delayed series, few of the boys of
either group recalled these “self-restraint”
items at all. The differences in the recalls of
the two groups were largely taken up by the
Negroes’ superiority in items referring to
achievement (“to play important games
there,” “one of the greatest”) and acceptance
(“lonesome for Harlem”).


TABLE 4
AMOUNT OF FORGETTING BETWEEN IMMEDIATE
RECALL AND DELAYED RECALL
(Delayed recalls expressed as percentage of immediate
recalls plus immediate distortions; 100 per cent
equals complete forgetting.)

                      PERCENTAGE OF FORGETTING
TYPE OF ITEM          NEGROES   WHITES   COMBINED   SUPERIOR
                                         GROUP      GROUP
Favorable               27.7     48.2      37.0        N
Unfavorable             46.9     36.0      42.1        W
Ambiguous               47.0     56.7      51.5        N
Neutral                 48.3     63.7      56.2        N
All items combined      37.1     48.2      42.4        N


Results Compared with Those of Alper and
Korchin

How do our results compare with those of
Alper and Korchin? Since the females were
presumed to be emotionally involved and on
the defensive in their experiment, we shall
compare the results for their female Ss with
those for the Negro Ss in our experiment.

1. The females consistently recalled less of
the material; the Negroes consistently retained
more.

2. The females recalled less of the pro- and
anti-female items than the males; the Negroes
recalled more of the “pro-Negro” items in both
immediate and delayed series, and more of the
“anti-Negro” items in the immediate recall
series than the whites. The Negroes forgot
more of the unfavorable items in a three-day
period than did the whites.

3. The females and males did not differ
on ambivalent items, but the Negroes tended
to be superior to the whites in the recall of
ambiguous items.

4. In neither experiment did any significant
differences appear in the immediate recall
of neutral items. The Negroes retained the
neutral items better than the whites in the
delayed recall.

5. Three days’ delay in recall accentuated
the above trends in the Negro-white
experiment, except for the repression by the Negroes
of items unfavorable to themselves. There
were no comparable data for the female-male
study although in the course of an hour no
such trends were observed.

Distortions. Distortions were scored where
the recall obviously referred to an item in the
original but was so distorted that it could not
be regarded as having the original meaning.


TABLE 5
DISTORTIONS IN IMMEDIATE RECALL


Table 5 presents the comparative number 
of distortions for the two groups in the im- 
mediate recall series. 

Although none of the differences is statis- 
tically significant, there are some interesting 


bered this item remem. 


accurately, 
Washington to 


eemtbiy, the 




the whites were satisfied® Pleasant than their unpleasant experien 


on the part of some of the Negroes, and
unsympathetic disinterest, or even hostility,
on the part of some of the whites. No evidence
was found to support Alper and Korchin's
finding that their ego-involved group tended
to introduce defensive interpretations and
comments.


DISCUSSION


How are we to account for the differences
found between the experiment described here
and Alper and Korchin's experiment? Why
does ego involvement produce such different
effects in the two experiments? The answer
apparently lies in the different circumstances


are attributable to the type of Ss used. For
example, some types of personality typically
repress emotionally distressing experiences
while others typically recall them. Contrasting
the delinquent boys of rather low mental age
used in this study with Alper and Korchin's
college males and females, it might be expected
that the latter groups would be more apt to
use repression as an immediately available
defense, while the former would tend
to overt expression.

Meltzer has found individual differences
in the recall of pleasant and unpleasant
experiences; over half of his Ss recalled more of their
pleasant than their unpleasant experiences,
while 36 per cent of the Ss recalled more of
their unpleasant experiences (8). He spe


2. The 
the 
also 


represented not only a threatening
authority figure, but was also a white person.

It is not surprising then that the Negro boys
were “on their toes” and recalled all
emotionally toned (nonneutral) items, whether
favorable or not, more accurately than did
the whites in the first recall. This would be an
example of what Bruner and Postman have
termed “vigilance” in the area of perception.
The superiority of the Negro boys in the
immediate recall may be, at least in part, a
function of their superior perception of the
material.

On the other hand, with delay the principle
of “defense” tends to take over and the more
unpleasant aspects of the original perception
may be repressed. Thus, in the delayed recall
series, the Negroes forgot more of the
unfavorable items than did the whites, and recalled
more of the favorable ones. Neutral and
ambiguous items were in between.

Support is found for this explanation of our
results from Clark's study (6). In his
experiment girls showed better immediate recall
than boys for a paragraph dealing with the
meeting of a man and a dominant woman of a
rather unpleasant type. However, in a delayed
recall series two weeks later many of the items
which the girls recalled better in the original
series no longer discriminated between the
girls and boys. Many of these were items that
emphasized the aggressive attitude of the
woman and it is possible that in the meantime
such items had become threatening to the
girls and were repressed.

Functionalism in selective recall. The act
of recalling and forgetting should be considered
from the functional point of view, i.e., in
terms of the function which it performs for
the organism. If the functional approach is
kept in mind, we can explain the divergent
findings of other workers in this field. For

tions and asked them to rate the degree

It is not difficult to explain
the results of such experiments, but, and this
is a big but, the explanations offered, including
the ones in my own study, are ex post facto.
All we can hypothesize ahead of time is that
ego-involving factors will make a difference,
but we do not know enough yet about human
behavior to predict for any group of Ss in any
given circumstances whether positive and
negative ego involvement will lead to
sensitization or repression.

Perhaps Bruner and Postman's work on
perception (3, 5, 9) may provide some leads,
at least in the case of immediate recall. “So
long as the situation is not too threatening or
too exacting, avoidance of threatening stimuli
may be emotionally the most economical
response. But in situations which are highly
threatening and highly exacting, the most
adaptive response is frequently the one which
takes most ‘vigilant’ account of reality”
(5, p. 94). This principle may readily account
for the factors operating in immediate recall,
provided that we take into account individual
differences; however, our findings suggest
the hypothesis that, after the initial shock has
subsided, the extremely threatening material
will also tend to be repressed provided that it
is no longer essential for the person to remain
aware of this material in order to preserve
his ego.


ceptual recognition (4) may be a
distortions in memory; i.e., these distortions
from the S's


an explanation for


forgetting), repression, and suppression. The
strength of hypotheses is a function of many
factors, including familiarity, the S's
experimental set, his expectation derived from past
experience, his attitudes, values, and needs,
and their intensity. This intensity factor is
often all that is meant when the term “ego
involvement” is used.

If we are to advance our ability to explain
the mechanisms of selective recall, we need
more experiments in which some of the above
variables are systematically controlled. In
particular, we need to investigate what
conditions lead to some individuals' being
particularly sensitized to ego-involving material,
and what conditions lead to their readily
repressing and distorting such material in
immediate recall and over a period of time.


SUMMARY


Does ego involvement aid or hinder the
accurate recall of favorable and unfavorable
material? An experiment has been described
in which the recall of meaningful material
by Negro and white delinquent boys has been
compared. In an immediate recall series, the
Negro Ss recalled more items favorable and
unfavorable to Negroes than did the white
Ss. In a delayed recall series the Negro Ss
were even more superior in the recall of
favorable items, but not in the recall of unfavorable
items. There were no significant differences
between the groups in the number of
distortions of favorable or unfavorable items, but
on the whole the white Ss distorted more
items.

These results were interpreted as indicating
that in the immediate recall series the Negro
Ss were “vigilant,” and therefore sensitized
to the material, but that before the delayed
recall series they repressed the unfavorable
items. These results appeared to contradict
Alper and Korchin’s study; the differences
between the various studies may be explained
in terms of two factors: (a) differences in the
type of Ss used, and (b) differences in the
circumstances under which the experiment
was administered.

While this area of research abounds with
ex post facto explanations of results, there is
a great need for the formulation of hypotheses
for experimental confirmation. It was
suggested that the Bruner-Postman “tra
hypothesis-confirmation” approach to
perception might provide a valuable conceptual
scheme for the formulation of such hypotheses.


REFERENCES


1. ALPER, THELMA G., & KORCHIN, S. J. Memory for
socially relevant material. J. abnorm. soc.
Psychol., 1952, 47, 25-37.

2. BELMONT, LILLIAN, & BIRCH, H. G.
Re-individualizing the repression hypothesis. J. abnorm.
soc. Psychol., 1951, 46, 226-235.

3. BRUNER, J. S. Personality dynamics and the
process of perceiving. In R. R. Blake & G. V.
Ramsey (Eds.), Perception: an approach to
personality. New York: Ronald, 1951. Pp.
121-147.

4. BRUNER, J. S., & POSTMAN, L. On the perception
of incongruity: a paradigm. J. Pers., 1949, 18,
206-223.

5. BRUNER, J. S., & POSTMAN, L. An approach to
social perception. In W. Dennis (Ed.), Current
trends in social psychology. Pittsburgh: Univer.
of Pittsburgh Press, 1948. Pp. 70-119.

6. CLARK, K. B. Some factors influencing the
remembering of prose material. Arch. Psychol., 1940,
No. 253.

7. KOCH, HELEN L. The influence of some affective
factors upon recall. J. gen. Psychol., 1930, 4,
171-190.

8. MELTZER, H. Individual differences in forgetting
pleasant and unpleasant experiences. J. educ.
Psychol., 1930, 21, 399-409.

9. POSTMAN, L. Toward a general theory of cognition.
In J. H. Rohrer & M. Sherif (Eds.), Social
psychology at the crossroads. New York: Harper,
1951. Pp. 242-272. 
Postman, L., & SCHNEIDER, B. H. Personal values) 
visual recognition and recall. Psychol. Rev., 1951 
58, 271-284. 
Tarr, R. The influence of cultural background E 
the recall of meaningful material. Unpublished 
,master's thesis, Columbia Univer., 1941. 


Received September 22, 1952.




PREJUDICE AND DISCONTENT¹


CHARLES BIRD AND ELIO D. MONACHESI


University of Minnesota


IN RECENT years a number of investigations into prejudice or
ethnocentrism have led to the interpretation that persons who are
dissatisfied with their social and economic status tend to show more
aversion toward out-groups than persons who express a generalized form
of contentment (1, 2, 4). In line with emphases stressed by Freud (9),
it is assumed that discontent may be displaced and manifested as
antagonism toward minority groups: the scapegoat theory of prejudice
(12). There is, however, another aspect of displacement which has not
received as much consideration, even though Freud put much stress on
it (10, 11). Since prejudice is not always socially acceptable,
repression serves to allay anxiety unless flagrant violation of
approved conduct invites public censure. The repression is gained
through investing circumstances with some of the properties of the
frowned-upon activities, so that the individual is protected from
recognizing the actual source of his antipathy and simultaneously from
reactivating anxiety.

Freud clearly established that a number of neurotics vested apparently
trivial things with anxiety-provoking significance, thereby hiding
from themselves the important experiences underlying conflict.
Displacement, therefore, could be a process through which a
substitution represses conflictual elements which hold greater threat
to the individual than that represented by the overemphasized
trivialities. The process favors the recall of reasons which appear
logical in themselves; it masks important objections by giving undue
weight to associated phenomena.

¹ This is one of a series of studies on the broad problem of social
responsibility being conducted in the Laboratory for Research in
Social Relations, University of Minnesota, under a grant from the
Carnegie Corporation.

In planning an inquiry into various aspects of intergroup relations
among white and Negro residents of two adjacent areas in a northern
metropolis, an attempt was made to test the assumption that white
residents who disliked their neighborhood or their job would reveal
more prejudice against Negroes than members
of the same communities who liked their 
neighborhood or their job. A questionnaire 
administered verbally by trained interviewers 
to parents provided a prejudice score and data 
that afforded information about the discon- 
tent variables. The main questions comprising 
the questionnaire have been reported else- 
where (5, pp. 689-690), as well as the evidence 
indicating high validity of the schedule (6). 
The questions specifically raised to establish 
relationships between discontent and prejudice 
were as follows: Do you like or dislike living 
in this neighborhood? Why? What is your 
occupation? What kind of work do you do? 
What other full-time jobs have you held? 
Which of these jobs you have held did you 
like best? What kind of work would you really 
like to be doing now? The questions that referred
to occupations were addressed to men
only but those that concerned the neighborhood
were put to both men and women.

The white adults who comprised our sample
represented 145 families. There were 144
mothers and 128 fathers. They were selected
from two adjacent districts in which Negroes
were living under conditions of nonsegregation.
These two districts, designated as X and Y, 
were clearly differentiated by their physical 
properties. District X contains many more 
physical features considered undesirable in a 
residential section than district Y. District X 
is close to the main business section of the 
city. Its houses are relatively old and often 
contiguous to apartments and rooming houses. 
A large factory and a number of small business 
organizations exist within its confines; more- 
over, depressed railroad tracks bisect its 
southern tip. There are no playground facili- 
ties, except the school yard, and there is no 
park in the immediate vicinity. Although a
main thoroughfare and the railroad tracks one 
block north of it are all that divide districts 
X and Y, the latter area is a residential section 
free from factories and with playground facili- 
ties in a small park. Negro families reside in 
approximately equal numbers in both districts. 
A reasonable assumption, one which suggested a qualification of the
displacement hypothesis, is that the residents of district X have more
instigations to dislike it. The physical properties of this area may
be a more powerful negative force capable of masking the displacement
of ideas and affects of discontent belonging to a constellation of
antipathies toward Negroes. On the other hand, the more desirable
physical features of district Y may operate to alert white adults to
what they consider a continuing encroachment upon their prerogatives.


LIKING-DISLIKING NEIGHBORHOOD 

District X is disliked as a place in which 
to live by 41 per cent of the fathers and by 
55 per cent of the mothers, whereas in district 
Y the corresponding percentages are 28 and
24. Since we asked each person to give his
reasons for disliking the neighborhood, it is 
possible to substantiate the assumption that 
district X imposes upon its residents a greater 
force to dislike it because of its physical 
properties. In district X 28 per cent of the 
men and 32 per cent of the women specifically 
mentioned undesirable physical features of
the neighborhood in comparison with 18 per 
cent of the men and 15 per cent of the women 
in district Y. Although none of the residents 
of district Y complained about poor recrea- 
tional facilities for children, in district X 12 
per cent of the mothers and 19 per cent of the 
fathers did complain. Conversely, a considerably
larger proportion of the residents in
district Y said they liked the neighborhood 
because it was quiet, clean, not near factories, 
and close to a playground than was true for 
the respondents in the older residential area. 

Without regard for the questions that 
deal with liking the neighborhood or the job, 
we may note the mean scores on the question- 
naire for white men and women in the two 
areas. In district X the mean score for 68 men
was 18.15 with a standard deviation of 6.19,
and the mean score for 78 women was 17.53 ±
5.6. The 60 men of district Y achieved a mean
score of 18.00 ± 6.51, whereas the 66 women
had a mean score of 19.14 ± 6.53.

What attitudes are indicated by these mean 
scores? An analysis, undertaken to validate 
the questionnaire (6), supports the interpretation that such mean
scores reflect more rejection of Negroes than acceptance of them.
Their range is from 17.53 to 19.14. What kind of gratuitous comments,
that is, comments not specifically sought by the structure of the
items in the questionnaire, are made by clusters of respondents from
the total sample of 272 white parents whose mean scores are comparable
to those of the men and women in the two districts? A score of 18.50
characterizes white parents who insist that their children must not
get too friendly with Negroes; a score of 17.86 typifies white
respondents who express a fear of marriage possibilities between their
children and Negroes, parents who say they would allow children to
play only when young with Negroes; and a score of 17.33, which is
almost identical with the mean score of women in district X, is made
by men and women who say they will allow their children to play with
Negroes only when in groups having a predominantly white membership.
It follows, therefore, that group comparisons about to be made involve
more people with antipathy toward than love for Negroes.

The mean scores on the questionnaire of men and women who either like
or dislike their neighborhood are presented in Table 1.

Although each appropriate comparison shows that aggregates of persons
who like the neighborhood achieve higher scores, or are less
prejudiced, than those who dislike the neighborhood, the criterion
differentiates groups of women from each other but does not
differentiate groups of men at an acceptable level of confidence.
Women in district X who dislike the neighborhood have a mean score
which differs from that of women who like the neighborhood at the .05
level of confidence. A similar comparison between the women of
district Y yields a difference which is significant at less than the
.01 confidence level. The men in district Y who dislike the
neighborhood achieve mean scores indicative of more prejudice than men
holding the same evaluation in district X, but the magnitude of the
differences merely suggests a tendency for men who dislike living in
the better residential section to manifest more prejudice than men in
the less desirable area.


TABLE 1
COMPARISON OF MEAN SCORES ON QUESTIONNAIRE OF MEN AND WOMEN WHO LIKE
OR DISLIKE NEIGHBORHOOD

                     LIKE                   DISLIKE
GROUP            N    Mean    SD        N    Mean    SD
District X
  Men           40   18.40   6.07      28   18.07   5.0
  Women         35   18.89   5.10      43   16.37   5.75
District Y
  Men           43   18.72   6.45      17   16.12   6.77
  Women         50   20.34   6.32      16   15.25   5.5
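The comparison reported for the women of district X can be rechecked from the summary values in Table 1 alone. The article does not say which significance test was used, so the sketch below assumes an ordinary two-sample Student's t with pooled variance; the function name `pooled_t` is hypothetical and chosen only for illustration.

```python
import math

def pooled_t(n1, mean1, sd1, n2, mean2, sd2):
    """Student's t for two independent groups, from summary statistics.

    Assumes equal population variances (pooled-variance form). This is an
    assumption; the article does not name its test.
    """
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    std_err = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / std_err, df

# Table 1, district X women: like (N=35) versus dislike (N=43).
t, df = pooled_t(35, 18.89, 5.10, 43, 16.37, 5.75)
print(f"t = {t:.2f} on {df} degrees of freedom")
```

With 76 degrees of freedom the two-tailed .05 critical value of t is roughly 1.99, so a t slightly above 2 is in line with the statement that the difference is significant at the .05 level.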

Not one of the 272 white parents referred 
to Negroes when expressing a liking for the 
neighborhood, but eight women and eight 
men said they disliked it because of Negro 
infiltration. All but one of these clearly ethno- 
centric persons achieved a score below the 
mean of all men and women respondents con- 
sidered apart from the criterion of liking the 
neighborhood. The eight women who expressed 
marked dislike of Negroes in the question 
under review are among the most prejudiced 
persons in the group of women studied, for
only two had a score as high as 12 and the rest 
had scores of 10 and below. Of the eight men 
who expressed direct antagonism, three had 
scores below 12, four had scores between 13 
and 16, and one a score of 18. We may note 
that the actual range of scores for all men 
and women in the total sample was from 7
to 31 points. 

Since the scoring scheme used assessed a value of minus two points to
any person who openly expressed antagonism to Negroes as a reason for
disliking the neighborhood, the data in Table 1 reflect the weight of
this procedure. In view of the lack of statistical significance in the
differences between the mean scores of men who say they like or
dislike the neighborhood, we have not thought it necessary to make a
correction for the assessment incurred by the eight men who made
derogatory remarks about Negro neighbors. When a restoration of two
points is made to the scores of the eight women who express open
dislike of the neighborhood because Negroes live in it, the mean
scores of all women who dislike either neighborhood change very
slightly. Women in district X who like their neighborhood now differ
from women who dislike it at a confidence level between .05 and .10,
but a corresponding comparison between women in district Y still
yields a difference in mean scores which is significant at less than
the .01 level of confidence.
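The size of the change produced by restoring the two points follows from simple arithmetic: adding a constant to k of n scores raises the mean by that constant times k/n. The sketch below is an illustration only. It takes k = 4 for district X from the "Derogatory comments about Negroes" row of Table 2 and assumes all four of those women belong to the group of 43 who dislike the neighborhood; the article does not report the adjusted means, and the function name is hypothetical.

```python
def mean_after_restoration(n, mean, k, points=2.0):
    """Mean of n scores after adding `points` to k of them."""
    if not 0 <= k <= n:
        raise ValueError("k must lie between 0 and n")
    return mean + points * k / n

# District X women who dislike the neighborhood: N = 43, mean = 16.37
# (Table 1). k = 4 is an assumption taken from Table 2.
adjusted = mean_after_restoration(n=43, mean=16.37, k=4)
print(f"adjusted mean = {adjusted:.2f}")
```

Under these assumptions the mean rises by less than a fifth of a point, which fits the description of the change as very slight.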
There is the possibility that women, who must spend a great part of
their time at home, are more sensitized than men to irritations
induced within the neighborhood. Having fewer opportunities than men
to express frustrations outside of the home, women who dislike their
neighborhood may be inclined to be sensitive also to what they
consider an encroachment by Negroes upon their prerogatives. As we
have emphasized already, women who live in a more desirable area most
often like it, but those who do not comprise the more
prejudiced group of women. It is recognized
that only an association between one kind of 
satisfaction and prejudice has been estab- 
lished. There is the possibility that discontent 
with living conditions and prejudice are both 
anchored to nuclear personality structures, 
or as Mussen (13) has indicated, to aggressive 
and dominance needs. 

The reader may wish to know whether agreement exists between parents
about liking the neighborhood. Eighteen persons out of the 272 had
been deprived of their spouse. Of the 254 remaining, 63 husbands and
wives agreed in liking the neighborhood, 32 husbands and wives
expressed dislike, and 32 pairs disagreed with each other. Husbands
and wives who showed no dissatisfaction with the neighborhood had mean
scores of 18.81 and 18.97, respectively; and husbands and wives who
disliked the neighborhood had mean scores of 16.22 and 16.5,
respectively. The difference between the mean scores of men holding
opposite evaluations of the neighborhood is significant between the
.05 and .10 level of confidence; correspondingly, the difference in
the mean scores of women is significant between the .02 and .05 level
of confidence. Disagreement within the family regarding the
neighborhood often indicates that the spouse who likes it has a higher
score than the one who does not; hence, in the aggregate, high and low
scores tend to balance one another. Where family disagreement exists,
32 pairs, the mean score for husbands is 18.6 and the mean score for
wives is 18.3.

In Table 2 we have provided the mean prejudice scores on the
questionnaire together with the reasons given for evaluating the
neighborhoods. For both men and women in district X, alertness to
desirable or undesirable physical properties, such as quietness and
cleanliness, or their opposites, does not clearly indicate attitude
toward Negroes, although the direction of the differences suggests
that discontent with physical features of the environment tends to go
along with greater prejudice.


TABLE 2
MEAN SCORES ON QUESTIONNAIRE OF MEN AND WOMEN EXPRESSING MAJOR REASONS
FOR LIKING AND DISLIKING THE NEIGHBORHOOD

                                                 DISTRICT X
                                          MEN                 WOMEN
REASONS GIVEN                         N   Mean   SD       N   Mean   SD
Physical properties induce
  acceptance                         10  20.20  6.23      5  19.60  6.09
Physical properties induce
  rejection                          19  17.00  5.70     25  16.20  5.99
Desirable neighbors                  16  17.44  6.14     11  16.09  3.72
Undesirable neighbors (no
  reference to Negroes)               2  12.00  0.00     13  17.00  4.57
Close to school and playground        5  14.80  4.12     11  20.64  5.68
Poor recreational facilities for
  children                            8  18.75  2.00     15  17.00  6.56
Negative comments about children      4  16.25  3.27     10  14.00  5.27
Derogatory comments about Negroes     3  13.33  3.30      4  10.75  2.04
Personal convenience                 27  17.37  5.15     21  18.76  5.34 | 26

In district Y, men who dislike the physical 
properties of the neighborhood are only 
slightly more prejudiced than men who like
the physical properties. A similar comparison 
involving women, however, supports the 
hypothesis that, among women, discontent 
with a generally desirable residential area, 
one including Negro residents, is indicative 
of prejudice toward Negroes. The difference 
between the mean scores of women who men- 
tioned like or dislike of physical properties is 
significant between the .05 and .10 level of 
confidence. In this same district only six men 
and four women consider the neighborhood 
undesirable because they do not like some of 
their neighbors. These persons are definitely
prejudiced against Negroes. Comparisons
made between them and the respondents who 
like their neighbors yield differences in mean 
scores which are significant at levels of con- 
fidence between .05 and .10 for men and .01 
and .02 for women. None of these respondents 
referred to Negroes as the neighbors liked or 
disliked. 

More women in district X express dislike of neighbors than women in
district Y; but the former women, as a group, are not characterized by
as marked prejudice toward Negroes. The difference of 5.5 between the
means, with a level of confidence between .02 and .05, indicates that
women in district X generally meant what they said, namely, they
disliked white neighbors, but that the four women in district Y were
evasive and probably meant they disliked Negro neighbors. Women in
district X who claim to like their neighbors, however, are more
prejudiced than the women in district Y who like their neighbors; the
difference of 4.34 in mean scores is significant between the .02 and
.05 level of confidence. The women in the preferred residential area
who like their neighbors are usually the least prejudiced of their
group, but the same cannot be said for women expressing similar
opinions in the less preferred district.

Men and women in both districts who dislike the neighborhood because
some of the children are said to be rough, tough, or nasty generally
have scores indicative of marked prejudice. In both areas, persons who
like their neighborhoods because they are close to work,
transportation, or shopping centers mirror the distribution of scores
of the 272 persons interviewed.


LIKING-DISLIKING PRESENT JOB 


The second potential criterion of discontent, 
liking or disliking the present job, could be 
applied only to men, since all of the women 
in both districts were caring for one or more 
children and, generally, were not working outside
of the home.

In securing responses concerned with job
satisfaction, the interviewers asked the men 
which of all the jobs they had held they liked 
best. It should be apparent that a man could 
like his present job best and yet aspire to a 
different and higher level job. We have com- 
puted relationships between scores on the 
questionnaire and level of aspiration, but none 
of them yields a difference meeting a satis- 
factory statistical level of confidence. The men 
in the two districts, however, do differ in their 
level of aspiration: in district X 30 men, or 
44 per cent of the group, and in district Y 41 
men, or 68 per cent of the group, prefer to be 
doing some other kind of work than they have 
previously engaged in. More men in district
X, the physically less desirable area, are employed
in jobs requiring high degrees of skill
than in district Y (5).
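The two percentages quoted above can be reproduced from the counts, and the gap between them can be gauged with a two-proportion z test. The article reports no test for this comparison, so the sketch below is an illustration only: the choice of a pooled two-proportion z test and the function name are assumptions, while the group sizes of 68 and 60 men are those given earlier for districts X and Y.

```python
import math

def two_proportion_z(x1, n1, x2, n2):
    """Pooled two-proportion z test.

    Returns both sample proportions, the z statistic for p2 - p1, and a
    two-sided p value from the normal approximation.
    """
    p1, p2 = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p2 - p1) / std_err
    p_two_sided = math.erfc(abs(z) / math.sqrt(2))
    return p1, p2, z, p_two_sided

# District X: 30 of 68 men prefer other work; district Y: 41 of 60 men.
p_x, p_y, z, p_value = two_proportion_z(30, 68, 41, 60)
print(f"X: {p_x:.0%}  Y: {p_y:.0%}  z = {z:.2f}")
```

The normal approximation is reasonable here because each group contains well over ten men in both response categories.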

The mean score on the questionnaire of men in district X who like
their job and prefer no other was 19.24; contrariwise, the men in the
same district who prefer another job had a mean score of 17.00. A
similar comparison for the men in district Y shows that men who like
their job and prefer no other have a mean score of 18.63; in contrast,
the mean score for men who prefer another job was 17.66. Although the
differences between contrasted groups of men are not statistically
significant, their direction indicates a slight tendency for men who
are completely satisfied with their work to be less prejudiced than
those with some degree of job dissatisfaction. If we ignore the areas
in which the men reside and compare all of the men who like their jobs
and prefer no other with all of the men who prefer another job, we
find the former group have a mean score of 19.04 and the latter group
a mean score of 17.38. The difference between these two mean prejudice
scores has a confidence level between .10 and .20.

Table 3 provides comparisons between men who like and men who dislike
the job they are now doing and men who would rather be doing a job
they have previously held.


TABLE 3
MEAN SCORES ON QUESTIONNAIRE OF MEN WHO LIKE OR DISLIKE THEIR PRESENT
JOBS OR M… WHO LIKE OR DISLIKE THEIR NEIGHBORHOOD

                                      DISTRICT X          DISTRICT Y
CATEGORIES COMPARED                 N   Mean   SD       N   Mean   SD
Like present job                       18.37  5.89         19.57  6.36
Dislike present job                    17.95  6.08         12.23  2.61
Like neighborhood and present job      18.70  6.27         20.14  6.27
Dislike neighborhood and dislike
  present job                       9  18.44  6.77      5  11.80  3.31
Like neighborhood and dislike
  present job                      10  17.50  5.35      8  12.50  2.00
Dislike neighborhood and like
  present job                      19  17.84  5.20     12  17.92  6.33

Job satisfaction or dissatisfaction is unre- 
lated to prejudice against Negroes for the men 
in district X. Even a combination of two
varieties of satisfaction, liking the neighbor- 
hood and liking the present job, does not 
differentiate the men from those falling in any 
other combination involving dislike of one or 
the other variable, or dislike of both of them. 
But the men in district Y, the physically 
preferable neighborhood, are differentiated by 
job satisfaction in relation to prejudice toward 
Negroes. The simple criterion of liking or not
liking the present job yields a difference in 
mean scores on the questionnaire which is 
significant at less than the .01 level of con- 
fidence. A concentration of satisfaction, such 
as liking both the neighborhood and the job, 
is paralleled by a mean score of 20.14; in contrast,
dissatisfaction with the neighborhood
and job, admitted by five men only, yields the 
lowest mean score in Table 3, namely 11.80. 
Again the difference between these two groups
is significant at less than the .01 confidence
level.

For men in district Y there is a possibility
that dislike of the job is associated with greater
prejudice than dislike of the neighborhood.
The last two categories in Table 3, represented
by 8 and 12 men, show that men who dislike
the neighborhood but like their present job
have a mean score on the questionnaire which 
is almost identical to the score of all the men 
in this district; but men who dislike the pres- 
ent job and like the neighborhood are among 
the most prejudiced men in the district. The 
difference between the two mean scores meets 
a level of confidence between .02 and .05. 
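The two groups compared here have very different standard deviations in Table 3 (2.00 for the 8 men, 6.33 for the 12 men), so the outcome depends somewhat on the form of t test. The article does not say which form was used. The sketch below, with hypothetical function names, computes both the pooled-variance t and Welch's unequal-variance t from the published summary values so the two can be set side by side.

```python
import math

def pooled_t(n1, m1, s1, n2, m2, s2):
    """Student's t with pooled variance; returns (t, degrees of freedom)."""
    df = n1 + n2 - 2
    var = ((n1 - 1) * s1 ** 2 + (n2 - 1) * s2 ** 2) / df
    return (m1 - m2) / math.sqrt(var * (1 / n1 + 1 / n2)), df

def welch_t(n1, m1, s1, n2, m2, s2):
    """Welch's t with Welch-Satterthwaite degrees of freedom."""
    v1, v2 = s1 ** 2 / n1, s2 ** 2 / n2
    df = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))
    return (m1 - m2) / math.sqrt(v1 + v2), df

# Table 3, district Y: (N, mean, SD) for each group.
dislike_nbhd_like_job = (12, 17.92, 6.33)
like_nbhd_dislike_job = (8, 12.50, 2.00)

t_pooled, df_pooled = pooled_t(*dislike_nbhd_like_job, *like_nbhd_dislike_job)
t_welch, df_welch = welch_t(*dislike_nbhd_like_job, *like_nbhd_dislike_job)
print(f"pooled: t = {t_pooled:.2f}, df = {df_pooled}")
print(f"Welch:  t = {t_welch:.2f}, df = {df_welch:.1f}")
```

With 18 degrees of freedom the two-tailed critical values of t are about 2.10 at the .05 level and 2.55 at the .02 level, so the pooled form falls within the interval the text reports, while the Welch form yields a larger t on fewer degrees of freedom.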


SUMMARY AND INTERPRETATIONS 

Two groups of parents, living in adjacent 
neighborhoods characterized by markedly 
different physical features, made approxi- 
mately equal mean scores on a questionnaire 
designed to measure prejudice toward Negroes. 
An analysis of these mean scores, utilizing 
categories related to liking or disliking the 
neighborhood, and, for men, liking or disliking 
their job, showed considerable differences 
among the respondents in each neighborhood 
and between the respondents in the two 
neighborhoods. Women who like their neighborhood
achieve mean scores on the questionnaire
significantly higher, or indicative of
greater tolerance, than women who dislike 
their neighborhood. The contrasts yield 
greater differences between the two groups of 
women in the better residential area than
between the two groups of women in the less 
desirable residential district. Dislike of the 
neighborhood, when expressed by men, does 
not as clearly denote prejudice toward Negroes,
even though the differences among mean
scores are in the same direction as those of
women who share comparable opinions. Hus- 
bands and wives who are in agreement about 
liking the neighborhood were found to show 
less prejudice than husbands and wives in 
agreement about disliking it, a finding which 
supports one made previously (3). 

An analysis of the specific reasons for liking 
or disliking the neighborhood indicates that 
certain replies may be used as indices of 
prejudice. Women in the better residential 
area who evaluate the neighborhood as physi- 
cally desirable or as undesirable differ from 
each other in prejudice to a fairly marked 
degree, but men do not. However, this criterion reveals that, in both
districts, men and women who dislike the physical properties of the
neighborhood achieve lower mean scores on the questionnaire than men
and women who like them. In the preferred residential area, men and
women who like their neighbors achieve higher mean scores, or are more
tolerant, than men and women who dislike their neighbors. Comparable
contrasts for residents of the less desirable district permit no such
simple conclusion. Although a considerably larger proportion of the
women in district X than in district Y manifest dislike of neighbors,
the former group of women are not markedly prejudiced against Negroes.
They seem to be oriented toward the rejection of white neighbors. In
contrast, fewer women in district X find neighbors sufficiently
agreeable to evoke a liking of the neighborhood than do the women of
district Y. This smaller group of women generally achieve scores below
the mean of all women in either district. A logical interpretation is
that these women are masking a dislike of Negroes by overemphasizing
the appealing qualities of white neighbors; or they are exhibiting
reaction formation. In both districts, men and women who admit frankly
that they dislike the neighborhood because Negroes live in it are
assigned properly by the questionnaire to the category of marked
prejudice. The large group of respondents that regards the
neighborhood as personally convenient does not provide any cue about
their attitudes toward Negroes.

Men in the less attractive district are more likely to be satisfied
with their present job or to have a more realistic level of aspiration
than men in the more attractive area. The level of aspiration, often
referred to by respondents as beyond reach, is not a satisfactory
index of prejudice, even though the more satisfied men in both
districts make higher scores on the questionnaire. Contentment or
dissatisfaction with the job in comparison with all jobs held, for men
in the physically less desirable area, did not bear any marked
relationship to prejudice; nor did any combination of liking the job
and the neighborhood, or disliking the job and the neighborhood. Quite
the contrary was true for men in the better residential district. The
trend of the evidence based on job satisfaction, when compared with
satisfaction with the neighborhood, suggests that for men job
satisfaction is the better index of the direction of racial attitudes.

The consistent trends related to the physical 
properties of the districts emphasize the necessity
for considering the environmental field
when investigating prejudice. Although we 
must be concerned with the meaning the 
environment holds for an individual, the 
physical environment plays an important part 
in orienting the attitudes of people. 

Only eight men and eight women, out of 
the 104 men and women who disliked their 
neighborhood, frankly mentioned Negroes as 
causes of discontent. Undoubtedly, some of 
the reasons for disliking a neighborhood must 
be taken at face value. Often, however, the 
reasons given for disliking a neighborhood, 
associated as they have been with indices of 
prejudice toward Negroes, may be regarded 
as substitutions of trivial memories for the 
more fundamental core of prejudice. To admit
prejudice might arouse feelings of guilt
and associated anxiety; hence, more socially 
acceptable reasons were given. As Dollard 
and his co-workers have cautioned (8), data 
bearing upon the displacement hypothesis 
may be regarded as evidence supporting it 
rather than as proof of it. At least, the dis- 
placement hypothesis generalizes and makes 
more meaningful the data of this study. 


REFERENCES

1. Ackerman, N. W., & Jahoda, Marie. Anti-Semitism and emotional
   disorder. New York: Harper, 1950.

2. Adorno, T. W., Frenkel-Brunswik, Else, Levinson, D. J., & Sanford,
   R. N. The authoritarian personality. New York: Harper, 1950.

3. Bellow, Barbara, et al. Prejudice in “Seaside”; a report of an
   action-research project. Hum. Relat., 1947, 1, 98-120.

4. Bettelheim, B., & Janowitz, M. Dynamics of prejudice. New York:
   Harper, 1950.

5. Bird, C., Monachesi, E. D., & Burdick, H. Infiltration and the
   attitudes of white and Negro parents and children. J. abnorm. soc.
   Psychol., 1952, 47, 688-699.

6. Bird, C., Monachesi, E. D., & Burdick, H. Studies of group
   tensions: II. The validation of a questionnaire by means of
   voluntary statements. Hum. Relat., 1953, 6, 99-111.

7. Deutsch, M., & Collins, Mary E. Interracial housing; a
   psychological evaluation of a social experiment. Minneapolis:
   Univer. of Minnesota Press, 1951.

8. Dollard, J., Doob, L. W., Miller, N. E., Mowrer, O. H., & Sears,
   R. R. Frustration and aggression. New Haven: Yale Univer. Press,
   1939.

9. Freud, S. Civilization and its discontents. London: Hogarth, 1930.

10. Freud, S. Collected papers. Vol. II. London: Hogarth, 1948.

11. Freud, S. Collected papers. Vol. V. London: Hogarth, 1950.

12. Lindzey, G. An experimental examination of the scapegoat theory of
    prejudice. J. abnorm. soc. Psychol., 1950, 45, 296-309.

13. Mussen, P. H. Some personality and social factors related to
    changes in children's attitudes toward Negroes. J. abnorm. soc.
    Psychol., 1950, 45, 423-441.


Received September 22, 1952. 




IMAGERY AND SUGGESTIBILITY: A TEST OF THE ARNOLD HYPOTHESIS¹


WILL. N. McBAIN


McGill University


WHAT is the basis of the relatively
high relationship consistently ob- 
tained by Hull (15) and others 
between measures of "prestige" suggestion 
and of susceptibility to hypnosis? In 1946 
Arnold (1) suggested a mechanism which she 
believes to be common to both. In both cases, 
she believes, the results are attributable to 
ideo-motor action, and the magnitude of ob- 
tained scores is a function of the extent to 
which the subject forms and concentrates 
upon images arising from the stimulus situa- 
tion. The purpose of the present study was to 
secure evidence concerning two predictions 
which were derived from the assumption of 
such a process.²
Arnold's hypothesis is designed to make explicit the
“machinery of automatism” by which suggestions are
translated into action without the subjective experience
of “willing.” It holds that the goal-directed activity in
which the subject engages when suggestion is brought to
bear upon him is not “... striving to behave like a
hypnotized person” as White (21, p. 483) has stated, but
“... striving to focus on the situation the experimenter
describes and to imagine himself in it” (8, p. 117). It is
an observed and verified fact that instructions to imagine
movements in various body members are almost invariably
followed by muscular movements in those members,


¹ This paper is based upon a thesis submitted in
partial fulfillment of the requirements for the degree of
Master of Arts in the department of philosophy and
psychology at the University of British Columbia (18).
The writer wishes to acknowledge his indebtedness to
Professors D. C. G. McKay, E. I. Signori, and E. S. W.
Belyea of that department, and to Professor G. A.
Ferguson of McGill University, for advice and criticism
in the completion of the original thesis. It is a pleasure
to express appreciation for the assistance given in the
preparation of the present paper by Dr. E. E. Ghiselli
and other faculty members and fellow students in the
department of psychology at the University of California,
Berkeley.

² Eysenck's (9, p. 196) independent and almost
simultaneous publication of a notion similar to Arnold's
gave additional impetus to this study. The speculations
of Jenness (17) may also be noted. However, the present
study is concerned specifically only with Arnold's
work.


detectable in some cases only with delicate 
electronic instruments (16). Such behavior 
is commonly called ideo-motor action and is 
often observed in the presence of what Hull 
calls a “prestige” suggestion stimulus. Arnold 
contends that an induced imaginative process 
forms the basis of this type of phenomenon.
Only as movements are imagined (or, more 
precisely, imaged) will they take place. 

While such movements are often difficult 
to detect in the “waking” state, the relaxed 
condition usually considered a prerequisite to 
obtaining the hypnotic state lessens various 
inhibitory influences. This tends to increase 
movement, and its perception enhances the 
imaginative process. The resulting “feedback”
eventually leads to the “automatic” responses
characteristic of hypnotic behavior. 

Of the several predictions made by Arnold,
this study attempted to secure evidence concerning
only two. These will be referred to as
the “sway hypothesis” and the “imagery
hypothesis.”

The sway hypothesis. Using a factor analytic technique to
analyze the results of 14 tests which have been said to
measure the influence of suggestion, Eysenck (9, p. 171)
separated out three distinct types. Typical of these are
the Hull Sway Test (14), the Binet Progressive Lines Test
(15), and procedures which have been claimed to
demonstrate “social suggestibility.” Examples of the last
are given by Asch (2), who rejects the adequacy of such an
“explanation.” Eysenck found scores of only the first type
(i.e., ideo-motor) to correlate significantly with
hypnotizability, and this relationship has been amply
confirmed by other experimenters (3, 10, 20). Hence it
seems reasonable that Arnold should consider the sway test
as a measure of suggestibility when developing a hypothesis
concerning a mechanism common to both suggestion and
hypnosis. In this test, she believes, the suggestion “you
are falling” is reacted to only as the individual imagines
himself falling, and so predicts that: “We should expect a
still higher correlation between hypnotizability and
suggestibility as measured by the sway test if the subjects
were asked to think of falling forward instead of giving
them the suggestion ‘you are falling forward,’ which he
may or may not believe and therefore may or may not
visualize” (1, p. 118).

It should be noted that this prediction is 
not concerned with the amount of the sway 
or of hypnotizability, but only with the in- 
crease in the relationship between them. The 
sway hypothesis which is derived may be 
stated: 

If the members of two comparable groups
are subjected to a sway test, the first being 
given the standard Hull “falling” instructions, 
while the second are asked to imagine them- 
selves falling as vividly as possible, the scores 
of the second group will correlate with meas- 
ures of their hypnotizability to a degree sig- 
nificantly greater than will the scores of the 
first group. 

The imagery hypothesis. If the occurrence of hypnotic
phenomena is dependent upon imagery, then the ability of
the individual to imagine becomes basic to, and a limiting
factor upon, his ability to become hypnotized. Hence Arnold
further predicts “... that only those subjects who can
imagine sharply and well in the waking state will have the
necessary vividness of imagination and the necessary
concentration to make them good subjects for hypnosis”
(1, p. 118). Such a statement would seem to make goodness
of imagery a necessary but not a sufficient condition for
the attainment of the deeper stages of hypnosis, and hence
it might be expected that some poor hypnotic subjects might
show good ability to image. A hypothesis specific to this
study may be stated:

For a particular population there will be 
found a significant correlation between scores 
representing goodness of imagery, and those 
representing hypnotizability, and in particu- 
lar, those subjects who prove most hypnotiz- 
able will secure imagery scores significantly 
higher than those subjects who prove to be
least hypnotizable. 


METHOD 
Measuring Devices 


Before setting up specific experimental procedures it
was necessary to decide upon means of obtaining
objectively determined scores which could be used as
indices of suggestibility, hypnotizability, and imagery
ability.

Index of ideo-motor suggestibility. This index was
obtained from the Hull Sway Test. The apparatus used
was functionally similar to Hull's, but made use of
electric timing and of pen recording. Because Edwards
(8) failed to find significant differences between sway at
head and hips, the thread which recorded sway was
fastened for convenience to the rear of the subject's
(S's) blindfold rather than to his clothing. The posture
used (erect, with both heels and toes together) was
that found by Fearing (11) to produce maximum sway,
while instructions to relax and adopt an easy, natural
position were a precaution taken to minimize negative
(backward) sway, which Hull found to increase when S
was tense. Such negative sway is difficult to interpret.
Eysenck (9, p. 181) has shown that, while instructions
spoken by the experimenter (E) are likely to produce a
greater amount of sway, recorded instructions give
more consistent results. For this reason all instructions
given during the sway tests were wire-recorded.

³ A detailed description of experimental procedures
and scoring techniques, as well as of the apparatus
used, is given in the original thesis, which is available
by interlibrary loan.

Each sway test had a duration of two minutes. Dur- 
ing the first 30 seconds of this period no verbal stimula- 
tion was used. On the kymograph record the median 
position during this “normal” period was determined 
and taken as a base line. The maximum deviation from 
this base line in inches of actual body sway during the 
one and one-half minutes when verbal stimulation was 
used was taken as the raw sway score, forward sway 
being counted as positive and backward as negative. 
Using the actual height at which the recording thread 
was attached, each raw sway score was corrected to 
that value which would have resulted had the thread 
been attached at a uniform height of five feet. 
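The height correction can be illustrated with a short sketch. The text does not give the formula used, so the version below rests on an assumption: that displacement grows in direct proportion to height above the floor, as it would for a rigid body pivoting at the ankles. The function and constant names are hypothetical.

```python
REFERENCE_HEIGHT_IN = 60.0  # the uniform five-foot attachment height named in the text


def corrected_sway(raw_sway_in: float, thread_height_in: float) -> float:
    """Rescale a raw sway score (inches) to the reference attachment height.

    Assumption, not stated in the paper: sway is proportional to the height
    at which the thread is attached (rigid body pivoting at the ankles).
    Forward sway is positive and backward sway negative, as in the text.
    """
    if thread_height_in <= 0:
        raise ValueError("thread height must be positive")
    return raw_sway_in * REFERENCE_HEIGHT_IN / thread_height_in
```

Under this assumption a thread attached above five feet yields a corrected score smaller than the raw score, and one attached below five feet yields a larger one.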

Corrected sway up to and including 10 inches was 
ranked in accordance with amount of sway, while 
sway over 10 inches was arbitrarily counted as a “fall.” 
“Falls” were ranked according to the unexpired test
period time following the point on the record at which
sway exceeded 10 inches. Thus, an individual whose
sway exceeded 10 inches at exactly the end of the one 
and one-half minute test period would be ranked im- 
mediately following an individual whose sway just 
reached the allowable maximum of 10 inches, while an 
individual who fell immediately at the onset of verbal 
stimulation would be given the most extreme ranking. 
The individual's ranking was taken as his suggestibility 
index (SI) for that test. 
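The ranking rule just described can be sketched as follows. This is a minimal illustration rather than the original scoring procedure: the record layout and names are hypothetical, rank 1 is assumed to denote the least suggestible S, and ties (which the text does not discuss) are left in input order.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple

STIMULATION_S = 90.0  # one and one-half minutes of verbal stimulation


@dataclass
class SwayRecord:
    subject: str
    max_sway_in: float                    # corrected sway; forward +, backward -
    fall_time_s: Optional[float] = None   # seconds after onset of stimulation at
                                          # which sway first exceeded 10 inches


def _sort_key(rec: SwayRecord) -> Tuple[int, float]:
    if rec.fall_time_s is None:
        # Non-falls are ordered by amount of sway.
        return (0, rec.max_sway_in)
    # Falls follow all non-falls and are ordered by unexpired test time.
    return (1, STIMULATION_S - rec.fall_time_s)


def suggestibility_ranks(records: List[SwayRecord]) -> Dict[str, int]:
    """Return each subject's rank (1 = least sway, highest = earliest fall)."""
    ordered = sorted(records, key=_sort_key)
    return {rec.subject: rank for rank, rec in enumerate(ordered, start=1)}
```

An S who falls exactly at the end of the period has zero unexpired time and therefore ranks just above an S who swayed the full 10 inches, as the text requires.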

Index of hypnotizability. Friedlander and Sarbin (12) 
have published a scale designed to give an objective and 
reliable (test-retest, about .80) index which takes into 
account both the depth of hypnosis attained by S 
and the amount of hypnotic stimulation required to 
reach that depth. Scores may range from zero, which 
indicates no apparent influence, to 20 indicating maxi- 
mum observable susceptibility. The score obtained 
from the administration of this scale was taken as the
S's hypnotizability index (HI). 

The conditions of the present experiment required 
some changes in wording, and additional instructions 
were added to alleviate anxiety in Ss who were unable 
to open eyes, bend arms, etc., when challenged to do 
so. A scoring change was made to allow for cases for 
which it was difficult to determine the exact time of 
eye closure. These changes were made during pretesting,
so that all data collected were comparable.


Index of imagery. A test of Arnold's hypothesis calls
for some method of objectively quantifying the
individual's ability to image in at least two modalities.
Since no suitable technique seemed available, it was
decided to develop methods yielding scores that
depended mainly on recall in the most relevant
modalities, visual and kinesthetic.


Index of visual imagery. This index was obtained
from a performance based upon the Memory for Designs
Test used by Terman and Merrill in the 1937
revision of the Stanford-Binet Scale, form L, year IX-3
(19). In addition to their two designs there was used a
third, rather more complicated one (Fig. 1). The original
was on standard 8½ × 11 white paper, and the designs
were not numbered. The test was administered with the
original time limit and instructions.

The Terman method of scoring seemed to provide 
too little room for representing various gradations of 
performance. Accordingly a method was developed on 
a preliminary group of Ss, which, when used on a sample 
of performances from the experimental groups, gave 
scores which agreed reasonably well with rankings by 
an independent observer of general “goodness of 
reproduction.” In this method each design was assigned 
10 points, and penalties were subtracted for various 
formal deviations from accurate reproduction. The 
sum of the remaining points for each of the three designs 
gave S’s index of visual imagery, with a possible maxi- 
mum score of 30. 

As a possible additional index of visual imagery, the
Paper Cutting Test as described by Terman and Mer- 
rill for S.A. III-4 (19) was also administered. 


An index of kinesthetic imagery, designed to secure
scores analogous to visual imagery scores, was
developed from what is termed the Progressive Finger
Tracing Test.

An apparatus was constructed which consisted of 
two 24-inch square pieces of }4-inch plywood, hinged 


together at the top. A slot 4 inch wide was cut from
the hinged front piece (or guideboard) to the pattern
and dimensions illustrated in Fig. 2. The rear piece
(or scoring board) was mounted rigidly on the wall so
that the center of the board was at shoulder height 
for the average S when seated. At the shaded points 
numbered from one to five, }¢-inch holes were drilled 
to accommodate metal pins to which wooden blocks 
of the same width as the slot were affixed to serve as 
temporary stops in the slot pattern when the two 
boards were together and position indicators when the 
front board was raised. Also on the rear board there
was traced a pattern, the center line of which was
congruent with that of the slot on the board in front,
but with a width of 1½ inches. Five circles of 2-inch
radius were drawn with their centers on the points at
which the finger was stopped by the wooden blocks on
the right-hand side of the board.

Tests were given with S blindfolded and seated with
the shoulder of his preferred arm opposite the center
line of the board. After instructions, the curtain
concealing the board was drawn. With two blocks inserted
symmetrically at position one, the forefinger was
pressed against the left-hand block, and S was
instructed to trace the groove through to its end. A
retracing was performed in the same manner. Then the
hinged guideboard was lifted out of the way, the
forefinger again pressed against the block, and S was
instructed to retrace the pattern as accurately as
possible (this time without the guidance of the slot) by
referring to the “feel” of the two previous trials. A
second retracing without the aid of the slot
was given in the same manner. For these two trials the
wooden block in the right-hand position was removed.
The same procedure was repeated with the temporary
stops successively in positions two, three, four, and
five.

FIG. 2. PATTERN AND DIMENSIONS OF PROGRESSIVE FINGER TRACING BOARD (all dimensions in inches)

Each S thus made two retracings of each of five
successively more complex patterns, or a total of 10
trials. These trials were assigned, in order, possible
scores of one through ten. Retracings entirely within the
guide lines on the scoring board received full credit.
Retracings which were accurate as to pattern, but
distorted as to proportion received half credit, provided
that they either were made 50 per cent within the guide
lines or terminated within the 2-inch circle. Other
performances received no credit. If no score was
obtained on each of the four trials in two successive
positions, the test was terminated.


The S's index of kinesthetic imagery was the sum of 
his scores on all trials given. The possible maximum 
was 55. 
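The scoring rule for the Progressive Finger Tracing Test can be sketched in a few lines. The sketch assumes that trial k carries a possible score of k points, that full, half, and no credit mean 1, ½, and 0 times that value, and that testing stops once both trials at each of two successive positions earn nothing. The names are hypothetical.

```python
from typing import List

CREDIT = {"full": 1.0, "half": 0.5, "none": 0.0}


def kinesthetic_index(outcomes: List[str]) -> float:
    """Sum the credited trial values for up to 10 trials (two per position).

    `outcomes` lists the judged result of each trial in order, using the
    labels "full", "half", or "none". Trial k is worth k points.
    """
    total = 0.0
    consecutive_zero_positions = 0
    for position in range(5):
        pair = outcomes[2 * position: 2 * position + 2]
        if len(pair) < 2:
            break  # no further trials were given
        earned = sum(
            (2 * position + i + 1) * CREDIT[outcome]
            for i, outcome in enumerate(pair)
        )
        total += earned
        if earned == 0:
            consecutive_zero_positions += 1
        else:
            consecutive_zero_positions = 0
        if consecutive_zero_positions == 2:
            break  # four scoreless trials at two successive positions
    return total
```

With full credit on every trial the sum is 1 + 2 + ... + 10 = 55, which agrees with the stated maximum.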

Subjects 


All Ss were undergraduate students from the University
of British Columbia, most of them from a large
introductory course in psychology. A total of 80 Ss
was tested, but the data reported concern only 60: 22
males and 8 females each in the experimental and control
groups. The remainder were used in pretesting or
“lost” in matching.

The Ss were recruited following a lecture and demonstration
of hypnosis by E, which included a rehearsal of
most of the items in the Friedlander and Sarbin scale, 
with the addition of a positive visual hallucination. 
Individual appointments were made for the test ses- 
sions, which kept contact between Ss to a minimum. 
In addition, the need for secrecy was stressed. Some 
evidence as to the success of these measures is provided 
by the fact that all Ss denied any knowledge of experi- 
mental procedures. Several of the later ones stated 
this with evident annoyance. 

The lecture and demonstration were an attempt to 
partially equalize knowledge concerning hypnosis 
among Ss. An attempt was also made to secure some 
equivalence of motivation. Participants were required 
to sign a rather formidable release of responsibility, 
which probably eliminated those who were least moti- 
vated. Steps were taken to encourage a feeling of
participation by maintaining an informal atmosphere, by
achieving rapport prior to experimental sessions, and
by avoiding the manipulative, authoritative attitude
which often characterizes the hypnotic situation. The
Ss were referred to as “assistants in a study” rather
than as “subjects in an experiment,” while “ability to
achieve hypnosis” was used as a preferred alternative 
to mention of “susceptibility.” 


Procedure 


A testing period of just under one hour was planned
during which six individual tests could be carried out
with each S.

Upon arriving for his session the typical S was 
seated, given certain information concerning the 
experiment, and an opportunity to inquire concerning 
any aspect of it that he might be “uneasy about.” He 
was then blindfolded, his height recorded, and his 
body sway measured in the Hull Sway Test using 
direct suggestions of falling. Following this, each S, 
on the basis of the amount of sway recorded or the
time of fall, was assigned to either the experimental or 
the control group. 

The amount of reaction to the falling suggestions, 
varying from slight negative sway to almost immediate 
fall, was arbitrarily divided into eight gross categories. 
Ss falling into each of these “cells” were assigned to
control and experimental groups in strict alternation. 
Males and females were treated separately. Following 
this assignment, S, still blindfolded, performed the 
Progressive Finger Tracing Test, and then, with the
blindfold removed, the Memory for Designs Test and
the Paper Cutting Test. Sway was then recorded a
second time.
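The assignment rule (strict alternation within each of the eight sway categories, with males and females handled separately) can be sketched as below. Which group receives the first S in a cell is not stated in the text; the sketch arbitrarily starts with the control group, and all names are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple


def assign_groups(arrivals: Iterable[Tuple[str, str, int]]) -> Dict[str, str]:
    """Assign Ss to groups in order of testing.

    Each arrival is (subject, sex, cell), where `cell` is the sway category
    (1-8) from the first sway test. Within each (sex, cell) combination Ss
    alternate between the control and experimental groups.
    """
    seen = defaultdict(int)  # number of Ss already placed per (sex, cell)
    assignment = {}
    for subject, sex, cell in arrivals:
        key = (sex, cell)
        assignment[subject] = "control" if seen[key] % 2 == 0 else "experimental"
        seen[key] += 1
    return assignment
```

Alternating within cells keeps the two groups close on first-test sway without requiring that the whole sample be tested before assignment begins.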

If S had been assigned to the control group, the
second sway test consisted of an exact repetition of the
original test; for the experimental group the situation
was the same, but the recorded instructions were to
imagine falling as vividly as possible, rather than a
repetition of the statement that S was falling. An in- 
quiry was carried out on the extent and type of imagery 
during the course of the sway test. Following this the 
Scale of Hypnotic Depth was administered. Questions 
were asked concerning the amount of distraction ex- 
perienced during the administration of the scale, and 
with a further caution regarding secrecy S was dis- 
missed. 

Equating experimental and control groups. For an
experimental test of the sway hypothesis it is desirable 
to have an experimental and a control group which 
can be shown to be comparable in their responses to 
the standard sway test, as well as in any other char- 
acteristics which are known to influence the results of 
sway tests. The scores of a second sway test in which 
Ss of one group are asked to imagine falling as vividly 
as possible should then show a significantly higher 
correlation with measures of their hypnotizability, 
while the scores of the other (or control) group should 
show no significant change. 

At the conclusion of the experiment certain Ss had 
to be discarded from the already equated groups be- 
cause of faulty kymograph recordings. The resulting 
inequality was adjusted by discarding excess Ss ran- 
domly. On the basis of Edwards' (8) finding that age
and sway are related, one S who was more than twice 
as old as the mean of the control group was discarded, 
as was the equated S in the experimental group. 

The height correction mentioned earlier caused a 
divergence of the mean sway of the two groups. To 
remedy this, and prior to the analysis of the data, one 
S was removed from each of the groups and replaced 
from those previously discarded on the sole basis of
bringing the means of the two groups on the first sway
test more closely together. After these procedures the
two groups, each containing 22 males and 8 females,
were demonstrated by t tests not to differ significantly in
regard to mean age, amount of sway, time of fall, and
Hypnotizability Index (HI).


RESULTS 
Test of the Sway Hypothesis 


This hypothesis is based upon the conten- 
tion that movement occurring as a result of 
verbal stimulation is the outcome of incidental 
imagery engaged in by Ss as a consequence of 
suggestion. Because Arnold believes that ideo-motor
action based on such imagery is also
the basic mechanism actuated in hypnosis, 
she predicts that a direct appeal to imagina- 
tion will produce sway test scores more closely 
related to Ss' indices of hypnotizability than 
will a sway test using Hull's standard instruc- 
tions. Implicit in this prediction is the assumption
that in the hypnotic state the maximum
degree of imagery of which an individual is
capable is attained.

Relation of suggestibility and hypnotizability
indices. Because of the difficulty of establishing
a quantitative relation between sway and
“falls,” S's rank in the group was used as his
SI with the assumption of continuously increasing
influence from negative sway, through
positive sway and falls late in the test, to
falls early in the test.⁴ Hence correlations
which are computed with SI's are r's inferred
according to Garrett (13, p. 362) from
rank-difference correlations (rho's).⁵ The Pearson
product-moment coefficient has been reported
when calculations involve only HI's or indices
of imagery, unless otherwise noted.
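The step from rho to an inferred r can be sketched as follows. The sketch assumes that the conversion referred to is the standard Pearson relation r = 2 sin(πρ/6), which holds for bivariate normal data; the rank-difference formula shown is the usual one for untied ranks. The function names are hypothetical.

```python
import math
from typing import Sequence


def spearman_rho(rank_x: Sequence[float], rank_y: Sequence[float]) -> float:
    """Rank-difference correlation for untied ranks: 1 - 6*sum(d^2)/(n(n^2-1))."""
    n = len(rank_x)
    if n != len(rank_y) or n < 2:
        raise ValueError("need two rank lists of equal length >= 2")
    sum_d2 = sum((a - b) ** 2 for a, b in zip(rank_x, rank_y))
    return 1.0 - 6.0 * sum_d2 / (n * (n * n - 1))


def r_from_rho(rho: float) -> float:
    """Infer a product-moment r from rho (assumed relation r = 2 sin(pi*rho/6))."""
    return 2.0 * math.sin(math.pi * rho / 6.0)
```

The inferred r is always slightly larger in absolute value than rho except at 0 and ±1, so the adjustment is small over the range of values reported here.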

The obtained correlations are tabulated in 
Table 1. It will be noted that the results ap- 
pear to be in a direction opposite from that 
predicted. According to hypothesis, the cor- 
relation between SI's and HI's for the experi- 
mental group in the second test should have 
been greater than that for the control group. 
Instead, the former is not significantly greater
than zero, while the latter is greater to a highly
significant degree (critical ratio, 5.44). The
available evidence thus favors rejecting the
sway hypothesis and throws serious doubt 
upon the accuracy of the first of Arnold's 
predictions. 
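The sway hypothesis concerns the difference between two correlations obtained from independent groups. One common way to examine such a difference directly, which is not the computation reported in this paper, is Fisher's z transformation. The sketch below applies it to the second-test coefficients of Table 1 (.70 for controls, .24 for experimentals, N = 30 each); the function name is hypothetical, and the use of rho-inferred r's in this test is itself an approximation.

```python
import math


def fisher_z_difference(r1: float, n1: int, r2: float, n2: int) -> float:
    """Critical ratio for the difference between two independent correlations.

    Each r is transformed with z = atanh(r); the standard error of the
    difference is sqrt(1/(n1 - 3) + 1/(n2 - 3)).
    """
    z1, z2 = math.atanh(r1), math.atanh(r2)
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    return (z1 - z2) / se


# Control group minus experimental group, second sway test.
ratio = fisher_z_difference(0.70, 30, 0.24, 30)
```

A positive ratio here means the control correlation exceeds the experimental one, the reverse of the predicted direction.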


Test of the Imagery Hypothesis 


Three procedures, the Progressive Finger
Tracing, the Memory for Designs, and the
Paper Cutting tests, were selected for
distinguishing “... those subjects who can
imagine sharply and well in the waking state.”
If Arnold's prediction is to be supported, a
significant correlation between Ss' HI's and
their scores on one or all of these tests should
be found.


⁴ On the question of how to treat negative sway,
Eysenck (9, p. 187) and Berreman and Hilgard (4)
disagree. The treatment indicated is in accord with the
views of the latter, whose evidence seems to bear more
directly on the present study. In this study (possibly
because of the emphasis on relaxation in the
instructions) no case of negative sway observed exceeded 1.5
inches, and hence the rank displacement would be
relatively unimportant.

⁵ Subsequent calculations have shown insignificant
differences between the inferred r's reported and
product-moment correlation coefficients which were
computed using normalized rankings. (See Table 2.)




TABLE 1

CORRELATIONS BETWEEN INDICES OF
SUGGESTIBILITY AND HYPNOTIZABILITY

Group          N     r (rho-inferred)   p*

First Sway Test**
All Ss         60    .45                <.01
Control Ss     30    [illegible]        <.01
Exper. Ss      30    [illegible]        <.05

Second Sway Test†
Control Ss     30    .70                <.01
Exper. Ss      30    .24                >.15

* Refers to difference from zero correlation.
** “Falling” instructions to all Ss.
† Control Ss: “falling” instructions repeated; experimental
Ss: “imagery” instructions.


Only the scores from the Progressive Finger
Tracing Test correlate significantly with the
HI's (r = .36). The Memory for Designs
Test gives an r of .13 and the Paper Cutting
Test one of −.05, neither significantly different
from zero. However, since Arnold found a
combination of two imagery modes more effective
than either used singly in certain sway
tests she carried out, a multiple correlation
was performed in which Finger Tracing and
Memory for Designs scores together were
correlated against HI's. The resulting correlation
was .39, a slight increase over the coefficient
of .36 obtained when Finger Tracing
scores alone were correlated with HI's.
Fortunately, the optimum weightings were such
that a simple addition of the scores for the
two tests gave a “two-test imagery index”
which also correlated .39 with the HI's. In
subsequent computations this composite index
of imagery was used. Because of their lack of
significance, no further use was made of the
scores from the Paper Cutting Test.

Confirmation of the significance of the correlation
thus obtained was made by comparing
the imagery means for Ss in the upper and
lower quartiles and in the upper and lower
deciles. In the first case a difference was
demonstrated at better than the 2 per cent
level, while in the second a much higher level
of confidence was attained. These results
clearly support the imagery hypothesis, and
hence the prediction from which it was
derived.


Further Analysis

At this point the evidence indicates that,
while individuals with higher imagery scores
tend to be more susceptible to hypnosis, a
direct appeal to imagination in a sway test
gives SI's that are no more closely related to
HI's than SI's obtained using the standard
falling instructions. Since, in terms of Arnold's
hypothesis, this evidence seems contradictory,
further analysis is indicated.

If the ability to image is central both to the 
results of sway tests and tests of hypnotizabil- 
ity, an estimate of what the relation between 
the latter two sets of scores would be if the 
influence of the former were removed should 
indicate a considerably lower coefficient of cor- 
relation. The use of partial correlation to obtain
r12.3 (in this case the estimate of what
correlation would exist between SI's and HI's
with the effect of imagery partialed out) gave
for each of the four sway tests a higher rather
than a lower correlation. If significant, this
might mean that whatever is measured in
these imagery tests serves as a variable which 
suppresses the relationship between suggesti- 
bility and hypnotizability, rather than as a 
common factor. 
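The first-order partial correlation referred to here has the standard form r12.3 = (r12 − r13·r23) / √((1 − r13²)(1 − r23²)). The sketch below shows how partialing out a third variable can raise rather than lower a correlation. The value .45 (SI with HI, all Ss) and .39 (imagery with HI) are taken from the text; the SI-imagery correlation is not reported, so the −.10 used is purely hypothetical.

```python
import math


def partial_r(r12: float, r13: float, r23: float) -> float:
    """Correlation between variables 1 and 2 with variable 3 held constant."""
    denom = math.sqrt((1.0 - r13 ** 2) * (1.0 - r23 ** 2))
    if denom == 0.0:
        raise ValueError("partial correlation undefined when |r13| or |r23| is 1")
    return (r12 - r13 * r23) / denom


# 1 = SI, 2 = HI, 3 = imagery index.
r_si_hi = 0.45        # from Table 1, first sway test, all Ss
r_imagery_hi = 0.39   # two-test imagery index with HI
r_si_imagery = -0.10  # hypothetical; not given in the paper

r_partial = partial_r(r_si_hi, r_si_imagery, r_imagery_hi)
```

Whenever the third variable is related to one of the pair but unrelated or negatively related to the other, the partial coefficient exceeds the zero-order one, which is the pattern the text describes as suppression.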


Discussion 


The hypotheses tested are derived from 
only two of several predictions made by Arnold,
but they seem to be ones which are crucial
to her general position, and hence merit
individual consideration. 


The Sway Hypothesis

The findings here appear to be clear-cut
and to leave little room for disagreement as
to interpretation. The sway test is specified
in the prediction tested and is generally
accepted as a standard test of ideo-motor or
prestige suggestibility. The Friedlander and
Sarbin scale seems to provide the best available
index of hypnotizability. The results are
significantly in a direction opposite to that
predicted.

From the viewpoint of learning theory and 
the generalization of a stimulus situation, 
however, it might have been predicted that, 
for the experimental group, the results of the 
first sway test would be more closely related 
to the HI than those of the second.⁶ Both
during the first sway test and the administration
of the Friedlander and Sarbin scale S is
told that certain things are happening or are
about to happen. The situation is such that E
becomes the active and dominant figure, with
nothing required of S except relaxed attention.
The S in the experimental group, on the
other hand, who carries out the second sway
test where he is asked to imagine falling “in as
vivid a manner as possible,” is being asked to
participate actively in the situation. He is
required to make a personal contribution to the
experiment, to choose the modality and situation
upon which to focus his imagination.

⁶ Dr. Leo Postman did, in fact, make such a “blind”
prediction.

The effect of these divergent conditions is 
demonstrated in Table 2. Although in both the 
experimental and the control groups the re- 
sults of the two sway tests are highly corre- 
lated, the multiple correlation of the two sway 
tests gives considerably less prediction of 
hypnotizability for the experimental group. 
This may indicate that different elements are 
common to the two sway tests given to the 
experimental group than are common to those 
given to the control group. To the extent that
elements common to the stimulus situations
of the sway tests given to the experimental
group do not appear in the stimulus situation
when the HI's are determined, it seems possible
to say that there will be a decrease in
ability to predict the HI from the results of
the sway tests.
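The multiple correlation of one criterion with two predictors follows from the three pairwise coefficients: R1.23 = √((r12² + r13² − 2·r12·r13·r23) / (1 − r23²)). The sketch below applies this standard formula to the experimental-group entries of Table 2. The scanned table is partly illegible; the coefficient between the two sway tests is read here as .73, which is an assumption, although it is the reading that reproduces the tabled R of .49.

```python
import math


def multiple_r(r12: float, r13: float, r23: float) -> float:
    """Multiple correlation of variable 1 with variables 2 and 3 combined."""
    if abs(r23) >= 1.0:
        raise ValueError("predictors must not be perfectly correlated")
    r_squared = (r12 ** 2 + r13 ** 2 - 2.0 * r12 * r13 * r23) / (1.0 - r23 ** 2)
    return math.sqrt(r_squared)


# Experimental group: 1 = HI, 2 = first sway test, 3 = second sway test.
R_exper = multiple_r(0.47, 0.24, 0.73)  # .73 is an assumed reading of the table
```

Because the second sway test correlates only .24 with the HI while correlating highly with the first test, combining the two adds almost nothing to the .47 obtained from the first test alone.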
Stated in this manner, the question of
imagery becomes secondary to the determination
of those characteristics shared by the
stimulus situations. We will return to this
point later.


The Imagery Hypothesis 
In her paper Arnold has indicated no operation
or procedure for selecting “... those subjects
who can imagine sharply and well.” If
the results from the procedures used to secure
the two-test imagery index are accepted as
indications of the Ss' imagery ability, there
can be little ambiguity in the interpretation
of the relevant findings. It would then seem
quite clear that Ss who have a high degree of
imagery ability do tend to be more susceptible
to hypnotic stimulation. But is it justifiable
to consider the scores obtained as indices of
imagery ability?

TABLE 2
MULTIPLE CORRELATION BETWEEN RESULTS OF TWO
SWAY TESTS AND INDICES OF HYPNOTIZABILITY
USING NORMALIZED RANK-ORDER SCORES (N = 30)

Group      r12           r13           r23          R1.23
           (HI & ST 1)   (HI & ST 2)   (ST 1 & 2)
Controls   .46           .66           .68          .66
Exper.     .47           .24           .73          .49

Note.—All correlations reported in Table 2 are product moment,
computed from normalized rankings. Comparison with
Table 1 indicates the consistency of this treatment of data with
results from rank-difference correlations.

Justification of imagery scores. Their use 
may be justified in three ways: by appeal to 
past usage and to authority, on rational 
grounds, and by recourse to experimental 
evidence. The researches on eidetic imagery 
by Jaensch and his students have stressed the 
extent to which Ss have accurately recalled de- 
tails of briefly exposed pictures. Carmichael 
(5) has explicitly suggested the use of the
Binet figures as a screening procedure for
selecting eidetic imagers. Eidetic imagery may
be taken to show an extreme position on the 
continuum of ability to image visually. Finally, 
Arnold in a personal communication has 
indicated her acceptance of the procedures 
used. 

Imagery, according to Dexter, “. . . connotes 
the mental calling up of sensations" (7, p. 133). 
If it is further assumed that “goodness” of
imagery is a function of the accuracy with 
which such called-up sensations replicate the 
original, then it seems reasonable to believe
that a person with well-developed imagery 
ability should show a superior performance on 
reproduction tests. It should be noted that 
these tests are not taken to be pure measures of 
imagery in the particular modalities, since 
there often seems to be interaction between 
modalities such as kinesthetic and visual. On 
the other hand, other abilities undoubtedly 
enter into performance; for example, it seems 
likely that some sort of ability to cope with 
spatial relations may be common to visual and 
kinesthetic test results. 

Davis (6) has supplied experimental evi- 
dence which strongly indicates the functional 
validity of the concept of imagery. He pre- 
pared several tests designed to be best carried 
out through the use of various modalities of 
imagery. His results show a high degree of 
correspondence between the scores obtained 
and reports as to the modality of imagery used
in their performance; that is, Ss reporting the
use of visual imagery on tests designed to be
carried out in this way showed significantly
better scores than Ss who reported the use of
another modality, or who made no use of
imagery. This seems to support the rational
approach.

Imagery in operational terms. It may be
that the measures used in the present experiment
are simply retention scores with no
particular bearing on the concept of imagery.
There is no doubt that they are retention
scores, but this hardly necessitates the elimination
of imagery as a relevant variable. We may
inquire into the mechanisms of retention, and
find that the postulation of the intervening
construct of imagery adds a useful parameter
to memory experiments. Its occurrence may
prove to be a function of the type of stimulus
materials used, tending to be at a maximum,
for example, when the materials offer the least
possibility for association.

While each of the operations specified for
securing measures of imagery must be objective
and reliable, it will be possible in determining
the limits of the class of such operations
to draw upon the accumulation of highly
stimulating introspective research in this area.
When some degree of correspondence has been
shown between the traditional subjective concept
of imagery and the results of the operations
which have been chosen to redefine it,
experimental research must proceed without
further direct recourse to the original concept.

The process of imagery may be conceived
of as some sort of comparison between a
relatively permanent sensory trace and the present
stimulus situation. Introspectionists allowed
this interpretation to be neglected, because in
many cases where such a process must be
assumed no reports could be obtained as to its
occurrence. To further reconcile the classi


old of report.” It seems possible to consider
that for situations where there is frequent
recourse to this process of comparison, habituation
leads to a raising of the threshold and the
process becomes subliminal. One might then
refer to “imageless imagery,” but in the present
context this is not an absurdity.

Imagery scores as predictors. Researchers in
learning and memory have tended to direct
their efforts toward the discovery of laws and
relationships of the widest possible generality.
Little attention has been paid to individual 
variation, and there has been little concern 
for prediction from performance in this area 
to more general types of behavior. The concept 
of imagery as a process in which individual 
differences occur seems likely to suggest an 
emphasis upon the use of the operations by 
which it is defined to predict other types of 
performance for the individual. 

An example of the predictive value of such 
operations may be taken from the present 
study. In the control group the first sway test 
correlates only .46 with hypnotizability. The 
addition of the two imagery tests in a multiple 
correlation results in a coefficient of .70, a very 
respectable increase in predictive power. It is 
possible that the use of such imagery tests 
may provide another predictor of hypnotiza- 
bility which is independent of the sway test, 
such as Eysenck (9, p. 171) reports the heat 
illusion test to be.
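The gain from adding predictors can be illustrated with a short sketch of how a multiple correlation is obtained from pairwise correlations. This is only an illustration under stated assumptions: the function names are hypothetical, only the .46 sway-test validity comes from the text, and every other coefficient below is an invented placeholder, since the intercorrelations among the three predictors are not reported in this section.

```python
import math


def solve(a, b):
    """Solve the linear system a * x = b by Gauss-Jordan elimination."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        # Partial pivoting: bring the largest remaining entry to the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def multiple_r(validities, intercorrelations):
    """Multiple correlation of a criterion with several predictors.

    validities: correlation of each predictor with the criterion.
    intercorrelations: correlation matrix among the predictors.
    """
    betas = solve(intercorrelations, validities)  # standardized weights
    r_squared = sum(b * r for b, r in zip(betas, validities))
    return math.sqrt(r_squared)


# Hypothetical illustration: sway test (.46 is from the text) plus two
# imagery tests whose validities and intercorrelations are invented here.
validities = [0.46, 0.45, 0.20]
intercorrelations = [
    [1.00, 0.10, 0.05],
    [0.10, 1.00, 0.30],
    [0.05, 0.30, 1.00],
]
print(round(multiple_r(validities, intercorrelations), 2))
```

A predictor adds most when it correlates with the criterion but only weakly with the predictors already in use, which is the sense in which an imagery test independent of the sway test would be valuable.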

The stimulus situation. It may be profitable
to consider the tests of imagery in relation to
the similarity of the stimulus situations
involved. As will be recalled, the index of
kinesthetic imagery correlates moderately with
the HI, while that of visual imagery gives a
correlation that is not statistically significant,
although its inclusion in a multiple correlation
increases the predictive power of the imagery
tests. A scrutiny of the verbal stimuli in the
Scale of Hypnotic Depth shows that the only
reference to vision or visual sensation has to
do with the fixation on the light source and
with the tiredness of the eyes. On the other
hand there is continual reference to kinesthetic
sensations: the emphasis on muscular
relaxation, the mention of heaviness, limpness,
strain, etc. In addition, all of the five chal- 
lenges have to do with the movement of certain 
muscular groups, E asserting that S will not 
be able to perform the movement. Of these 
challenges the fifth (“you cannot say your 
name") is probably the least concerned with 
kinesthetic sensation, while the fourth (“you 
cannot separate your clasped hands”) relates 
much more to this area. When a biserial
correlation is made between failing to resist the
“name” challenge and scores on the finger
maze test, a coefficient of .38 is obtained. A
similar computation for the “handclasp”
challenge gives a coefficient of .65. Thus, once
again there is an increasing degree of relationship
between scores when the objective stimulus
situations are more similar.
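For readers unfamiliar with the statistic, the biserial coefficient used above can be sketched as follows. The sketch assumes the textbook formula, in which the dichotomy (here, failing or resisting a challenge) is treated as a cut on an underlying normal variable; the function name and the data are hypothetical and are not taken from the study.

```python
from statistics import NormalDist, fmean, pstdev


def biserial_r(scores, in_upper_group):
    """Biserial correlation between a continuous score and a dichotomy.

    scores: continuous measurements, one per subject.
    in_upper_group: matching booleans marking one side of the dichotomy.
    """
    upper = [s for s, flag in zip(scores, in_upper_group) if flag]
    lower = [s for s, flag in zip(scores, in_upper_group) if not flag]
    p = len(upper) / len(scores)  # proportion in the marked group
    q = 1.0 - p
    # Height of the unit normal curve at the point dividing p from q.
    ordinate = NormalDist().pdf(NormalDist().inv_cdf(p))
    mean_difference = fmean(upper) - fmean(lower)
    return mean_difference / pstdev(scores) * (p * q / ordinate)


# Hypothetical data: six test scores and whether each S failed a challenge.
scores = [1, 2, 3, 4, 5, 6]
failed_challenge = [False, True, False, True, False, True]
print(round(biserial_r(scores, failed_challenge), 3))
```

Because of the normality assumption, the biserial coefficient runs somewhat higher than the point-biserial value computed from the same data and can exceed 1.0 in small or markedly non-normal samples.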

Phenomenal aspects. Up to this point only
passing mention has been made of the motivations
of the various Ss. Even though the attempt
was made to equalize degree of motivation,
a superficial noting of the reactions in
the testing sessions showed a tremendous 
variety. Certain Ss volunteered because they 
hoped that hypnosis would help them to 
solve their study problems. Some seemed 
frankly apprehensive, while others regarded 
the hypnotic situation as a test of will power. 
Traditionally, small numbers of Ss have been 
used in experiments involving hypnosis, and 
not a great deal of attention has been paid to 
what motivated them to participate or to their 
emotional reactions to this fascinating sub- 
ject. The effects of motivation on perception 
have been well documented, and emotion is a 
powerful motivator. The lack of a method of 
equating motivation is a real difficulty in the 
precise interpretation of the present experi- 
ment. 

It would seem highly likely to the writer 
that future work in this area cannot ignore the 
effects of stimulus generalization and the 
similarity of situations in which related scores 
are obtained. But, because of the probable 
effects of differential motivation, efforts should
not be concentrated entirely upon obtaining
measures of objective similarity. To achieve 
optimum results the investigator must secure 
evidence that perceptual or phenomenal 
similarity has been achieved. 


SUMMARY 


An experimental investigation was carried 
out concerning two predictions made by 
Arnold. They are based on her hypothesis that 
the common mechanism active in both “pres- 
tige” suggestion and hypnosis is the occur- 
rence of ideo-motor action as a consequence of 
the subject’s imagery. 

The first prediction is that sway test scores 
secured when S is asked to imagine falling 
should show a higher correlation with scores of 
hypnotizability than sway test scores obtained 
with the standard Hull “falling” instructions.
Two groups of 30 undergraduates each,
equated on an initial sway test, were subjected
to a second such test. The control group,
repeating the original standard test, showed a 
significantly higher correlation with scores on 
the Friedlander and Sarbin Scale of Hypnotic 
Depth. The experimental group, asked to
imagine falling in the second test, showed a 
significantly lower correlation. These results 
are opposite in direction to the prediction. 

A second prediction, that only those who can 
imagine most vividly and well will be capable 
of attaining the deepest states of hypnosis, 
implies a correlation between imagery ability 
and susceptibility to hypnosis. Using the 
scores of all 60 Ss on the Friedlander and Sar- 
bin scale, a significant though moderate 
multiple correlation was secured with two 
tests designed to secure measures of kines- 
thetic and visual imagery, respectively. This 
evidence supports the second prediction. 

A partial correlation holding constant the 
influence of imagery on scores of suggestibility 
and hypnotizability gives no evidence that it 
is a factor common to both. Regardless of the 
metaphysical status of the concept of imagery, 
it is suggested that the testing procedures 
described may usefully serve as predictors of 
other behavior, specifically, susceptibility to 
hypnosis. 

Attention is called to the extent to which it 
appears that results may be attributed to the 
similarity of the stimulus situations involved
in the procedures used. Because of the strength
and variety of the motivations apparently
involved, it is suggested that further investiga-
tions should attempt to equate phenomenal as 
well as objective similarities. 


REFERENCES 


1. Arnold, Magda B. On the mechanism of suggestion and hypnosis. J. abnorm. soc. Psychol., 1946, 41, 107-128.

2. Asch, S. E. The doctrine of suggestion, prestige and imitation in social psychology. Psychol. Rev., 1948, 55, 250-276.

3. Barry, H., MacKinnon, D. W., & Murray, H. Studies in personality. A: Hypnotizability as a personality trait and its typological relations. Hum. Biol., 1931, 3, 1-36.

4. Berreman, J. V., & Hilgard, E. R. The effects of personal heterosuggestion and two forms of autosuggestion upon postural movement. J. soc. Psychol., 1936, 7, 289-300.

5. Carmichael, L. Eidetic imagery and the Binet test. J. educ. Psychol., 1925, 16, 251-252.

6. Davis, F. C. The functional significance of imagery differences. J. exp. Psychol., 1932, 15, 630.

7. Dexter, E. S. What is imagination? J. gen. Psychol., 1943, 28, 133-138.

8. Edwards, A. S. The measurement of static ataxia. Amer. J. Psychol., 1942, 55, 171-188.

9. Eysenck, H. J. Dimensions of personality. London: Routledge & Kegan Paul, 1947.

10. Eysenck, H. J., & Furneaux, W. D. Primary and secondary suggestibility: an experimental and statistical study. J. exp. Psychol., 1945, 35, 485-503.

11. Fearing, F. S. The factors influencing static equilibrium: an experimental study. J. comp. Psychol., 1924, 4, 92-121.

12. Friedlander, J. W., & Sarbin, T. R. The depth of hypnosis. J. abnorm. soc. Psychol., 1938, 33, 453-475.

13. Garrett, H. E. Statistics in psychology and education. (2nd Ed.) New York: Longmans, Green, 1939.

14. ... waking suggestion. J. abnorm. soc. Psychol., 1929, 24, 153-169.

15. Hull, C. L. Hypnosis and suggestibility: an experimental approach. New York: D. Appleton-Century, 1933.

16. Jacobson, E. Progressive relaxation. Chicago: Univer. of Chicago Press, 1938.

17. Jenness, A. Hypnotism. In J. McV. Hunt (Ed.), Personality and the behavior disorders. Vol. I. New York: Ronald, 1944. Pp. 466-502.

18. McBain, W. N. An experimental study of an hypothetical mechanism of suggestion and hypnosis. Unpublished master's thesis, Univer. of British Columbia, 1951.

19. Terman, L. M., & Merrill, Maud A. Measuring intelligence. Boston: Houghton Mifflin, 1937.

20. White, M. M. The physical and mental traits of individuals susceptible to hypnosis. J. abnorm. soc. Psychol., 1930, 25, 293-298.

21. White, R. W. A preface to the theory of hypnotism. J. abnorm. soc. Psychol., 1941, 36, 477-505.

Received October 7, 1952. 


PSYCHOLOGICAL DEFENSES AND “EGO STRENGTH” IN THE
RECALL OF COMPLETED AND INCOMPLETED TASKS


CHARLES W. ERIKSEN
The Johns Hopkins University 


THE interrelationships between “ego
strength” and psychological defenses
have been frequently recognized in
clinical practice and theory, but the nature of 
the relationships is not clear and not ex- 
plicitly stated. The reason for this lack of 
clarity resides primarily in the lack of opera- 
tional meaning or definition of the concept of 
ego strength and also in the lack of experi- 
mental data on defensive processes. The pres- 
ent experiment is an attempt to shed some 
light on this problem by investigating some 
of the personality correlates of individual 
differences in reactions to threat to self-esteem. 
The completed-incompleted task method
was selected as a device for threatening self- 
esteem, and the subsequent recall of incom- 
pleted and completed items as a measure of 
individual differences in reaction to this threat. 
The choice of this particular technique as a 
locus of investigation was determined in part 
by the confusion that exists over the personality
variables that are involved in the differential
recall of successful and failed tasks. Alper
(1), using an estimate of ego strength arrived
at from clinical judgments, found that
individuals characterized as “strong egos”
favored incompleted tasks in recall under
task-oriented conditions and completed tasks in
their recall under conditions of threat to
self-esteem. Individuals characterized as having
“weak egos” showed a reversal of this
relationship. They tended to recall completed
tasks under task-oriented conditions and in- 
completed tasks when self-esteem was objec- 
tively threatened by the experimental situa- 
tion. The implication of these results is that 
weak egos are threatened by the objective, 
nonthreatening, task-oriented condition and 
bolster self-esteem by recalling successes. 
However, when the threat becomes objec- 
tively more severe, they are overwhelmed by 
their inadequacy, and recall their failures. 
Strong egos, on the other hand, are less easily 
threatened, and when threat does occur they 
cope with it by emphasizing successes rather
than being overwhelmed by their inadequacy.

Opposed to the ego-strength interpretation 
of differential recall is an interpretation in 
terms of defensive processes. Some investi- 
gators have compared differential recall with 
the defensive mechanism of repression (7, 12). 
Rosenzweig and Sarason (13) have carried 
out this comparison by showing that the 
tendency to recall completed tasks when self- 
esteem is threatened is related to hysteria. 
Eriksen (3, 4) has shown further that the 
tendency to forget failed tasks is related to 
avoidance behavior in the presence of other 
forms of psychological threat. 


HYPOTHESES 


In the present study the following general 
hypotheses were set up to be tested: (a) Ego 
strength should be directly related to the 
tendency to recall incompleted tasks under 
task-oriented conditions and inversely related 
to incompleted-task recall under self-esteem- 
oriented conditions. (b) Hysteria should be 
inversely related to incompleted-task recall 
under self-esteem-oriented conditions. (c) Psy- 
chasthenia should be directly related to in- 
completed-task recall under self-esteem-ori- 
ented conditions. 

This latter hypothesis is based upon Ey- 
senck's report (6) that hysteria and psychas- 
thenia are on opposite poles of a bipolar 
factor “extraversion-introversion” that he
found in his neurotic sample. Also Lazarus, 
Eriksen, and Fonda (9) found that patients 
diagnosed as psychasthenic used psychological 
defenses that were less characterized by per- 
ceptual-cognitive avoidance than the defenses 
used by hysterics. 


METHOD 


The experiment involved three sessions. One session
was devoted to the completed-incompleted task ex- 
periment and the measure of recall. A second session 
was used for administering the measure of ego strength, 
and a third session for administering the group form of 
the Minnesota Multiphasic Personality Inventory 
(MMPI).

Completed-incompleted tasks. The tasks used for
this part of the experiment were 16 sentences¹ of 20
words each which had been scrambled into two- and 
three-word phrases. The S's task was to rearrange these 
phrases so as to obtain a meaningful sentence. Success 
and failure were controlled by having eight of the 
sentences solvable and the remaining eight unsolvable. 
Each sentence was given a “title” by underlining one 
of the two-word phrases and placing this phrase first 
in the order of scrambled phrases as well as above the 
entire scrambled sentence. The sentences were ad- 
ministered to groups of 30 to 40 Ss. Each sentence was 
projected on a screen for 3 min., during which time 
each S was to write his solution on a page of an examination
booklet. Separate pages were used for each sentence,
and Ss were strongly cautioned against reviewing.
Following the sixteenth sentence, Ss spent 5
min. writing down all the five-letter words they could
think of that began with the letter /. A 5-min. recall
period followed during which Ss wrote down the titles
of all of the sentences they could recall.

The sentences were administered to the experi- 
mental group (objectively self-esteem threatened) under 
the guise of an intelligence test. They were told that
E was interested in selecting Ss who were bright enough 
to participate in an experiment on abstract reasoning. 
They were further told that anyone with an IQ of 115 
or better should have little or no difficulty solving all of 
the sentences within the time limits. To increase de- 
ception, classmate stooges were scattered throughout 
the group. These stooges pretended to complete all of 
the 16 sentences.

The sentences were administered to control Ss
(no objective self-esteem threat) under identical condi- 
tions except for different instructions and the absence 
of stooges. They were told that the sentences had been 
made up by E to keep Ss in a learning experiment 
occupied during rest intervals so they could not re- 
hearse, and that E simply wanted to determine if 
the sentences were difficult enough to keep Ss busy
for the necessary lengths of time. The control group
was told that E was already certain that some of the
sentences were too difficult to be solved within the
time limits. The Ss were asked to work as rapidly
as possible in order to make their performance
comparable to that of the learning Ss.

Ego-strength scale. In order to study the role of ego 
strength in completed-incompleted task recall, it was 
first necessary to define and/or measure what is meant 
by this construct. There are probably as many different 
meanings of this term as there are people who use it. 
Most of these different usages, however, seem to refer 
to a common aspect of behavior, that is “the degree to 
which an individual’s behavior is generally in keeping 
with the realities of the objective situation.” The instrument
used to assess this quality of behavior is generally
a clinical psychologist. However, since a clinical
psychologist is too expensive an instrument to be
lying around the laboratory, it was necessary for
purposes of the present experiment to devise some other
measure of ego strength.

¹ The ego-strength scale, the 16 scrambled sentences,
and a more detailed treatment of the procedure have
... Service, Library of Congress, Washington 25, D. C.,
remitting $1.25 for 35-mm. microfilm or $1.25 for
photostats. Make checks payable to Chief, Photoduplication
Service, Library of Congress.

For the present, ego strength is defined as “the
individual's capacity for appraising the reasonable
limits in his interpretations and perceptions of his
environment.” The operational meaning of this definition
is in terms of the extent to which an individual's yes-no
responses to suggested interpretations of the Rorschach
inkblots are in agreement with norms derived from
other individuals of comparable intellectual level and
social background.

The rationale for this definition is based upon the
assumption that behavior is a consequence of the
perceived situation. Thus, for behavior to be “in keeping
with the realities of the situation” the individual must
be able to perceive these realities. For any given
stimulus situation, he must be able to discriminate between
what are real or reasonable perceptual interpretations
and what are unreal or unreasonable ones. The criteria
for determining what are real or reasonable interpretations
depend not only on the physical characteristics or
qualities of the stimulus, but also upon other variables
such as intelligence and social-cultural values.
Most of us would agree that the
interpretation “bat” to Card V of the Rorschach is a
reasonable interpretation, while the interpretation “a
man raping a woman” is an unreasonable one.

The interpretation of inkblots would seem to afford
a method for assessing an individual's capacity for
appraising “reasonable limits.” The general unfamiliarity
of inkblots throws the S upon his own resources, yet at
the same time considerable agreement can be obtained
among observers as to what are reasonable interpretations
of given inkblots or inkblot areas. Using a standard
set of inkblots and interpretations of them, it is
possible to determine the extent to which any given
individual's judgments of reasonable and unreasonable
interpretations agree with the judgments of a majority
of other observers.² (For further discussion of this
rationale, see 5.)

An ego-strength scale was constructed based upon
the 50 Rorschach concepts used in the McRey

² The above rationale is very similar to that underlying
the F+% score which is used in the Rorschach
test. F+ and F- responses are scored on the basis of
whether they occur more frequently among a normal
population or among a psychiatric group. According to
Beck (2) and other clinicians, F+% is considered
an indicator of ego strength.

³ See footnote 1.




this experiment would be drawn. Accordingly, the 50 
concepts were administered to a random sample of 100
male undergraduate students at The Johns Hopkins 
University. 

The test was administered individually. Each S was 
given a general explanation of the inkblots and their 
interpretation. He was then given the following specific 
instructions: “I’m going to point out various areas in 
these inkblots and ask you if they could look like various 
objects or things to you. For example, I'll point out an 
area and say, ‘Could this be a___?’ I want you to look 
at the area I indicate and say, ‘yes,’ if you think it looks
like what I suggested, and ‘no’ if you don’t think it
does. Some of the concepts I am going to point out to
you most people would agree are reasonable interpreta- 
tions for the particular area of the inkblot. Other con- 
cepts I point out most people would agree are not 
reasonable interpretations or concepts. However, you 
are to use your own judgment. If the concept looks like 
a reasonable interpretation to you, say, ‘yes’; if it 
doesn’t, say, ‘no.’ Are there any questions?” 

The percentage of students answering “yes” to each 
of the concepts was computed. Concepts to which 50 
per cent or more of the students answered “yes” were
defined as reasonable. Twenty-six concepts met this 
criterion. The remaining 24 were considered as un- 
reasonable interpretations. The results of this analysis 
agreed quite closely with Beck's plus and minus cate- 
gories. Only two concepts which are scored minus by 
Beck received 50 per cent or more “yes” responses. 
They were "dancer" for area Ds 5 in Card II (56 per 
cent) and "face" for area D 8 in Card III (57 per cent). 
Only one concept scored plus by Beck was answered
“yes” by less than 50 per cent of the sample. It was 
“man’s face" to area Ds immediately above left D 4 in 
Card VI (44 per cent). 

The reliability of this scale was determined by the
odd-even method. When corrected by the Spearman- 
Brown prophecy formula, the correlation between num- 
ber of “yes” responses was .85. 
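The odd-even procedure with the Spearman-Brown correction can be sketched as below. The function names and the tiny response matrix are hypothetical; the sketch assumes each S's answers are coded 1 for “yes” and 0 for “no”, and that the half scores are the numbers of “yes” responses on the odd-numbered and even-numbered concepts.

```python
import math


def pearson_r(x, y):
    """Product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def spearman_brown(r_half):
    """Estimate full-length reliability from a half-test correlation."""
    return 2 * r_half / (1 + r_half)


def odd_even_reliability(responses):
    """responses: one list of 0/1 answers per subject, in item order."""
    odd_totals = [sum(row[0::2]) for row in responses]   # items 1, 3, 5, ...
    even_totals = [sum(row[1::2]) for row in responses]  # items 2, 4, 6, ...
    return spearman_brown(pearson_r(odd_totals, even_totals))


# Hypothetical answers of four Ss to six concepts (1 = "yes", 0 = "no").
responses = [
    [1, 1, 0, 1, 0, 0],
    [1, 1, 1, 1, 1, 0],
    [0, 0, 0, 1, 0, 0],
    [1, 0, 1, 1, 1, 1],
]
print(round(odd_even_reliability(responses), 2))
```

Under this formula a corrected coefficient of .85 corresponds to a correlation of about .74 between the two half scores.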

The above scale was administered individually to 
the Ss in this experiment. An S’s score on the scale was 
the total number of “correct” responses he gave. That
is, his score consisted of the number of unreasonable
concepts to which he had answered “no” plus the number
of reasonable concepts to which he had answered
“yes.” Thus a high score on the scale reflects a good
capacity to appraise reasonableness as here defined. It 
should be noted that low scores can result from either 
saying “no” to reasonable interpretations or saying 
“yes” to unreasonable ones. 
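The norming and scoring rules just described can be summarized in a brief sketch. It assumes answers are stored as booleans (True for “yes”); the function names and the miniature data set are hypothetical and serve only to show the 50 per cent criterion and the counting of “correct” responses.

```python
def reasonable_key(norm_answers, cutoff=0.5):
    """Classify each concept from a standardization sample.

    norm_answers: one list of booleans per subject (True = "yes").
    Returns True for concepts endorsed by at least `cutoff` of the sample,
    i.e. concepts defined as reasonable.
    """
    n_subjects = len(norm_answers)
    n_concepts = len(norm_answers[0])
    return [
        sum(subject[i] for subject in norm_answers) / n_subjects >= cutoff
        for i in range(n_concepts)
    ]


def ego_strength_score(answers, key):
    """Count agreements with the key: "yes" to reasonable concepts plus
    "no" to unreasonable ones."""
    return sum(answer == reasonable for answer, reasonable in zip(answers, key))


# Hypothetical standardization data: four Ss, three concepts.
norm_answers = [
    [True, False, True],
    [True, False, False],
    [True, True, False],
    [False, False, False],
]
key = reasonable_key(norm_answers)
print(key)
print(ego_strength_score([True, False, False], key))
```

Both kinds of error lower the score equally, so the total alone does not show whether a low-scoring S rejected reasonable concepts or accepted unreasonable ones.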

MMPI. The group form of the MMPI was administered
to the Ss during regular class session. For the
present experiment only the hysteria and psychasthenia
scales with the K correction factor were scored.

Subjects. In addition to the standardization group 
for the ego-strength scale, 66 male undergraduate 
students at The Johns Hopkins University participated 
in the experiment. However, not all of these Ss were
available for all three experimental sessions. An additional
13 control Ss and 32 experimental Ss were used
in the completed-incompleted task phase of the experiment,
but were dropped from the remainder of the
experiment because of their failure to complete all of the 
solvable tasks. 


TABLE 1

NUMBER OF INCOMPLETED AND COMPLETED TASKS
RECALLED BY THE EXPERIMENTAL
AND CONTROL GROUPS

                EXPERIMENTAL (N = 44)    CONTROL (N = 23)
TASKS             Mean      SD             Mean      SD         t*
Incomplete         2.0     1.2              3.4     1.4       4.29
Complete           3.6     1.5              3.?       ?        .28
(I-C)/(I+C)       -.23     .44             -.05     .36       1.76

* t values of 1.67 and 2.39 are significant at the .05 and the .01
levels, respectively, for a one-tailed test.


RESULTS 


Our first concern is with performance in the 
completed-incompleted task phase of the 
experiment. In Table 1 the experimental and 
control groups are compared on the number 
of completed and incompleted tasks recalled 
and on a recall score computed for each S by 
subtracting the number of completed tasks 
he recalled from the number of incompleted 
tasks recalled and dividing by the total re- 
called. From this table it can be seen that 
both groups recalled the same number of completed
tasks, but the objective self-esteem-threatened
Ss recalled significantly fewer incompleted
tasks. On the basis of recall score
the two groups again differed. The recall
scores for the experimental group reflect their
preference for completed as opposed to incom- 
pleted tasks in recall. Recall scores for the 
experimental group are also more variable 
than for the controls, although this difference 
in variability is not statistically reliable 
(F = 1.48). 
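The recall score defined above, (I-C)/(I+C), can be written out as a small function. The function name is hypothetical, and the handling of an S who recalls nothing is an assumption, since the text does not say how such a case would be treated.

```python
def recall_score(incompleted_recalled, completed_recalled):
    """Relative preference for incompleted tasks in recall.

    Ranges from -1 (only completed tasks recalled) to +1 (only
    incompleted tasks recalled). Returns None when nothing was recalled,
    because the ratio is then undefined (an assumption of this sketch).
    """
    total = incompleted_recalled + completed_recalled
    if total == 0:
        return None
    return (incompleted_recalled - completed_recalled) / total


# A hypothetical S recalling 2 incompleted and 4 completed titles.
print(recall_score(2, 4))
```

Because the score is computed separately for each S and then averaged, the group mean need not equal the value obtained by inserting the group means into the formula; the experimental means of 2.0 and 3.6 would give about -.29, whereas the tabled mean score is -.23.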

These results from the completed-incom- 
pleted task phase of the experiment are in 
general agreement with the findings of other 
investigators (7, 10, 12). It seems safe, there- 
fore, to assume that the present experiment 
was successful in producing two degrees of 
self-esteem threat. 

Our second concern is with the personality 
correlates of individual differences in selective 
recall in the completed-incompleted task
situation. Table 2 shows the correlations obtained
between scores on the ego-strength
scale and recall scores under the two degrees
of self-esteem threat. For the control group a
positive correlation of borderline significance 
is obtained between the ego-strength scale
and recall score. Since high recall scores reflect
the tendency to favor incompleted tasks in
recall, this correlation means that people
characterized by strong egos tend to recall
relatively more incompleted tasks under a
condition of little or no objective self-esteem 
threat. For the experimental group the direc- 
tion of the relationship is reversed. Here we 
find that Ss characterized as strong egos now 
tend to favor completed tasks in their recall, 
while weak egos favor the incompleted or 
failed tasks. These results are in keeping with 
the previously stated hypothesis, and they 
support Alper's (1) data to the extent that we 
can assume that the ego-strength scale used 
here is measuring essentially the same thing 
that Alper measured as ego strength. 

A third question concerns the relation be- 
tween scores on the hysteria and psychasthenia 
scales and recall of completed and incompleted 
tasks under the condition of objective self- 
esteem threat. Table 3 shows these correla- 
tions. Both hysteria and psychasthenia are 
related to recall score in the predicted direc- 
tion. For hysteria this relation is negative, 
indicating that Ss with high hysteria scores 
tend to recall relatively fewer incompleted 
than completed tasks. The positive correlation 
between psychasthenia and recall score, on 
the other hand, shows that Ss with high psy- 
chasthenia scores tend to recall relatively 
more incompleted than completed tasks. Even
though these correlations are in the direction
that would be predicted from theory and 
previous results, they need to be further 
analyzed before we can conclude that the 
personality dimensions of hysteria and psy- 
chasthenia are involved in completed-incom- 
pleted task recall independently of the ego- 
strength dimension. 
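The one-tailed probabilities attached to these correlations are stated to rest on Fisher's z transformation. A minimal sketch of that test follows, assuming the usual standard error of 1/sqrt(N - 3) and a null hypothesis of zero correlation; the function name is hypothetical and the exact computing routine used in the study is not described.

```python
import math
from statistics import NormalDist


def fisher_z_one_tailed_p(r, n):
    """One-tailed probability of a correlation of size |r| or larger
    under a true correlation of zero, via Fisher's z transformation."""
    z = math.atanh(abs(r)) * math.sqrt(n - 3)  # z' divided by its SE
    return 1.0 - NormalDist().cdf(z)


# Control-group values reported in Table 2: r = .38, N = 19.
print(round(fisher_z_one_tailed_p(0.38, 19), 3))
```

With the control-group values the sketch returns a probability of about .055, which agrees with the tabled figure and suggests that the assumed standard error is the one that was used.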

It is possible that the hysteria and psychas- 
thenia scales correlate with recall scores 
because they themselves are measures of ego 
strength in addition to being measures of the 
hysteria and psychasthenia dimensions. That 
this may be the case is suggested by the -.24
and -.29 correlations between the ego-strength
scale and the hysteria and psychasthenia
scales, respectively. When scores on
the ego-strength scale are partialled out, or in 
effect held constant, the partial correlation 
between the hysteria scale and recall score is 
PES and between psychasthenia and recall 


score, .14. The size of the partial correlation
between the hysteria scale and the recall score
is large enough to suggest that the hysteria
scale is related to recall score independently
of the correlation with ego strength. However,
the small partial correlation between recall
score and the psychasthenia scale suggests
that the correlation between these two variables
is not larger than can be accounted for
by their respective correlations with the
ego-strength scale. Thus the evidence supports
the hypothesis that hysteria and ego strength
are independent variables related to
completed-incompleted task recall, but is
inconclusive with respect to the relation of
recall and psychasthenia.


TABLE 2

CORRELATIONS BETWEEN THE EGO-STRENGTH SCALE
AND COMPLETED-INCOMPLETED TASK RECALL SCORE
FOR THE EXPERIMENTAL AND CONTROL GROUPS

        EXPERIMENTAL    CONTROL
N            43            19
r*          -.60          .38
p           .001         .055

* The significance of r is evaluated via Fisher's z transformation.
The probabilities are for a one-tailed test.


TABLE 3

CORRELATION BETWEEN COMPLETED-INCOMPLETED
TASK RECALL SCORE AND THE HYSTERIA AND
PSYCHASTHENIA SCALES FOR THE EXPERIMENTAL
GROUP

        HYSTERIA    PSYCHASTHENIA
N          40            40
r*        -.21          .28
p          .05          .05

* The significance of r is evaluated via Fisher's z transformation.
The probabilities are for a one-tailed test.
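The partialling step can be checked with the standard first-order partial correlation formula. In the sketch below the function name is hypothetical; the three input coefficients are those reported for the experimental group (psychasthenia with recall score, .28; psychasthenia with ego strength, -.29; ego strength with recall score, -.60). Since these rest on slightly different numbers of Ss, the result is only approximate.

```python
import math


def partial_r(r_xy, r_xz, r_yz):
    """Correlation between x and y with z held constant."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# x = psychasthenia, y = recall score, z = ego-strength scale.
print(round(partial_r(0.28, -0.29, -0.60), 2))
```

The value obtained, .14, matches the partial correlation reported for psychasthenia and recall score.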


DISCUSSION


The present experiment bears some interesting
relations to the results of an extensive
experimental study reported by Eysenck (6).
Using a large hospitalized neurotic sample,
Eysenck succeeded in isolating two principal
factors. There is good reason to believe that
the first of these factors, neuroticism, is the
same variable that is being measured by the
ego-strength scale employed in the present
study. This belief is supported by the close
similarity between the ego-strength scale used
here and a measure that Eysenck found to be
heavily saturated with his neuroticism factor.
He used a slight modification of the
Harrower-Erickson Group Rorschach (8) in which he
had his Ss rank the nine alternative responses
suggested for each of the inkblots in terms of 
its acceptability as an interpretation. Scores 
were obtained by adding the rank values that 
Ss assigned to the four “neurotic” responses 
that were among the nine alternatives for 
each of the blots. This technique was found to
discriminate effectively between the extremes 
of his neurotic population as well as between 
the neurotic population and a sample of 
normals. Although Eysenck termed this a 
measure of "conformity," the similarity of 
his technique and the present measure of ego 
strength in both method and rationale is 
quite obvious. 
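
The rank-sum scoring just described can be sketched as follows. This is a minimal illustration, not Eysenck's procedure as published: the response labels, the choice of which four alternatives count as "neurotic," and the direction of the ranking (1 = most acceptable) are all assumptions made for the example.

```python
def neurotic_rank_score(rankings, neurotic_keys):
    """Sum the ranks a subject gave to the designated 'neurotic' responses.

    rankings: one dict per inkblot, mapping response label -> rank (1..9).
    neurotic_keys: one collection per inkblot naming the four keyed responses.
    """
    total = 0
    for ranks, keys in zip(rankings, neurotic_keys):
        # Each blot must carry a complete ranking of its nine alternatives.
        if sorted(ranks.values()) != list(range(1, 10)):
            raise ValueError("each blot needs ranks 1 through 9 exactly once")
        total += sum(ranks[key] for key in keys)
    return total


# Hypothetical single-blot example: alternatives 'a'..'i' ranked 1..9,
# with 'a', 'b', 'c', 'd' treated as the keyed responses.
one_blot = {label: rank for rank, label in enumerate("abcdefghi", start=1)}
print(neurotic_rank_score([one_blot], [("a", "b", "c", "d")]))
```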

The second factor found by Eysenck was 
termed “extraversion-introversion.” This fac- 
tor was found to characterize hysterias at the 
extraversive pole and anxiety and psychas- 
thenic neurotics at the introversive pole. In 
the present study the hysteria scale was found 
to contain at least two sources of variance. 
One of these sources was in common with the 
ego-strength scale, while the other source was 
independently related to selective recall in 
the completed-incompleted task study. There 
is some basis for assuming that this second
source of variance in the hysteria scale is related
to Eysenck's extraversion-introversion
factor in view of the relationship between
clinically diagnosed hysteria and extraversion.
Additional support for this assumption is 
found in the characterization of the extravert's 
and introvert's reactions to failure. Eysenck 
reports that in level-of-aspiration experiments 
Ss high on the extraversive pole tend to 
overestimate their performance and are little 
bothered by failure, while Ss falling on the introversive
pole tend to underestimate their performance
and are preoccupied with their failures.
Such reaction to failure and threat would,
at least superficially, seem to be related to
differential memory for completed and
incompleted tasks. By emphasizing the successful
tasks and ignoring the failed ones, extraverted 
Ss would be expected to recall relatively more 
completed than incompleted tasks while for 
the introversive Ss this tendency would be 
reversed. 



Such an analysis would, of course, still leave
unexplained why the extraverts use an avoidance
response to threat to a greater degree
than do the introverts. A possible explanation
might be that an avoidance response is more
compatible with the general features of the
extraversive personality, thus permitting its 
use to a greater degree than is possible for the 
introverts. Although this is speculation, it 
should lend itself to experimental test. 

The relation of the present data to Eysenck's 
findings suggests some interesting possibilities 
and avenues for further research, but it must 
be stressed that these relationships have not 
been proven. There is still much to be desired, 
not only in identifying the present data with 
the neuroticism and extraversion-introversion 
factors, but also in clarifying and substantiat- 
ing the relationships obtained in the present 
study. 

One particular point concerns the psychas- 
thenia scale and its intercorrelations with the 
ego-strength scale and recall scores. Although 
the psychasthenia scale was related to the dif- 
ferential recall of completed and incompleted 
tasks in a predicted way, the size of the correlation
was not greater than could be accounted
for on the basis of the intercorrelation of both 
of these measures with the ego-strength scale. 
Whether this reflects on the validity of the 
hypothesis or the validity of the psychasthenia 
scale cannot be determined from the available
data. Further work, using other measures of
psychasthenia and of hysteria and
extraversion-introversion, is needed to clarify these
interrelations.


SUMMARY 


The present experiment investigated the re- 
lation of ego strength, hysteria, and psychas- 
thenia to recall of completed and incompleted 
tasks under conditions where self-esteem was, 
and was not, objectively threatened. A meas- 
ure of ego strength, based upon S's capacity 
to judge what were and were not reasonable 
interpretations of inkblots, was constructed. 
The criterion of reasonableness was determined
by a normative approach. Hysteria and
psychasthenia were measured by their respective
scales in the Minnesota Multiphasic
Personality Inventory.

The results of this study indicate: (a) Ego
strength is directly related to the tendency to
recall relatively more incompleted than
completed tasks when the situation is not
objectively self-esteem threatening, and inversely
related to this tendency when the situation
does objectively threaten self-esteem. (b)
Scores on the hysteria scale are inversely re- 
lated to the tendency to recall relatively more 
incompleted than completed tasks when self- 
esteem is objectively threatened, while scores 
on the psychasthenia scale are directly related 
to this tendency. (c) The correlation between 
hysteria and completed-incompleted task re- 
call was found to be independent of the inter- 
correlation of these two variables with ego 
strength. However, the correlation of psy- 
chasthenia with completed-incompleted task 
recall was not significant when the intercorre- 
lation with ego strength was partialled out. 
The relation of these results to the neuroti- 
cism and extraversion-introversion factors re- 
ported by Eysenck was discussed. 


REFERENCES 


1. Alper, Thelma G. Memory of completed and incompleted tasks as a
   function of personality: an analysis of group data. J. abnorm. soc.
   Psychol., 1946, 41, 403-421.

2. Beck, S. J. Rorschach's test. Vol. I: Basic processes. New York:
   Grune & Stratton, 1944.

3. Eriksen, C. W. Defense against ego-threat in memory and perception.
   J. abnorm. soc. Psychol., 1952, 47, 230-235.

4. Eriksen, C. W. Individual differences in defensive forgetting.
   J. exp. Psychol., 1952, 44, 442-447.

5. Eriksen, C. W., & Lazarus, R. S. Perceptual defense and projective
   tests. J. abnorm. soc. Psychol., 1952, 47, 302-308.

6. Eysenck, H. J. Dimensions of personality. London: Kegan Paul,
   Trench, Trubner, and Co., 1947.

7. Glixman, A. F. Recall of completed and incompleted activities under
   varying degrees of stress. J. exp. Psychol., 1949, 39, 281-295.

8. Harrower-Erickson, M., & Steiner, M. E. Large scale Rorschach
   techniques. Springfield, Ill.: Charles C Thomas, 1945.

9. Lazarus, R. S., Eriksen, C. W., & Fonda, C. P. Personality dynamics
   and auditory perceptual recognition. J. Pers., 1951, 19, 471-482.

10. Lewis, H. B., & Franklin, M. An experimental study of the role of
    the ego in work. II. The significance of task-orientation in work.
    J. exp. Psychol., 1944, 34, 195-212.

11. McReynolds, P. Perception of Rorschach concepts as related to
    personality deviations. J. abnorm. soc. Psychol., 1951, 46, 131-141.

12. Rosenzweig, S. An experimental study of "repression" with special
    reference to need-persistive and ego-defensive reactions to
    frustration. J. exp. Psychol., 1943, 32, 64-74.

13. Rosenzweig, S., & Sarason, S. An experimental study of the triadic
    hypothesis: reaction to frustration, ego defense, and
    hypnotizability. I. Correlational approach. Charact. & Pers.,
    1942, 11, 1-20.


Received September 29, 1952. 




EGO STRENGTH AND THE RECALL OF TASKS¹


SIDNEY M. JOURARD
Emory University


ROSENZWEIG (8) and others (1, 4) have
observed that completed tasks are
recalled with greater frequency than
interrupted tasks when the tasks in question
have been submitted to subjects (Ss) under 
ego-involving conditions. It has become 
fashionable to interpret this peculiarity of 
recall as an instance of ego defense, i.e., as 
indicative of repression of the memory of 
failure or, alternately, as a case of bolstering a 
threatened ego by means of recalling only 
successful performances. 

In order to account for individual differences 
in recall preferences under ego-involving 
conditions a number of workers have had 
recourse to the theory of ego strength. Sanford 
(9), for example, explained the increase in 
recall of interrupted tasks with age as a sign 
of increasing ego strength.

Alper (2) pointed to a “strong ego” recall 
pattern and a “weak ego” recall pattern; under 
task-oriented conditions, weak egos recalled 
more successes than failures, and under
ego-involving conditions, they recalled more
failures than successes. Strong egos behaved 
in the opposite manner. 

The experiments of these two workers have 
certain limitations: both utilize tasks of a
fairly homogeneous sort (jigsaw puzzles in
the case of Sanford, and scrambled sentences
in the case of Alper). It can be argued that the
"Rosenzweig effect" may be related only to
failure on tasks of a limited sort, while an
experiment based on a broader sampling of 
tasks might produce different results. 

Further, Alper's study was based on an N
of 10 cases, a fact which certainly raises the
possibility of unreliability in her findings.
Finally, Sanford did not measure ego strength
in his study, and Alper's estimates of ego
strength appear to be based on clinical judgment
rather than on more objective tests.
The over-all purpose of the present experiment
was to determine if there is any relationship
between ego strength and task-recall
preferences. Ego-strength measures were
derived from the widely used Rorschach test,
and the interruption-of-tasks procedure was
altered to make allowance for the critique
noted above. Finally, the number of Ss was
increased to meet the criticism of a restricted N.

1 The present work is based on a dissertation
submitted to the Graduate School of the University of
Buffalo, in partial fulfillment of the requirements for
the degree of Doctor of Philosophy. The writer
acknowledges with gratitude the assistance and encouragement
from the members of his thesis committee, Drs. M.
os chairman; Walter Cohen, and Llewellyn Gross.


HYPOTHESES 


The following three hypotheses were con- 
sidered in the experiment: 

1. There will be a greater recall of com- 
pleted than of interrupted tasks when the 
tasks are submitted to a group of Ss under 
conditions designed to induce ego involve- 
ment and ego threat. 

Despite the absence of a control group 
taking the tasks under task-oriented conditions 
we expected our data to support this hy- 
pothesis, in view of the findings of other 
workers (1, 4, 8, 9).

2. Preference for the recall of interrupted 
and completed tasks will vary as a function of 
ego strength. Strong egos will recall a pre- 
ponderance of interrupted tasks, and weak 
egos will recall a preponderance of com- 
pleted tasks. 

This is the major hypothesis of the present 
study, and it is formulated in accord with the 
view propounded by Sanford (9). According 
to this view, recall of failures (interrupted 
tasks) betokens a relatively strong ego which, 
in the experimental situation, does not resort 
to the repression of the memory of these tasks. 

3. For all of our Ss, irrespective of measured 
ego strength, tasks of one category will be 
recalled with no greater frequency than tasks 
in any other category.

This null hypothesis was formulated in order
to study what, if any, effect on recall was
produced by the introduction of a fairly broad
range of different tasks into the experiment.
If all of the tasks are equivalent in
ego-involvement induction, then the hypothesis
should be upheld. Contrariwise, if the tasks
do induce differential ego involvement, it is
possible that task recall may reflect this, either
in terms of the frequency with which tasks of
a given category are recalled or in terms of
interruption-completion recall patterns which
differ from category to category.


METHOD 
Apparatus 


The interruption-completion (I-C) test. Thirty tasks 
were devised by the writer so that each could be used 
as an interrupted or as a completed task by varying 
the length of assignment of each task. Thus, for the 
task “necklace,” if it was to serve as an interrupted 
task, the instructions were as follows: “Make a necklace 
from these 24 beads such that no two identical beads 
are touching.” 

If this particular task was to be completed, S was
given only 12 beads, but with identical instructions.

In order to vary the tasks as widely as possible, they 
were designed so as to sample five broad categories. 
The categories, with an example illustrating each, 
were: 

1. Naming tasks. “Name twelve presidents of the USA.”

2. Mathematical tasks. “Multiply 24537 by 3789.”

3. Grammar tasks. “List 10 superlatives.”

4. Drawing tasks. “Draw 23 stick figures of a person.”

5. Manual tasks. “Put these blocks together to make a pattern like
the one on the card.”

There were six tasks for each category, and names 
were assigned to each task so as to describe it broadly. 
The five tasks noted above were named, in order, 
presidents, multiplication, superlatives, stick persons, 
and block pattern. 

In order to rule out the effects of serial order on 
recall, the position of each task in each presentation 
series was determined from a table of random numbers. 
The order of interruption-completion was likewise 
assigned from a random-number table. Each S, thus, 
received a different I-C test, although it was composed 
of similar parts. Each task, furthermore, appeared in 
each of the 30 positions of the series to an approximately 
equal degree, and served as an interrupted and as a 
completed task equally. 
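
The randomization just described might be sketched as below. This is only an illustration under stated assumptions: a seeded pseudo-random generator stands in for the printed table of random numbers, the task names are hypothetical placeholders, and the across-subject balancing of positions and of interrupted versus completed status mentioned in the text is not enforced here.

```python
import random


def make_ic_test(task_names, rng):
    """Build one subject's series: random task order, half the tasks interrupted."""
    order = list(task_names)
    rng.shuffle(order)  # random serial position for every task
    half = len(order) // 2
    status = ["interrupted"] * half + ["completed"] * (len(order) - half)
    rng.shuffle(status)  # random interruption-completion sequence
    return list(zip(order, status))


# Hypothetical placeholder names for the 30 tasks.
tasks = [f"task_{i:02d}" for i in range(1, 31)]
series = make_ic_test(tasks, random.Random(1954))
for position, (name, kind) in enumerate(series[:5], start=1):
    print(position, name, kind)
```

Passing a separate seeded generator for each S gives every subject a different series built from the same 30 tasks.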

Measures of ego strength. Two scales derived from 
the Rorschach test served as ego-strength measures. 
One of these, a six-point scale, was devised by the 
writer. The other, a 15-point adjustment scale,² was
adapted from the one utilized by Davidson (3). As the 
latter is described elsewhere, only the first will be 
described in detail herein. 

2 Two factors from Davidson’s scale were deleted in
the present study owing to relative difficulty in rating
them. These two factors were “shading shock” and
“color shock.”

The writer’s scale was based upon five Rorschach
factors which have been employed both clinically and
experimentally as indices of ego strength. The factors
and the limits within which they were restricted in
order to yield a one-point score are: (a) F+% for
total record between 80-90; (b) F+% for colored
cards between 70-100; (c) F% between 30-50; (d) M
greater than FM; (e) FC greater than CF plus C.
From this scale, an ego-strength score could be obtained
which would yield a range of six points, viz., 0-5.
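
The five-sign scale above amounts to a simple count, which can be sketched as follows. The function and parameter names are hypothetical, and treating the stated limits as inclusive is an assumption, since the text does not say how boundary values were scored.

```python
def ego_strength_score(f_plus_total, f_plus_color, f_pct, m, fm, fc, cf, c):
    """Count how many of the five Rorschach signs fall within their limits (0-5).

    Percentages are given on a 0-100 scale; m, fm, fc, cf, c are response counts.
    Limits are treated as inclusive (an assumption).
    """
    signs = [
        80 <= f_plus_total <= 90,   # (a) F+% for total record
        70 <= f_plus_color <= 100,  # (b) F+% for colored cards
        30 <= f_pct <= 50,          # (c) F%
        m > fm,                     # (d) M greater than FM
        fc > cf + c,                # (e) FC greater than CF plus C
    ]
    return sum(signs)


# Hypothetical record: meets signs (a), (c) and (d) only.
print(ego_strength_score(f_plus_total=85, f_plus_color=60, f_pct=40,
                         m=3, fm=2, fc=1, cf=1, c=1))
```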


Subjects 

Fifty-eight nursing students enrolled in the Uni- 
versity of Buffalo School of Nursing served as Ss. 
Thirty-four, with a median age of 20, were in their 
junior year, while the remainder were seniors, with a 
median age of 21 years. The girls were selected at 
random from a larger group of about 150 students. 


Procedure 


The Ss were tested singly, one in the morning and
one in the afternoon. The general sequence is summarized
as follows: (a) introductory comments;
(b) administration of the Rorschach test;
(c) administration of the I-C test, 45 seconds for each task;
(d) interview for biographical data for five minutes;
(e) S writes out her recall of tasks for three minutes;
(f) administration of a series of digit-symbol tasks,
lasting 14 minutes; (g) S writes out her recall of tasks
again; (h) S is required to sort into categories the cards
bearing the names and description of each task;
(i) S is required to sort the cards into two piles, those
tasks which were liked and those which were disliked.


Verbatim instructions and description of the 
procedure are not presented here, owing to space 
limitations, but may be seen elsewhere.⁴

Induction of an atmosphere of stress. In order to
induce a set of ego involvement, the I-C test was
presented as a college-level test of intelligence. The
apparatus was drawn from a conspicuously labeled
Wechsler box, and a stop watch was displayed and
used throughout to record times. Since only 45 seconds
were allowed for each task, each S experienced the
sensation of being rushed and of failing one half of the
tasks. In addition, each S was told that “in order to
receive IQ credit for a given item, it is necessary that
it be completed correctly; partial credit is not allowed.
Each item is passed or failed on an all-or-none basis.”
No independent measure was obtained of the existence
or magnitude of stress induced by this procedure, but
from the spontaneous comments, the headaches
reported, the perspiration exuded, and the number of
cigarettes smoked, it appeared to E that at least
some stress was induced in Ss. No S appeared, to the
writer, to be untouched by the procedure. Each S
thus underwent the experience of being rushed on a test
of intelligence, and of failing one half of the items in
the presence of a man who apparently expected her to
be able to do all of the tasks.

3 Rated by comparison with Hertz’s frequency
tables (6).

4 To save printing costs, this information and the
raw data for the experiment have been deposited with
the American Documentation Institute. Order Document
No. 3978 from ADI Auxiliary Publications
Project, Photoduplication Service, Library of Congress,
Washington 25, D. C., remitting $1.75 for 35-mm.
microfilm or $2.50 for photocopies. Make checks payable
to Chief, Photoduplication Service, Library of
Congress. The original dissertation (7) is available
through regular library facilities.


RESULTS 


General recall results. General recall data
are shown in Table 1. The symbols at the head
of each column in this table refer to ratios
which were computed from the recall data.
$I/(C+I)$ is the ratio of interrupted tasks recalled
to all tasks recalled. Thus, if an S recalled 10
tasks and five of these were interrupted, his
score would be 50. Each of these ratios was
then multiplied by 100; throughout the report,
all recall ratios are corrected in this manner.
The next two symbols, $IR/15$ and $CR/15$, describe
the proportion of interrupted and completed tasks
(15 in each case) which were recalled. The fourth
column heading, $D_{IR-CR}$, describes the
discrepancy between $IR/15$ and $CR/15$. The third row
in this table shows the correlation between
these indices for the first and second recall
periods. Since they appeared to be rather
high, signifying fair reliability, subsequent
analyses between recall data and the Rorschach
data were based upon the ratios derived from
the first recall period.
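
The first three recall ratios can be computed as in the following sketch, which reuses the worked example from the text (10 tasks recalled, five of them interrupted). The function name and dictionary keys are hypothetical, and the discrepancy measure is left out because its formula is not legible in the source.

```python
def recall_ratios(interrupted_recalled, completed_recalled, tasks_per_kind=15):
    """Return the recall ratios, each multiplied by 100 as in the report."""
    total_recalled = interrupted_recalled + completed_recalled
    if total_recalled == 0:
        raise ValueError("no tasks recalled; I/(C+I) is undefined")
    return {
        # interrupted tasks recalled relative to all tasks recalled
        "I/(C+I)": 100 * interrupted_recalled / total_recalled,
        # proportion of the 15 interrupted tasks that were recalled
        "IR/15": 100 * interrupted_recalled / tasks_per_kind,
        # proportion of the 15 completed tasks that were recalled
        "CR/15": 100 * completed_recalled / tasks_per_kind,
    }


# Worked example from the text: 10 tasks recalled, five of them interrupted.
print(recall_ratios(interrupted_recalled=5, completed_recalled=5))
```

A value of 50 on the first ratio means interrupted and completed tasks were recalled equally often.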

The validity of the first hypothesis may be
examined in the light of these data. If there
were a clear-cut group preference for the
recall of completed tasks, it would have been
signified by an $I/(C+I)$ ratio significantly less
than 50.00. It may be seen from Table 1 that
the obtained value for this ratio was almost
exactly 50.00, thereby refuting the hypothesis.

A slightly different way to test the hypothesis
is to test the reliability of the difference
between the two ratios $IR/15$ and $CR/15$.
Table 2 shows the result of such a comparison,
signifying that the proportion of interrupted
tasks recalled is almost exactly equal to the
proportion of completed tasks recalled.

We may conclude from this analysis, there- 
fore, that no preference, either for the recall 
of completed or of interrupted tasks, is 
manifested by the Ss. 
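
The comparison in Table 2 can be retraced with the usual formula for the standard error of a difference between correlated means. The sketch below is a reconstruction under assumptions, not the author's worksheet: it takes N = 58, uses the means and SDs printed in Table 2, and assumes r = .17 between the two ratios, which is the value implied by the other entries in the table (the printed entry is partly illegible).

```python
import math

n = 58                             # number of Ss
mean_ir, mean_cr = 48.33, 49.43    # mean IR/15 and CR/15 (x 100), from Table 2
sd_ir, sd_cr = 14.15, 14.88        # standard deviations, from Table 2
r = 0.17                           # assumed correlation between the two ratios

# Standard error of each mean.
se_ir = sd_ir / math.sqrt(n)
se_cr = sd_cr / math.sqrt(n)

# Standard error of the difference between two correlated means.
se_diff = math.sqrt(se_ir ** 2 + se_cr ** 2 - 2 * r * se_ir * se_cr)

t = (mean_cr - mean_ir) / se_diff

print(f"SE(IR/15) = {se_ir:.2f}, SE(CR/15) = {se_cr:.2f}")
print(f"SE of difference = {se_diff:.2f}, t = {t:.2f}")
```

Under these assumptions the results agree with the tabled standard errors, and with the tabled SD of the difference and t to within rounding.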

Ego strength and recall preferences. The 




TABLE 1
MEANS AND STANDARD DEVIATIONS OF FOUR RECALL MEASURES

                    RECALL MEASURES
            I/(C+I)    IR/15    CR/15    D(IR-CR)*
Mean         49.72     48.33    49.43     33.07
SD           12.84     14.16    14.88     20.51
r1,2**         15       .69      .70       .62
* Dig cR obtained according to the formula [E = P3 
+ 33. 
** r1,2 refers to the correlation between measures from the
first and the second recall periods.


TABLE 2
COMPARISON OF PROPORTIONATE RECALL OF INTERRUPTED TASKS (IR/15)
WITH PROPORTIONATE RECALL OF COMPLETED TASKS (CR/15)

                            RECALL MEASURES
                            IR/15      CR/15
Mean                        48.33      49.43
SD                          14.15      14.88
SD of mean                   1.86       1.95
Diff. IR/15 - CR/15               1.10
r, IR/15 x CR/15                   .17
SD of difference                  2.45
t                                  .45


distribution of scores on the author’s six-point 
ego-strength scale was skewed in the direction 
of lower scores; the mean was 2.05, with a 
standard deviation of 1.08. 

Scores on the Davidson Adjustment Scale 
yielded a more nearly normal distribution, with 
a mean of 8.26, and a standard deviation of 
2.64. The range was from 3 to 15. 

Owing to the skewed distribution for the 
ego-strength scale, contingency coefficients 
rather than Pearsonian r's were utilized in 
correlating scores for this variable with the 
recall data. Pearsonian r's were calculated 
for the adjustment scale scores and recall
measures. 

The second hypothesis states that the 
stronger the ego, the greater the tendency to 
recall interrupted tasks. If the hypothesis is 
to be upheld, it should be reflected in correlations
between the ego-strength scores and the
recall ratios. Table 3 shows that none of the
correlations attains statistical significance.

A study was then made of the correlation 
between each of the five ego-strength scale 
components and the recall ratios. Each of the 
Rorschach factors was first converted as 
follows: 

M greater than FM: expressed by the
formula (M − FM) + 5, and
denoted by the symbol (M > FM).

F% between 30 and 50: expressed as deviations
from 40% (F% = 30 to 50).

FC greater than CF + C: expressed by the
formula [FC − (CF + C)] + 6, and indicated
by the symbol (FC > CF + C).

F+% 80 to 90: expressed as deviations from
85% (F+% 80 to 90).

F+% colored cards 70 to 100: expressed as
deviations from 85% (Fc+% 70 to 100).

The correlations between these converted 
ego-strength signs and the recall scores are 
shown in Table 4. Only two of the r's were 
sufficiently large to attain the .05 level— 


TABLE 3
CORRELATIONS OF THE EGO-STRENGTH AND DAVIDSON'S
ADJUSTMENT SCORES WITH THE
RECALL RATIOS


RECALL MEASURES 


SCALE IR CR 
15 15 Cu] Duo 
rx 
Ego-strength scores* .01 — .03 .02 .02 
Davidson adjust- —.17 —.02 AS .07 
ment scores? 


* Contingency coefficients based on an N of 47; eleven cases with
ego-strength scores of 2 eliminated from this analysis.
† r = .26 for .05 level; r = .33 for .01 level, N = 58.


TABLE 4 


CORRELATIONS BETWEEN EGO-STRENGTH SIGNS AND
RECALL SCORES

                         RECALL MEASURES
RORSCHACH FACTOR     I/(C+I)    IR/15    CR/15    D(IR-CR)


M > FM VERMIN oat | DONARS 

F% 30 to 50 —05 —.04  .03 ed 
FC>CF+C — —.02 —.08 09 _'06 
F + % 80 to 90 „17 08 —26* — 13 
Fo 4.957010100 — .12 — .01 —22 09 


* r = .26 for .05 level.
** r = .33 for .01 level. N = 58.




that between the M > FM factor and the recall ratio $I/(C+I)$,
with an r of .27, and that between the F+% factor and the
recall ratio $CR/15$, with an r of −.26. These correlations
have contradictory connotations. The first,
that between M > FM and $I/(C+I)$, upholds
the hypothesis; the positive direction of the r
signifies that the greater the positive discrepancy
between M and FM, the greater is
the tendency to recall interrupted tasks in
preference to completed ones. The slight
magnitude of the correlation, however, indicates
that only a very small amount of the
variance in this recall ratio is attributable to
the ego-strength factor.

The negative correlation of −.26 between
the F+% factor and the $CR/15$ ratio is opposed
to the hypothesis. The recall ratio reflects a
preference for the recall of completed tasks,
and the r may be roughly interpreted to
indicate that the greater the deviation of
F+% from the optimal limits of 80-90 (which
we have interpreted as ego weakness), the
lesser is the tendency to recall completed tasks.
This particular ego-strength sign, then, appears
to refute the present hypothesis, and
to uphold Alper’s inferences.


This scanty yield of positive findings
prompted the writer to undertake a correlational
analysis of a number of other Rorschach
factors and the recall ratios. R, W%, DdS%,
F+%, F%, H%, At%, and A% were all
correlated with each of the four recall ratios.
All yielded insignificant r’s, with one exception.
A%, the percentage of responses with animal
content, correlated −.30 with $I/(C+I)$ and .3…
with the ratio $CR/15$. Both of these r's are
significant at the .02 level. While these findings
are post hoc, it should be noted that they
support our hypothesis. A% is generally
regarded as a measure of stereotypy in thinking
and behavior by Rorschach workers,
especially if it occurs in excess in a given
record. Indeed, Davidson included A% within
certain optimal limits as one of the factors
in her adjustment scale. The negative r
between A% and the ratio $I/(C+I)$ shows that
a decrease in preference for the recall of
interrupted tasks is associated with an increase
in A%. The positive r between A%
and the ratio $CR/15$ shows that as A% increases,
the proportion of completed tasks that are
recalled also increases.

These opposed results from two Rorschach
factors which were thought to sample the
same trait universe, and the unforeseen findings
for A%, raised the general question of the
extent of independence of each factor from
the others.

The interrelations of the ego-strength factors.
A study of the interrelations among the five
factors of the ego-strength scale, converted in
the manner described above, along with A%,
yielded the data in Table 5. The M > FM
factor is seen to correlate significantly with
the F% factor at the .01 level. Yet the latter
did not correlate with the recall ratio $I/(C+I)$
in the same direction as the M > FM factor,
or to a significant degree. The F% factor did,
however, correlate at the .05 level with the
FC > CF + C factor, yet neither of these
correlated significantly with any of the recall
ratios. The FC > CF + C factor correlated
significantly with the F+% factor, but not
with the $CR/15$ recall ratio. The F+% factor,
except for its low correlation with the FC >
CF + C scores noted above, correlated with
only one other factor, F+% on colored cards,
which is a spurious r.

The A% correlated most highly with all of
the ego-strength factors except the F+% on
colored cards.

Thus, it appears that some of the Rorschach
factors included in the ego-strength scale are
intercorrelated, but not in a simple or uniform
manner, nor to a very high degree.

Task categories and task recall. The third 
hypothesis was concerned with the effects of 
different kinds or categories of tasks on recall. 

The writer had devised tasks so that they
would represent five different types, with six
tasks to a type. It is relevant to inquire if the
Ss perceived the tasks as belonging to the
categories intended by the writer. Part of the
experimental procedure consisted in having


TABLE 5
CORRELATION MATRIX FOR RORSCHACH EGO-STRENGTH
FACTORS
e i 8 
è o n 2 
RORSCHACH Ed ar iS & 
Facror E] 8 R kd E 
D 9 E 
Au AT CERE 
= E g ma X SU 
M > FM —.38** .002 —.05 —.03 —,35%* 
- F%w to so ` 25 FREN] 32* 
FC» CF-C —.29* 08 as» 


F + % 80 to 90 so 67t —,40"* 
Fo +% 70 to 100 


* Significant at the .05 level. 
** Significant at the .01 level. 


Ss sort cards on which were typed the name
and a brief description of each task into piles
which “seemed to them to belong together.”
After the cards were sorted, S was asked what
it was about each group that made the cards
belong together. In each case, agreement with
the writer's a priori grouping was high. Considering
each task singly, agreement with
the writer's categories ranged from 61 to
96 per cent. By categories, agreement ranged
from 65 per cent for the naming tasks to
92 per cent for the mathematical tasks. Over-all
mean agreement was 77 per cent. It is
evident, then, that Ss’ perception of task
categories agreed with the arrangement into
categories that was intended by the writer.

An analysis was undertaken of the number
of tasks recalled in each category and of
recall ratios for each task category. The
latter study was predicated on the assumption
that recall ratios denoting preference for
interrupted or completed tasks which were
based on all tasks might obscure recall preferences
that might appear only in certain classes
of task. Analysis of variance for correlated data
was undertaken for both of these studies, and
both F ratios for between-groups were
insignificant, as shown in Tables 6 and 7.

Table 6 is straightforward, signifying that
there were no significant over-all differences
between categories in the number of tasks
recalled from each task category. Table 7
requires some clarification. The first row of
numbers in the upper half of the table refers
to recall ratios computed from the formula
$I/(C+I)$ for each task category. It will be
recalled that a score of 50.00 signifies equal
TABLE 6
VARIANCE TABLE FOR COMPARISON OF THE NUMBER OF
TASKS RECALLED IN EACH CATEGORY*

SOURCE         SUM OF SQUARES     df     VARIANCE ESTIMATE
Persons              94.33         57          1.65
Categories            9.61          4          2.40
Residual            366.39        228          1.61
Total               470.33        289

* F ratio for between categories: 1.49; 2.37 required for .05 level
with 4 and 228 df.


TABLE 7
VARIANCE TABLE FOR COMPARISON OF RECALL
PREFERENCES IN EACH TASK CATEGORY*

MEAN RECALL RATIO I/(C+I) FOR EACH CATEGORY OF TASK

NAMING    MATHEMATICS    GRAMMAR    DRAWING    MANUAL
58.83        54.86        51.86      53.72      41.90

VARIANCE ANALYSIS

SOURCE              SUM OF SQUARES     df     VARIANCE ESTIMATE
Task categories          1283.67         4         320.92
Individuals             26021.27        57         456.51
Remainder               83014.73       228         364.10
Totals                 110319.67       289

* Recall preferences were tabulated as deviation scores according
to the formula X = I/(C+I) − 50.0.
F ratio for between-categories = .88.


recall of completed and interrupted tasks;
a score below 50.00 signifies greater recall of
completed than of interrupted tasks, and a
score greater than this number signifies the
reverse recall trend. If analysis of variance
were computed for these recall ratios as they
stand, it is possible that a significant F ratio
for between-groups might be obtained, but it
would not reveal the significance of the
deviation from 50.00 of each task category.
Consequently, a deviation score was computed
for each S in each task category, according
to the formula:

$$\text{deviation score} = \frac{I}{C+I} - 50.00.$$

The variance analysis shown in the lower
half of Table 7 reveals that the observed
differences from 50.00 are not significant.




We may conclude from these analyses that 
no class of task was favored in recall over any 
other, nor did any class of task show any 
reliable recall preferences. 
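
The F ratios behind these conclusions can be rechecked from the sums of squares and degrees of freedom printed in Tables 6 and 7. The sketch below assumes, as the tabled values suggest, that each between-categories variance estimate is tested against the residual (remainder) variance estimate; the function name is hypothetical.

```python
def f_ratio(ss_effect, df_effect, ss_error, df_error):
    """F ratio: effect variance estimate divided by error variance estimate."""
    return (ss_effect / df_effect) / (ss_error / df_error)


# Table 6: number of tasks recalled in each category.
f_table6 = f_ratio(ss_effect=9.61, df_effect=4, ss_error=366.39, df_error=228)

# Table 7: recall preferences (deviation scores) in each category.
f_table7 = f_ratio(ss_effect=1283.67, df_effect=4, ss_error=83014.73, df_error=228)

print(f"Table 6 F = {f_table6:.2f}")
print(f"Table 7 F = {f_table7:.2f}")
```

Both values fall below the 2.37 cited in the Table 6 footnote as required for the .05 level, in line with the statement that neither analysis was significant.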


Discussion 


The absence of a group recall preference. 
Fairly extensive literature has been reported 
which seems to uphold the generalization that 
more completed than interrupted tasks are 
recalled by persons who undertake the tasks
under stressful conditions. The absence of a 
group recall preference of this sort in the 
present experiment seems to challenge the 
generality of this proposition. 

It may be argued that the Ss in the present 
study were not “stressed” or “threatened” 
by the conditions which were aimed at the 
production of this state; the writer did not 
allow for an independent estimate of the 
magnitude of the stress induced. Yet, there 
appeared to be ample clinical evidence that 
each S felt disturbed at "failing" so many of 
the problems, at appearing "stupid." Many of 
the girls felt that if they “had more time” 
they “could have done better.” 

Another criticism which might be leveled at 
the present study concerns the selection of Ss. 
It has often been flippantly stated by experi- 
mentalists of the writer’s acquaintance that 
nurses are the exception to any psychological 
rule. Yet, both on the Rorschach test and on
the recall scores Ss manifested wide variability;
while the group was homogeneous in
occupation and socioeconomic background,
they certainly were not on the variables
considered in this work.

A more likely explanation for the absence of 
a clear-cut “Rosenzweig effect” may lie in the 
factor of task variability. The over-all recall 
ratio for all tasks was, as noted above, not 
significantly different from 50.00; when recall 
ratios were plotted for each task category
separately, no reliable differences were found.
The observed differences between the categories 
in recall ratios do suggest, however, that 
further study of the effect of different tasks on 
recall preferences is warranted. A definitive 
answer might be provided by a study of the 
present sort conducted on two groups of Ss 
with widely different interests and values— 
say, a group of artists and a group of chartered
accountants. If both accountancy tasks and
art problems were presented to each group, and
half in each category were interrupted under
stress conditions as well as under neutral 
conditions, it might reveal reverse findings for 
the two groups. 

In any event, we do not know precisely why
a pronounced recall preference was not ob- 
tained in the present work, and this constitutes 
a serious deficiency in experimental design. 
Its absence, however, in an experiment similar 
in broad design to that of other workers, presents a challenge to the generality of the
conclusions reported by other workers with 
the interruption-of-tasks technique. 

Ego strength and task recall. Neither the 
six-point ego-strength scale of the present 
writer, nor the Davidson Adjustment Scale 
correlated significantly with the measures of 
recall. In so far as the two scales may be 
regarded as ego-strength measures, the major 
hypothesis of the study appeared to be refuted. 
But the data do not warrant such a blanket 
assertion, since three of the Rorschach factors 
showed nonchance correlation with recall 
measures. One of the five originally selected 
components of the ego-strength scale, M > FM, 
and an unexpected factor, the A %, correlated 
with the recall ratios in a manner upholding 
the hypothesis, while the F+ %, in so far as it 
is an ego-strength index, correlated slightly
with the tendency to recall completed tasks.
It is not simple to interpret these findings.

In an individual Rorschach protocol, the
person who achieves a preponderance of M
over FM, who has an A% within the limits of
about 30 to 50 per cent, and whose F+% lies
between 80 and 90 per cent is regarded as
fairly well adjusted (provided other aspects of
the record are not pathognomic).

The data in column six of Table 5, however, 
do not uphold this theoretically expected 
correlation; low A% scores are related to a
high preponderance of M over FM, but not
to a low deviation from the optimum F+% of 85.
If all three of these factors measure ego
strength, why, then, do the M > FM and A%
factors correlate with a tendency to recall
a preponderance of interrupted tasks, while
the F+% factor correlates with the reverse
recall preference?

These data do not provide any answer to 
this question. One implication which may be 
drawn from our results, however, is that if our
two sets of variables (Rorschach data and
recall scores) are indeed phenotypic expressions of the construct ego strength, then the
relationships between them should have been 
higher. It is likely that a host of variables 
besides the Ss’ ego strength functioned so as to 
determine the arrays of scores that we ob- 
tained. A further implication may be drawn 
concerning the intercorrelations among the 
Rorschach factors—they seem to signify that 
ego strength may be a many-faceted trait, with 
dimensions that can vary independently of 
each other. 

For future testing of propositions arising 
from a theory of ego strength, it will be neces- 
sary to formulate the theory more precisely 
than has heretofore been done (although 
Harris [5] has taken some preliminary steps 
in this direction), and to develop more refined 
measurement techniques than the present
ones. 


SUMMARY 


A study was made of the relation between 
selected Rorschach factors believed to measure 
ego strength and recall preferences obtained 
from a modified interruption-of-tasks procedure.

Fifty-eight female Ss were tested, and the 
data were studied for their bearing on three 
hypotheses: 

1. There will be a greater recall of com- 
pleted than of interrupted tasks when the 
tasks are submitted to Ss under conditions 
designed to induce ego involvement and ego 
threat. 

2. Preference for the recall of interrupted 
and completed tasks will vary as a function of 
ego strength. Strong egos will recall a pre- 
ponderance of interrupted tasks, and weak 
egos will recall a preponderance of completed 
tasks.


3. For all of our Ss, irrespective of measured 
ego strength, tasks of one category will be 
recalled with no greater frequency than tasks 
in any other category.

The first hypothesis was rejected; recall 
ratios for the group as a whole did not reveal 
any recall preference, either for completed or 
for interrupted tasks. 

Two Rorschach factors, M > FM and A%,
correlated slightly (at the .05 level) with
recall ratios in a manner which supported the
ego-strength hypothesis. One factor, F+%
of 80 to 90, correlated with the recall data in a
manner opposed to the hypothesis. None of
the other ego-strength measures correlated
significantly with the recall data.

It was concluded that further development 
and testing of a theory of ego strength awaits 
more precise spelling out of propositions and 
the development of more adequate measuring 
instruments than those used herein. 


REFERENCES 


1. Alper, Thelma G. Memory for completed and
incompleted tasks as a function of personality: an
analysis of group data. J. abnorm. soc. Psychol.,
1946, 41, 403-420.

2. Alper, Thelma G. Memory for completed and
incompleted tasks as a function of personality:
correlation between experimental and personality
data. J. Pers., 1948, 17, 104-137.

3. Davidson, H. H. A measure of adjustment obtained
from the Rorschach protocol. J. proj. Tech., 1950,
14, 31-38.

4. Glixman, A. F. Recall of completed and incompleted
activities under varying degrees of stress.
J. exp. Psychol., 1950, 39, 281-295.

5. Harris, R. E. Psychodiagnostic testing in psychiatry
and psychosomatic medicine. In M. R. Harrower
(Ed.), Recent advances in diagnostic psychological
testing. Springfield, Ill.: Charles C Thomas, 1950.
Pp. 13-29.

6. Hertz, M. Frequency tables to be used in scoring the
Rorschach inkblot test. Cleveland: Brush Foundation,
Western Reserve Univer., 1936.

7. Jourard, S. M. A study of ego strength by means
of the Rorschach test and the interruption of
tasks experiment. Unpublished doctor's dissertation,
Univer. of Buffalo, 1953.

8. Rosenzweig, S. Experimental study of repression
with special reference to need-persistive and
ego-defensive reactions to frustration. J. exp. Psychol.,
1943, 32, 64-74.

9. Sanford, R. N. Age as a factor in the recall of
interrupted tasks. Psychol. Rev., 1946, 53, 234-240.


Received September 30, 1952. 


SEXUAL BEHAVIOR OF UNIVERSITY STUDENTS 
IN THE ARAB NEAR EAST 


LEVON MELIKIAN AND E. TERRY PROTHRO 
American University of Beirut 


Comparative data on the incidence of
various kinds of sexual behavior in
different cultures are quite meager.
Anthropologists have gathered some information on this subject, but their efforts have been
concentrated on isolated and preliterate
("primitive") groups and their reports largely 
confined to generalizations. While this work is 
of great value in pointing out possible pat- 
terns of behavior, it does not lend itself to de- 
tailed cross-cultural comparisons. There are 
almost no studies of non-Western groups 
which can be compared with American data. 
Wolman's investigation of Israeli youth (4) 
seems to be a step in this direction, but it is 
clear from his discussion that his subjects are 
drawn from Western culture. 

In the investigation here reported we have 
examined sex practices and beliefs of a group 
of university students in the Arab Near East 
with a view to comparing the results with 
those of American investigations. 


METHOD


The difficulty of securing frank and honest replies 
to questions about sex practices is at least as serious 
in the Near East as in the United States. It was there- 
fore important that a procedure be employed which 
would maximize the possibility of obtaining accurate 
and reliable responses. Moreover, it was important
that the approach used should yield data which could
be compared with data from American studies. The 
questionnaire approach developed by Finger (1) seemed 
most suitable for these purposes. It can be used under 
circumstances which are unlikely to give rise to serious 
objection from subjects and it yields reliable data. 
Kinsey et al. (2, p. 25) found that this approach yielded 
data on college males which were comparable with 
those obtained by the interview technique. Obviously,
the dangers of rumor and of developing opposition are
far less under the questionnaire than under the inter- 
view approach. 


The Questionnaire


The subjects (Ss) were drawn from advanced psy- 
chology classes at the American University of Beirut 
so that it was possible to lead up to the administration 
of the questionnaire in a logical and objective fashion. 
Each class was told that accurate data on sex practices 
are quite incomplete. Care was taken not to give data 
on American and other studies until after the adminis- 
tration of the questionnaire. 


The questionnaire was given in large classrooms, 
with the students spaced apart so as to make observa- 
tion of other papers impossible. An introductory speech 
was made by the senior author in which he stressed the 
anonymity of the questionnaire and the need for co- 
operation. Finger’s “prefacing instructions,” which 
appeared on page one of our questionnaire, were then 
read aloud. These instructions made it clear that S
could omit any or all of the questions he wished, and 
that the omission would in no way be detectable either 
by the instructor or by classmates. We did not ask the 
student to place any private “trademark” on his paper 
lest this raise a question of identification. A blank 
sheet of paper was stapled on the front of the question- 
naire, so that all responses were completely private. 

There appeared to be excellent cooperation. The Ss 
worked carefully and with great seriousness. Only one 
paper was returned blank. 

The questionnaire itself consisted of 30 questions 
divided into the following categories: sources of in- 
formation about sex, dreams of sex, masturbation, 
homosexuality, heterosexuality, and general attitude.
Definitions of the terms nocturnal emission, masturba- 
tion, heterosexual intercourse, and homosexual intercourse 
were provided. The definitions explicitly restricted 
the terms to acts in which orgasm and ejaculation 
occurred. 

Personal data on Ss’ nationality, etc. were obtained 
at another session of the class, so that anonymity would 
be complete. 


Subjects 

A total of 113 Ss returned the questionnaire. All 
were male students in advanced psychology classes 
at this university. Approximately half of them were
Lebanese and another 45 per cent were from the nearby 
states of Syria, Jordan, and Iraq. The remaining few 
came from Bahrein Island, Cyprus, Saudi Arabia, 
Ethiopia, Egypt, and the Sudan. Thus, almost all Ss 
could be considered Arabs, and all came from the area 
where Islamic culture is dominant. Distribution with 
respect to religious sects was similar to that for the 
entire university. Moslems (Sunni, Shi’ite, and Druze) 
and Christians (Greek Orthodox, Armenian Orthodox, 
Catholic, and Protestant) were about equally repre- 
sented. 

The age range of Ss was from 17 to 28, with both 
mean and median at 21.0. Sixty per cent of the students 
were 19, 20, or 21 years of age. “Age” was defined as age 
at last birthday. 


Cultural Background of Subjects 


In spite of the dangers of oversimplifying and over- 
generalizing, it seems desirable to set forth some state- 
ment of Arab Near-Eastern culture as it relates to sex 
so that the data gathered here may be meaningful.

The subject of sex is avoided in the home, and the
prevailing parental attitude is that the entire question 
of sex should wait upon marriage. Boys receive so little 
sex training that even nocturnal emissions may produce 
strong feelings of guilt. Masturbation is condemned 
more severely than in the United States. The “peer 
code" of youth often accepts sex as a masculine activity, but views masturbation as far more shameful
than visiting prostitutes. There is a taboo on homo- 
sexuality, but it is not so strong as in America. The 
active homosexual role in particular is thought of as 
compatible with virile masculinity. There is a double 
standard of morality. Patai's comments on the whole 
Middle East are applicable in this region: 

"Marriage customs, sex mores, the position of 
women, and the division of labor between men and 
women are completely analogous, and in many cases 
identical, in the nomadic camp, the agricultural village, 
and the town (with the exception of the thin middle 
and upper classes in the latter). Marriage is highly 
endogamous. Polygamy is permissive rather than 
customary, and certainly fewer than 5 per cent of the 
married men have more than one wife simultaneously. 
The relationship between the sexes is governed by a 
rigid code of sex mores and ethical ideals with special 
emphasis on female purity and chastity, both premarital 
and postmarital. . . . The traditional mores—not recog- 
nized by state laws—afford men the right to kill their 
female dependents if apprehended in an illicit sex 
relation" (3, p. 9). 

Dating and romantic love are much rarer here than 
in America. “Petting” is rare, for a girl can hardly be 
considered “naughty but nice.” There are some “virgin 
prostitutes” who will for money engage in “petting” 
to the point of an orgasm. The distinction between 
“good” and “bad” girls is very sharp, and it plays an 
important role in Arab films. Virginity at marriage is 
extremely important. The Moslem marriage contract 
has a space for specifying whether the bride is a virgin. 
In many villages, Christian and Moslem, a blood- 
stained sheet is exhibited after the first night of marriage, as evidence of the bride's virginity and the
groom's virility.


RESULTS 


Sex practices. The replies of our 113 Ss with 
respect to their sex behavior are summarized 
in Table 1. It should be noted that frequencies 
within the past 12 months refer only to those 
persons who had a positive history during that 
time. In the column headed “homosexual 
intercourse” we have presented data on frequency in Ss' entire life rather than in the
past 12 months, because only two Ss admitted
such a practice during the last year.

In connection with questions on the inci- 
dence of nocturnal emissions, Ss were asked 
what they dreamed of on these occasions. Dis- 
tribution of responses with respect to sex 
object in the dream was as follows: girls or 
women, 54; both girls and boys, 16; boys 
only, 3; animals, 3; girls and animals, 3; self, 
3; parents or relatives, 2. There was an in- 
teresting difference in dream content between 
those Ss with a positive history of homosexual- 
ity (including those with both homosexual 
and heterosexual experiences) and those with 
heterosexual but no homosexual experience. 
Of those with some homosexual experience, 
15 dreamed only of girls, 14 dreamed of girls 
and boys, 3 could not recall, 3 dreamed of 
self, 1 dreamed of boys, 1 dreamed of girls 
and animals, 1 dreamed of parents, and 1 of 
relatives. Of those with heterosexual but no
homosexual experiences, 39 dreamed only of
girls, 2 dreamed of boys, 2 dreamed of boys
and girls, and 2 dreamed of girls and animals.
All Ss who reported dreams of both boys and
girls indicated that dreams of girls were predominant at the time of reporting.


TABLE 1
INCIDENCE OF CERTAIN TYPES OF SEXUAL BEHAVIOR AMONG UNIVERSITY STUDENTS OF THE ARAB NEAR EAST*

QUESTION ASKED                 NOCTURNAL    MASTURBATION     HETEROSEXUAL    HOMOSEXUAL
                               EMISSIONS                     INTERCOURSE     INTERCOURSE
Ever?
  No. of replies               113          102 (103)        113 (111)       113 (111)
  % Yes                        92.9         91.1 (92.8)      69.0 (45.0)     38 (27)
Age at first experience?
  Mean                         14.4         14.6 (13.3)      16.9 (15.6)     13.2 (12.2)
  Median                       14           14 (13)          17 (15.9)       14 (12.8)
  Range                        12-18        8-23 (7-19)      8-21 (7-20)     6-19 (5-23)
Frequency**
  No. with positive history    91           97 (93)          78              43 (30)
  Mean                         20.0         35.2 (36.0)      14.3            7.7 (10)
  Median                       10           10.5 (23)        6               3 (3)
  Range                        1-100        1-540 (1-150)    1-150           1-50 (1-150)

** ... with positive history during the interval.


TABLE 2
ESTIMATED INCIDENCE OF CERTAIN TYPES OF SEX BEHAVIOR AS A FUNCTION OF PERSONAL HISTORY*

                                                               NEGATIVE               POSITIVE
                                                               PERSONAL HISTORY       PERSONAL HISTORY
QUESTION ASKED                                                 NO. OF                 NO. OF
                                                               REPLIES    MEAN        REPLIES    MEAN
Masturbation
  1. Percentage of men over 18 in your community who have      8          40.5        92         75.1
     masturbated                                               (8)        (59.8)      (100)      (92.1)
  2. How many of 10 best male friends have masturbated         10         4.0         97         8.0
                                                               (8)        (5.3)       (100)      (9.6)
Homosexuality
  1. Percentage of men over 18 in your community who have      58         13.7        39         42.9
     had a homosexual experience                               (71)       (9.8)       (26)       (53.5)
  2. How many of 10 best male friends have had a               63         0.9         41         4.4
     homosexual experience?                                    (75)       (.3)        (28)       (5.4)
Heterosexuality
  1. Percentage of men in your community over 18 who have      34         51.4        73         67.7
     had pre- or extramarital intercourse                      (61)       (60.9)      (48)       (72.6)
  2. How many of your 10 best male friends have had pre- or    35         4.4         77         [illegible: "TA"]
     extramarital intercourse?                                 (61)       (4.5)       (51)       (7.0)

* Figures in parentheses are from Finger's study (1) of American students, and are presented for comparison.

Additional information was obtained on 
recent heterosexual intercourse by asking 
whether Ss had visited houses of prostitution 
within the past 12 months. “Special houses,” 
not licensed by the government but neverthe- 
less operating as places of prostitution, were 
specifically included. Sixty-seven of the students gave data indicating that they had
visited brothels in the past year. Frequency of
visits ranged from one to 50. Mean number of
visits was 9.9 and median was 6.* It seems that
prostitutes provide the major, though not the 
only, heterosexual outlet for these students. 

Sex beliefs. The Ss were asked not only to
give information about their own sex behavior,
but also to express their opinions about sex
activity of others. Each S was asked to make
estimates of the percentage of men over 18
years of age in his own community who had
ever engaged in masturbation, homosexual
intercourse, and extramarital heterosexual
intercourse. He was then asked to estimate
how many of his 10 best friends had practiced
each of these. Although these 10 best friends
were not necessarily drawn from the student’s
own community, there was considerable
similarity of response on the two questions, as
can be seen from Table 2. From this table it
is also obvious that persons who had not used
a certain sex outlet and those who had used it
differed considerably in their estimates.

* Throughout this report “median” refers to crude
median. That is, when the middle case fell in a one-point
interval containing more than one case, the midpoint
of that interval was taken as the median.

The Ss were also asked to estimate the 
percentage of women in their community who 
had had premarital heterosexual intercourse. 
One hundred four estimates were received, and 
these ranged from 0 to 50 per cent, except for 
one estimate of 80 per cent. This last individual 
reported seduction by an older girl at the age 
of 17, and reported 150 heterosexual experi- 
ences in the previous 12 months. The mean 
estimate of the 74 Ss with some heterosexual
experience was 12.2 per cent, and the mean
estimate of the 30 without heterosexual experience was 6.7 per cent. The distribution of
the estimate was badly skewed, however, and
the median estimate of those with positive
history was 5 per cent while the median estimate of those with negative history was 3
per cent.

When asked to give their own attitude 
toward sex, a large majority of Ss expressed 
the belief that sex was necessary or that it 
was not to be avoided altogether. At the same
time they expressed many reservations, and,
therefore, their acceptance of sex must be 
considered highly conditional. Some of the 
common comments were: “not with prosti- 
tutes,” “heterosexual only,” “must exercise 
caution against V.D.,” “practice it so as not
to lose too much time thinking about it,” “if 
it doesn’t lessen the girl’s opportunity for 
marriage,” “not with more than one partner,” 
“in moderation only,” “only harmless sex, 
i.e., masturbation.”

From these responses it is clear that our Ss 
accept the necessity of a sex life, but have 
many inhibitions with respect to sex. It is not 
possible to divide the respondents, as Finger 
did his, into two categories of “condoning” 
and “not condoning” heterosexual activity. 

Additional insight into Arabs’ attitudes 
toward sex can be obtained from their answers 
to the question “How do you feel after sexual 
activity?” Replies to this question seem to fall 
into three categories of roughly equal size: 
those who feel disgusted, guilty, or angry with
themselves; those who feel relaxed or happy;
and those who have a combination of the first
two. 

Only six persons reported that they had
been threatened with punishment for mas-
turbation, but almost all had heard that ill 
effects ensued from such behavior. The most 
commonly reported purported ill effects were: 
deterioration in health, tuberculosis, insanity, 
epilepsy, and impotence. Less frequently re- 
ported were blindness, withering of genitals, 
early death, arrested growth, loss of spinal 
fluid, and anemia. 

Sex information. Friends seem to be the
primary source of information about sex. 
Only nine of our 113 Ss reported obtaining 
sex information from parents and eight of these 
mentioned books and school courses as addi- 
tional sources; 95 mentioned getting this in- 
formation from friends, and 57 of these also 
mentioned books or school courses as addi- 
tional sources. Only five persons mentioned 
books and school as the principal source.

In answer to the question “How did you 
learn about masturbation?" almost all of our 
Ss mentioned friends. Although sources of 
knowledge about sex are unquestionably 
multiple and interrelated and many of these 
may not be recalled, it is clear that our Ss 
consider their friends to be the principal 
source.



DISCUSSION 


Sex practices. The data on nocturnal emissions presented in Table 1 show remarkable
similarity to those which Kinsey et al. found 
among American college males. Nearly 93 
per cent of our Ss reported such occurrences, 
and the Kinsey report (2, p. 342) states that 
94 per cent of males in the 13+ educational 
group of ages 16 through 20 had had such an 
experience. Median age at first experience was 
14 for our Ss; Kinsey et al. (2, p. 520) report
that at age 14 a total of 52 per cent of those 
in the 13+ educational group had had at least 
one such experience. Finally, the mean of 20 
emissions in the past twelve months reported 
by our Ss (a rate of .38 per week) is close to the 
means of .44 and .46 which Kinsey found in 
the groups aged 16 through 25 years. Although 
Finger did not gather data on nocturnal emissions, it seems safe to conclude that the incidence and age of onset reported by our Ss
resemble closely those reported by college
males in the United States. 
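
The conversion from a yearly count to a weekly rate in the paragraph above can be sketched as follows. The helper name and the 52-week year are assumptions made for illustration; the source gives only the resulting figure of .38.

```python
WEEKS_PER_YEAR = 52  # assumption: the yearly mean is spread over 52 weeks


def weekly_rate(events_per_year: float) -> float:
    """Convert a count over the past twelve months to a per-week rate."""
    return events_per_year / WEEKS_PER_YEAR


# Mean of 20 emissions in the past twelve months, as reported in Table 1.
mean_weekly_rate = weekly_rate(20.0)
print(round(mean_weekly_rate, 2))
```

The same division applied to the largest masturbation frequency in Table 1 (540 in the past year) gives a little over 10 per week.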

The difference in dream content at the 
time of nocturnal emissions between persons 
with some homosexual experience and those 
with heterosexual but no homosexual experience was in accord with standard clinical
theory regarding dreams. Of those persons
with only heterosexual experience, 39 reported
that these dreams were of girls only and 6
reported other dreams. Of the Ss with some
homosexual experience, 15 reported that their
dreams at emission were of girls only, and 21
reported other dreams. If a 2 X 2 table were
constructed of these data, the value of chi
square would be 17.1. Thus the possibility
that this distribution is a chance one is less
than one in a thousand. It is safe to conclude
that there is a relationship between previous
sexual experience and dream content at nocturnal emissions. These data may be interpreted in several ways. They may indicate
that dream perceptions are conditioned responses to need-reductive action, or that
persons trained to avoid homosexuality in
practice manifest this training in their dreams,
etc. Our data give empirical evidence for only
the existence of a relationship between these 
dreams and actual sex practice. 
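
The 2 X 2 test described above can be sketched with the standard shortcut formula for a fourfold table. The function names are hypothetical, and the source does not say whether a continuity correction was applied, so both variants are shown. Recomputing from the counts as printed gives roughly 18.2 without a correction and 16.3 with Yates's correction, neither exactly the printed 17.1; all three values exceed 10.83, the critical value for p = .001 with one degree of freedom, so the stated conclusion holds either way.

```python
import math


def chi_square_2x2(a: int, b: int, c: int, d: int, yates: bool = False) -> float:
    """Chi square for the fourfold table [[a, b], [c, d]].

    Uses N * (ad - bc)^2 / (row1 * row2 * col1 * col2); with yates=True,
    N / 2 is subtracted from |ad - bc| before squaring.
    """
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        diff = max(0.0, diff - n / 2.0)
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        raise ValueError("every row and column total must be positive")
    return n * diff ** 2 / denom


def p_value_df1(chi2: float) -> float:
    """Upper-tail probability of chi square with one degree of freedom."""
    return math.erfc(math.sqrt(chi2 / 2.0))


# Rows: heterosexual experience only; some homosexual experience.
# Columns: dreams of girls only; other dreams.
uncorrected = chi_square_2x2(39, 6, 15, 21)
corrected = chi_square_2x2(39, 6, 15, 21, yates=True)
print(round(uncorrected, 1), round(corrected, 1), p_value_df1(17.1) < 0.001)
```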

Our Arab Ss seem to have had their first experience in masturbation, homosexual intercourse, and heterosexual intercourse about a
year later on the average than did Finger’s Ss.
The age difference holds for both mean and
median. In view of the fact that nocturnal 
emissions did not occur later in our Ss, the 
slowness in beginning these other activities is 
probably not physiologically caused. It is more 
likely to be a result of the repression of sex in 
the home, where the subject is never mentioned.

The average age of our Ss at the time of 
testing was more than a year greater than 
that of Finger's Ss, so that the time between
initiation of these several sex practices and the 
administration of the questionnaire was ap- 
proximately the same for both groups. Conse- 
quently the various frequencies reported for 
the two groups draw from approximately 
equal periods of activity. 

Comparison of our Arab Ss with Finger's 
Virginians on incidence of various types of 
sexual activity suggests that more of our Ss 
have had heterosexual relations, more have 
had homosexual relations, and about the same 
number have masturbated. The repression of 
sex at home would appear, therefore, to be
only temporarily effective. Masturbation 
seems to be practiced more often by the Vir- 
ginians, for their median is more than twice 
that of our group. The means are similar be- 
cause of a few exceptional Arabs, one of whom 
estimated that he had masturbated more than 
10 times per week for the past year. The groups
with positive history are similar with respect 
to frequency of homosexual activity, although 
the exceptional case was this time found in 
the Virginia group. Unfortunately, Finger 
asked his respondents to report the number of
heterosexual experiences not for the past 12 
months but for their entire history. Neverthe- 
less, it is clear that heterosexual experiences 
occur more often among our Ss, for 69 per 
cent reported at least one experience whereas 
only 45 per cent of Finger’s Ss reported that. 
Moreover, the median of our group with 
positive history was six experiences in the past
year whereas the median of the Virginia stu- 
dents with positive history was eight experi- 
ences in entire life. 

The greater number of heterosexual experi- 
ences by the Arab students depends, of course, 
on the number of experiences which they have 
had with prostitutes. Only 11 students had 
had intercourse with “companions” only (to 
borrow Kinsey’s term), and 28 had had intercourse with both companions and prostitutes
in the previous year. Kinsey reports that only
28 per cent of his Ss in the college group, aged
21 to 25 years, had ever visited a prostitute.
In contrast, we find that 59 per cent of our
sample had visited prostitutes in the preceding
year. Visiting prostitutes seems to be the 
tension-reductive technique which is best 
suited to the cognitive structure of our Ss.
Such sex activity meets the “physical needs” 
of the organism, it involves little romance, it 
does not reduce the partner’s chance for mar- 
riage, it does not violate a family’s code of 
honor, etc. 
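
The percentages in this paragraph can be checked against the counts reported in the text. The sketch below assumes that all 113 respondents form the base for each percentage; the helper name is hypothetical.

```python
TOTAL_SS = 113  # respondents who returned the questionnaire


def percent(part: int, whole: int = TOTAL_SS) -> float:
    """Express part as a percentage of whole."""
    return 100.0 * part / whole


# 67 Ss reported visiting brothels in the past year.
visited_prostitutes = percent(67)

# 11 Ss had intercourse with companions only, 28 with both companions
# and prostitutes, so 39 had some experience with companions.
with_companions = percent(11 + 28)

print(round(visited_prostitutes, 1), round(with_companions, 1))
```

On this base the first figure agrees with the 59 per cent given above, and the second comes to about 34.5 per cent, which appears consistent with the 34 per cent cited later in the discussion for experience with women other than prostitutes.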

Sex beliefs. The functional significance of be- 
liefs has been stressed by many social psy- 
chologists. According to such a view, we would 
expect that beliefs about sex practices in one’s 
own community would be greatly influenced 
by one’s own cognitive organization. Cer- 
tainly one’s sex desire, sex practices, and the 
sex mores which one has learned would enter 
into this organizational process. Therefore, we 
would expect that persons who violate the 
sex mores of premarital abstinence would be 
more likely to hold to the belief that the mores 
are only apparent and not generally adhered to 
than would persons who did not violate the 
abstinence rule. That is, we would expect the 
belief to be influenced by one’s own practices. 

This expectation is borne out by the data 
presented in Table 2. In every instance the 
group with a positive history estimated the 
incidence among their friends and their com- 
munity to be higher than did those with a 
negative history. Finger observed this dif- 
ference in the beliefs of his Ss, and concluded 
that there is a tendency to project one’s own 
behavior. It is interesting to note that Finger’s 
Ss with positive and negative histories dif- 
fered most in their estimates of the incidence 
of homosexuality in the general populations. 
Those with negative histories estimated the 
incidence to be less than half what it was found 
to be in that sample, and those with positive 
histories estimated the incidence to be twice 
as great as it was. This distortion seems to be 
a product of the strong American taboo on 
homosexuality. Among our Arab Ss the taboo 
on homosexuality itself (apart from the general 
sex taboo) is not so great, and the disagreement 
in estimate between those with positive and 
negative history is not so great. 

There was a general tendency for our Ss to
estimate the incidence of various kinds of
sexual behavior among friends and among
members of their community to be lower than
our data suggest it to be. The "conspiracy of
silence" about sex which characterizes the
home life is apparently successful in convincing
Ss that masturbation, homosexuality, and 
heterosexuality are rarer than is actually the 
case. Consequently the estimates of those with 
positive histories come much closer to agree- 
ing with our incidence data than do the esti- 
mates of those with negative histories. 

It is interesting to note that our Ss estimated 
the number of women in their community 
who had engaged in premarital sex activity to 
be quite low. Median estimate was 4 per cent 
and mean estimate slightly more than 10 per 
cent. Yet 34 per cent of those Ss had had 
heterosexual experience in the previous year 
with women other than prostitutes. Finger's 
data are in sharp contrast to these. Forty-five 
per cent of his Virginians reported some hetero- 
sexual experience, and the Ss estimated that 
44 per cent of women (presumably women of 
their own communities) entered into marriage 
with a history of sexual intercourse.

Our Arab Ss apparently believe that the
men of their community have heterosexual ex- 
perience before marriage, but that the women 
of their community do not. Who, then, are 
the men’s partners? Close examination of those 
questionnaires indicating intercourse with 
companions, but estimating the incidence of 
premarital sex activity by women of their 
community as very low, presented some pos- 
sible answers. Some of these students had 
begun heterosexual activity in the compara- 
tively Western, urban environment of Beirut. 
Some had found heterosexual partners among 
“older women” or among widows, so that the 
sex activity was not premarital so far as the 
women were concerned. Others mentioned
servants as sex partners. 

These preliminary questionnaire data on
beliefs raise many interesting hypotheses, but
they cannot give us definite confirmation of
these hypotheses. As Finger has concluded, “a
more definite explanation of the experience-
estimation relationship waits upon more de-
tailed analysis” (1, p. 66). Clinical case studies
utilizing the phenomenological or field ap-
proach present practical difficulties, but they
will unquestionably yield considerable data on
beliefs which are not obtainable from ques-
tionnaires.




SUMMARY AND CONCLUSIONS


In order to obtain data on sex practices and 
beliefs of male university students in the Arab 
Near East, and to compare them with data 
from American universities, a questionnaire 
was administered to 113 students at the 
American University of Beirut, Lebanon, under
conditions designed to insure anonymity and 
cooperation. 

Data on nocturnal emissions were similar to
those obtained by Kinsey et al. Dream content
at time of emission was found to be related to
type of previous sex outlet. First experience
with masturbation, homosexual intercourse,
and heterosexual intercourse was later for our
Ss than for similar Ss in America. The per-
centage of our Ss with a history of masturba-
tion was about the same as in American
studies, but frequency of masturbation was
less in our group. More Arab students had
positive histories of homosexual intercourse
and more had positive histories of heterosexual
intercourse. The principal heterosexual outlet
was with prostitutes.

The principal source of information about 
sex was friends. Very few Ss had obtained this 
information from parents. 

Persons with a positive history of each kind
of sex behavior estimated the number of per-
sons in their own community who engaged in
such behavior to be higher than did those with
negative histories. The general tendency
seemed to be in the direction of underestima-
tion, so that those with positive histories gave
estimations which came closer to the incidence
figures actually obtained.

Estimates of the number of women who
entered marriage with a history of hetero-
sexual intercourse were far lower than the
estimates made by American students.

Some attempt was made to relate the data 
to known elements in the culture of this area.



REFERENCES 


1. Finger, F. W. Sex beliefs and practices among male
college students. J. abnorm. soc. Psychol., 1947,
42, 57-67.

2. Kinsey, A. C., Pomeroy, W. B., & Martin, C. E.
Sexual behavior in the human male. Philadelphia:
Saunders, 1949.

3. Patai, R. The Middle East as a culture area.
Middle East J., 1952, 6, 1-21.

4. Wolman, B. Sexual development in Israeli ado-
lescents. Amer. J. Psychother., 1951, 5, 531-559.


Received October 11, 1952.



THE EFFECT OF PRIOR REINFORCEMENT ON THE INTERACTION
OF OBSERVER PAIRS¹


BERNARD MAUSNER
University of Massachusetts 


The study of the manner in which groups
of observers arrive at perceptual norms
has been a central problem in experi-
mental social psychology since Sherif's work
(8). In his experiment, subjects (Ss) whose 
previous judgments of the extent of autokinetic 
movement had differed tended to approach 
a norm when they worked in groups. Subse- 
quent investigations using a similar design 
have disclosed that this phenomenon of con- 
vergence toward group norms is not restricted 
to judgments of autokinetic effect (5, 6, 7), 
and that its occurrence will be affected by a 
number of factors. Among these are: (a) the 
nature of the stimulus field (5), (b) the past 
history of contact among members of the 
group (2), (c) attitudes towards group member- 
ship of partners (1, 3), and (d) the personalities 
of the Ss (4). 

It is probable that in the group situation
each S has some conflict between opposing
response tendencies. The S may have a ten-
dency to continue to make the responses he
has previously made toward the stimuli being
presented; he may also have a tendency to
be influenced by his partner or partners. The
S's personality, his impression of the partner,
and any past experiences they may have had
in common may affect the strength of this
latter tendency. If a population of Ss is so
chosen that the tendency to agree can be as-
sumed to be randomly distributed, it may be
possible to affect the extent of convergence by
varying the opposing tendency, i.e., the tend-
ency to continue in the former judgment range.
Random distribution of the tendency to agree
may be assumed (a) if the population of Ss is
relatively homogeneous, and (b) if, in addition,

¹ Number two in a series entitled Studies in Social
Interaction. The present report is based on a disserta-
tion (6) submitted to the Faculty of Pure Science,
Columbia University, in partial fulfillment of the re-
quirements for the Ph.D. The writer is indebted to
Professors Otto Klineberg and W. N. Schoenfeld for
guidance in the design and conduct of this investiga-
tion. Apparatus and facilities for the experiment were
made available by the Department of Psychology,
Washington Square College, New York University, of
which the writer was formerly a member.


coacting Ss have not been previously ac- 
quainted. 

Laboratory studies of learning suggest that 
one possible determinant of the strength of the 
tendency to maintain the former range of 
judgments may be the degree to which pre- 
viously made judgments have been reinforced. 
In the present experiment the following specific 
hypothesis was tested: for any given type of 
judgment, if the responses of S have a history 
of past positive reinforcement they will show 
less tendency to converge than if they have 
a history of nonreinforcement or of negative 
reinforcement. 

This hypothesis has been tested independ- 
ently by Kelman (4). Although his work is not 
directly comparable to the present study, since 
he used a different procedure and treatment of 
data, Kelman’s results do furnish support of 
the above hypothesis. 


METHOD 


The task in this experiment was judgment of lengths 
of lines. The apparatus has been described elsewhere 
(6). The Ss were 38 members of undergraduate psy- 
chology classes. All Ss were naive regarding the purpose 
of the present experiment and previous work in the 
field. 

In an alone session 200 judgments were made: 20 
lines varying in length from 8 in. to 18.5 in. in steps of 
.5 in. were each presented 10 times. The judgments were 
made in inches or fractions of an inch. The S was told 
that he would be informed whether he was right or 
wrong after each judgment. The supposed criterion of 
“right” was accuracy within one inch. Each series of 
20 trials contained the entire group of stimuli in a 
different order so that no one line followed any other 
more than once in the total sequence. The same order 
of stimuli was followed for all Ss. Half the Ss were 
told they were right on 82 per cent of the trials (group 
R); half were told they were wrong on 82 per cent 
(group NR). Alternate Ss were assigned to groups R
and NR at the first sessions. 

The position of the “wrongs” for group R and
“rights” for group NR was the same in the entire series
of trials. Lines from 8 in. to 9.5 in., and from 17 in. to
18.5 in. were consistently “right” for group R and
“wrong” for group NR. This was done to avoid a tend-
ency which appeared in a pilot study for group NR to
continue to make “right” judgments of the easily
identified longest and shortest lines.




For the second session Ss made their judgments in
pairs. The Ss in each pair, previously unacquainted
with each other, were so chosen as to minimize overlap
between frequency distributions of judged lengths of
any single line in the first session. Pairs were of three
kinds: seven consisted of two Ss from group R (R-R),
six of one S from each group (R-NR), six of two Ss 
from group NR (NR-NR). The Ss were told that two 
Ss would work concurrently in order to complete the 
experiment within the scheduled time. Each S judged 
every stimulus. The members of the pair alternated as 
first to judge. The order of presentation of stimuli was 
the same as in the first session, but now S was not told 
whether he was right or wrong.


TREATMENT OF DATA


The following procedure was used to measure
change in judgment: the frequency with which
each “number” response (e.g., “10 inches”)
was made by S was obtained separately for
alone (A) and together (T) situations. For
each of the number-response categories the
difference in frequency of occurrence between
alone and together situations was calculated
(“T − A” score). All number-response cate-
gories which occurred in either of the two
situations were then arranged in order of
“magnitude.” The midpoint of the series of
“magnitudes” was found. Algebraic sums of the
T − A scores were obtained separately for
the group of response categories above and
below the midpoint of the series of categories.
The two sums of T − A scores were then added
without respect to sign to give a score which
represents the amount of shift in response
frequency from alone to together situations.
This henceforth will be referred to as the D
score.
The direction of shift was measured as
follows: the number of plus and minus signs
of the T − A scores was taken separately for
response categories above and below the mid-
point. Where no systematic shift occurred,
the number of plus and minus T − A scores
should be equal in each half of the judgment
category series. Where shift occurred, two
possibilities are present: if shift was “up,”
there were more high-number responses in the
together series; there was, therefore, a pre-
ponderance of negative T − A scores below
the midpoint and of positive T − A scores
above it. If shift was “down,” the reverse
obtained. All signs in the lower half of the
response category series were reversed, and
the total algebraic sum of positive and nega-
tive T − A signs then taken. This total will



TABLE 1
CALCULATION OF D AND C SCORES FROM
200 JUDGMENTS (IN INCHES) OF
LENGTHS OF LINES
(One S, hypothetical data.)

JUDGMENT CATEGORY   FREQUENCY ALONE (A)   FREQUENCY TOGETHER (T)   T − A   SUBTOTAL
3 2 0 j 
4 18 0 | 
5 n 0 | 
6 43 6 1 
1 43 A } 
8 38 45 
MM MM midpoii j 
9 2 38 
10 8 33 
u 2 26 | 
12 0 4 | 
13 0 1 1 
14 0 8 | 
200 200 99 


Calculation of D score
  Σ(T − A) above midpoint          99
  Σ(T − A) below midpoint          99
  D score                         198

Calculation of C score
  No. + T − A below midpoint       +1
  No. − T − A below midpoint       −5
  Reverse sign, subtotal           +4
  No. + T − A above midpoint       +6
  No. − T − A above midpoint       −0
  Retain sign, subtotal            +6
  C score                         +10


be referred to as the C score. A positive C
score indicates movement “up” in the number
category series; a negative C score shows move-
ment “down.” In summary, the D score gives
a measure of degree of shift in judgment, the
C score of direction. For examples of calcula-
tion of D and C scores see Table 1.
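
The scoring procedure just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the author's own computation: the function name `d_and_c_scores` and the sample judgments are hypothetical, and the handling of an odd number of response categories (the middle category is left out of both halves) is an assumption, since the text does not say how that case was treated.

```python
from collections import Counter


def d_and_c_scores(alone, together):
    """Return (D, C) for one S from judgments made alone and together."""
    freq_alone = Counter(alone)
    freq_together = Counter(together)

    # Every response category used in either situation, in order of magnitude.
    categories = sorted(set(freq_alone) | set(freq_together))

    # Split the series at its midpoint. Assumption: with an odd number of
    # categories the middle one belongs to neither half.
    half = len(categories) // 2
    lower = categories[:half]
    upper = categories[len(categories) - half:]

    def t_minus_a(category):
        return freq_together[category] - freq_alone[category]

    def sign(value):
        return (value > 0) - (value < 0)

    # D: the two algebraic sums of T - A, added without respect to sign.
    sum_lower = sum(t_minus_a(c) for c in lower)
    sum_upper = sum(t_minus_a(c) for c in upper)
    d_score = abs(sum_lower) + abs(sum_upper)

    # C: count of plus and minus signs, with the lower-half signs reversed.
    signs_lower = sum(sign(t_minus_a(c)) for c in lower)
    signs_upper = sum(sign(t_minus_a(c)) for c in upper)
    c_score = signs_upper - signs_lower

    return d_score, c_score


# Hypothetical judgments in inches, not data from the experiment.
alone = [10] * 3 + [11] * 5 + [12] * 2
together = [11] * 2 + [12] * 4 + [13] * 4
d_score, c_score = d_and_c_scores(alone, together)
print(d_score, c_score)
```

In this made-up case the judgments move toward larger numbers in the together session, so the C score comes out positive, which is the sense of “up” given in the text.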


RESULTS AND DISCUSSION 

Table 2 gives C and D scores for all Ss, and
mean and σ of the distribution of D scores for
each group. Only one S in group NR, and
three in group R, had D scores more than
1.5σ from the means of their respective groups.
Comparison of mean D scores for the two
groups shows that group NR has a D score
significantly higher than group R (t = 4.15,
df = 33, p < .01). Thus, the nonreinforced
Ss showed a significantly greater tendency
than the reinforced Ss to shift judgments from
the alone to the together situation, sub-
stantiating the hypothesis presented above.
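
As a rough check on a comparison of this kind, a two-group t can be recomputed from summary statistics alone. The sketch below assumes a pooled-variance Student's t, which the text does not specify, and the function name `pooled_t` is hypothetical. The means and sigmas are those given in the footnote to Table 2; the group sizes of 17 (NR) and 18 (R) are an assumption inferred from the reported 33 degrees of freedom. Because the published means and sigmas are rounded, the result need not reproduce the reported t exactly.

```python
import math


def pooled_t(mean_1, sd_1, n_1, mean_2, sd_2, n_2):
    """Pooled-variance t for two independent groups, from summary statistics."""
    df = n_1 + n_2 - 2
    pooled_var = ((n_1 - 1) * sd_1 ** 2 + (n_2 - 1) * sd_2 ** 2) / df
    std_error = math.sqrt(pooled_var * (1 / n_1 + 1 / n_2))
    return (mean_1 - mean_2) / std_error, df


# Group NR: mean 142, sigma 100; group R: mean 37, sigma 34 (Table 2 footnote).
# Group sizes 17 and 18 are assumed, not stated in the text.
t_value, df = pooled_t(142, 100, 17, 37, 34, 18)
print(round(t_value, 2), df)
```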




TABLE 2
D† AND C SCORES FOR Ss OF EACH PAIR

Reinforced Ss indicated by (R), nonreinforced Ss by (NR), high S (H), low S (L); negative C shows movement
“up,” positive C “down.”


SUBJECT         D     C     SUBJECT         D     C     SUBJECT         D     C
MEY (R, H)     39    +1     MAR (R, H)     31    −8     ABR (NR, L)     9   −11
GRE (R, L)     72   +12     FEI (NR, L)*   --    --     ROD (NR, H)   201   −17
KRA (R, L)     18     0     FRA (R, H)     13    −4     SLA (NR, L)   212   +13
BAT (R, H)     25    +3     MAK (NR, L)   199   +21     ROS (NR, H)    78   −12
LEV (R, H)     's    +2     NAD (R, H)     31     0     KLE (NR, L)   204   +10
BRU (R, L)    104    −2     BER (NR, L)    95    +8     MCC (NR, H)   159   −16
WEI (R, L)      7    +1     RUO (R, L)     26    −2     COF (NR, H)   103   −19
HOG (R, H)    107   −14     LEW (NR, H)    87   −17     AMS (NR, L)    30    −8
TYL (R, H)    104   +16     COH (R, L)     14    −2     FER (NR, H)    97   −22
LUR (R, L)     38    −9     MOL (NR, H)   260   −25     FLE (NR, L)   117   +11
MEE (R, L)      7    +6     UNG (R, H)      2     0     TEP (NR, H)   229   −26
FEL (R, H)     17    −4     STE (NR, L)    84    +9     PLI (NR, L)    81   −10


* D and C scores for three Ss, FEI, RAN, and VAL, were not computed because of incomplete data. Graphic treatment shows
all three responded according to the experimental hypothesis (6).
† Mean D score, group R, 37, σ = 34; group NR, 142, σ = 100.


Of the three Ss in group R with D scores
more than 1.5σ from the means of their group,
the preponderance of judgments of two moved
away from those of the partner, as demon-
strated by C scores opposite in sign from those
of their partners (TYL and BRU, cf. Table 2).
The results for the third S (HOG) are not in
accord with the experimental hypothesis, since
this S, although his judgments were previously
reinforced, had a high D score, and converged
toward his partner. The S in group NR with
a D score more than 1.5σ lower than the group
mean was a member of an NR-NR pair which
did arrive at a group norm.

Examination of the C scores to indicate
direction of shift shows that what shift did 
occur among the 12 Ss in R-R pairs was more 
often away from the partner than toward 
him: seven Ss showed this tendency, one had 
a zero C score, and four shifted towards the 
partner. All of the NR Ss in R-NR pairs con- 
verged toward the partner, high Ss giving 
negative C scores and low Ss positive C scores. 
The R Ss in these pairs had two zero C scores, 
two C scores showing tendency to shift toward 
the partner, and two away from him. Nine 
of the 12 Ss in NR-NR pairs showed a tendency 
to converge toward the partner. Here, how-
ever, in all cases a group norm was reached,
as can be demonstrated by graphs showing
median judgments for each 20 trials in the 
course of the experiment. 

These plots of median responses through 
the course of the experimental sessions permit 
comparison of the results of this study with 
Sherif’s (8). The predictions advanced in this 
experiment were that the convergence by both 
Ss, as found in Sherif’s work, would occur only 
in the NR-NR pairs; that in an R-NR pair 
only the NR member would converge; and 
that neither member of an R-R pair would
converge. Partial confirmation of these pre-
dictions was found in the characteristics of the
C scores discussed above. The median graphs
for the R-R group (Fig. 1) show that in all
but one case neither S converged toward his
partner. The plots of R-NR pairs (Fig. 2) con-
firm the previously noted trend for the NR
member to converge toward the judgments of
the reinforced partner, and for the latter to
maintain his judgment range. The charac-
teristic funnel-shaped curves which denote
convergence by both partners can be found
only for subject pairs in which both members
came from the NR group (Fig. 3).²

² Figures 1, 2, and 3 are representative of their
respective groups. Complete graphs for all Ss are avail-
able in microfilm form (6).


SUMMARY AND CONCLUSIONS 


The hypothesis that the interaction of ob- 
server pairs is a function of the degree of prior 
reinforcement of the judgment responses was 
tested on 38 Ss who first made judgments of 
lengths of lines when alone, and then in pairs.
The pairs were so chosen that the alone judg-
ments of the two Ss overlapped minimally.
In the alone situation half of the Ss were told 


[Plot: median judgments by trials, alone and together; x = TYL (R), o = LUR (R).]
FIG. 1. MEDIAN JUDGMENTS OF LENGTH OF LINES FOR
GROUPS OF 20 TRIALS. BOTH Ss WERE
POSITIVELY REINFORCED.


[Plot: median judgments by trials, alone and together; x = MOL (NR), o = COH (R).]
FIG. 2. MEDIAN JUDGMENTS OF LENGTH OF LINES FOR
GROUPS OF 20 TRIALS. MOL WAS NEGATIVELY
REINFORCED, COH POSITIVELY REINFORCED.


[Plot: median judgments by trials, alone and together; x = ROS (NR), o = SLA (NR).]
FIG. 3. MEDIAN JUDGMENTS OF LENGTHS OF LINES FOR
GROUPS OF 20 TRIALS. BOTH Ss WERE
NEGATIVELY REINFORCED.


they were right on 82 per cent of the trials 
(group R); half were told they were wrong on 
82 per cent of the trials (group NR). 

Mean D score (a measure of shift in judg-
ment) was significantly higher at the .01 level
for group NR than for group R. This indicates
a greater tendency for NR Ss to be influenced
by their partner's judgments in the group
situation.

Graphical and quantitative measures of
direction of shift show that Ss in group NR
tended to converge toward their partners;
Ss in group R maintained their former judg-
ment ranges or shifted away from those of
their partners.

On the basis of these findings it is concluded
that Ss with a past history of positive rein-
forcement in a given type of judgment are
less influenced by their partners in a group
judgment situation than Ss with a history of
negative reinforcement.


REFERENCES 


1. Berenda, Ruth W. The influence of the group on the
judgments of children. New York: King's Crown
Press, 1950.

2. Bovard, E. W., Jr. Group structure and percep-
tion. J. abnorm. soc. Psychol., 1951, 46, 398-405.

3. Bray, D. W. The prediction of behavior from two
attitude scales. J. abnorm. soc. Psychol., 1950,
45, 64-84.

4. Kelman, H. Effects of success and failure on “sug-
gestibility” in the autokinetic situation. J. ab-
norm. soc. Psychol., 1950, 45, 267-285.

5. Luchins, A. S. Social influences on perception of
complex drawings. J. soc. Psychol., 1945, 21, 257-

6. Mausner, B. The effect of prior reinforcement on
the interaction of observer pairs. Ann Arbor:
University Microfilms publication No. 2118, 1950.

7. Schonbar, R. A. The interaction of observer-pairs
in judging visual extent and movement: the
formation of social norms in a structured situation.
Arch. Psychol., 1945, No. 299.

8. Sherif, M. A study of some social factors in per-
ception. Arch. Psychol., 1935, No. 187.


Received October 13, 1952.


BIAS IN POSTDICTION FROM PROJECTIVE TESTS 


WILLIAM F. SOSKIN¹
University of Chicago


In a preceding investigation (1, 2) an effort
was made to study the rater bias or set
found in personality appraisals based on
projective tests and other types of personal
documents by comparing such appraisals with
others based on more extensive information
and observation. In this earlier study the cri-
terion consisted of a series of trait ratings ar-
rived at by a three-man team after five days
of study of the individual through objective
and projective tests, interviews, various per-
sonal documents, and observations of behavior
in standard situations. However, inasmuch as
the criterion ratings consisted of a series of
judgments of traits, i.e., of certain abstrac-
tions currently familiar in personality studies,
the validity of the appraisals which were in-
vestigated was pointed out to be a “relative
validity” contingent upon the validity of the
judgments of the team making the criterion
ratings. Since the consensus of “experts” may
or may not correlate satisfactorily with an
external criterion of validity, an effort was
made to conduct an analogous investigation in
which the appraisals could be evaluated against
a more rigorous criterion, viz., a criterion test
dealing more directly with specific behaviors
of the subject which had occurred in the past
rather than with the appraisal of so-called
traits. The present report deals with this
effort.


METHOD 


Criterion test. As the criterion for this study a post-
diction test was constructed consisting of an array of
multiple-choice items which were built primarily
around incidents known to have happened in the recent
past history of a single subject (S), hereafter called
“Linda.” Although the majority of items concerned
specific situational behaviors, some items were de-
veloped from commonly shared characterizations of S



¹ Grateful acknowledgement is hereby expressed to
Drs. William E. Henry and Donald W. Fiske at the 
University of Chicago, Dr. E. L. Kelly at the University 
of Michigan, and Dr. Julian Pathman of Downey 
Veterans Administration Hospital, Downey, Illinois, 
for obtaining the participation of the several groups in- 
volved in this study. 




provided by several intimate acquaintances. Several 
examples of the items are: 

4. In the immediate neighborhood she (a) has a 
wide circle of casual friends who frequently “drop in”; 
(b) is the confidant of several young mothers; (c) has 
one intimate friend, and has only fleeting contacts with 
others; (d) is considered a gossip, hence not too popular. 

5. As a housekeeper she is (a) meticulous and tidy, 
rather intolerant of toys, clothes, etc. not in their 
proper places; (b) rather untidy—doing about as much 
as needs to be done; (c) spends a considerable amount of 
time adding new feminine touches to the interior; (d) 
so incompetent that on two occasions neighbors have 
complained to the Board of Health. 

8. Last summer the family went on a long trip to 
visit Linda’s parents-in-law. Linda (a) was rather 
excited about the trip since she likes her husband's 
parents better than her own parents; (b) privately felt
a little bitter about spending the money this way, 
when it would be no vacation for her to have to take 
care of the children as usual; (c) was opposed to taking 
the trip because the money would have to be taken out 
of a savings they were accumulating to buy a house; (d) 
induced her husband to fly instead of driving—to get 
the maximum thrill out of the trip. 

The information on which these items are based was 
collected in the following manner: 

Through friends with contacts in social circles dif- 
ferent from his own, the investigator obtained a list 
of several persons previously unknown to him but 
judged by these informants as likely candidates for this 
type of investigation. The S was the first and only one 
contacted from this list. After the nature of the project 
had been carefully explained to both the S and her hus- 
band, the investigator conducted three private inter- 
views with S, each lasting one and one-half to two 
hours. The interviews were not unlike those carried on 
in the initial stages of psychotherapy. The S was given 
maximum freedom to talk about her present, her past, 
etc.; the interviewer only occasionally interposed a 
question when it seemed that certain areas might not 
be touched on spontaneously. During the same interval 
of several weeks the investigator also interviewed pri- 
vately the husband and several of S's closest friends. 
The latter informants, persons in virtually daily con- 
tact with S, were designated by S herself upon request 
of the investigator. 

From the information gathered in this total of ap- 
proximately 15 to 20 hours of interviewing three general 
types of questions were developed: (a) items involving 
factual material reported by S, e.g., that she attended 
college for a specified period of time; (b) items involving 
S’s expressed attitudes and feelings, e.g., that she was 
never very close to her mother and that even today she 
feels her mother’s relation to her is an exploitative one; 
(c) items in which S met a certain situation in a man- 
ner that was presumed to be fairly characteristic of 
her. The material for these latter items was taken 




largely from situations reported spontaneously and in- 
dependently by several informants. The material thus 
collected was developed into multiple-choice items of 
the type shown above, the three distractor alternatives 
in each instance being sheer inventions designed to 
present a range of plausible choices suitable to the 
item under consideration.

After the first interview Linda was given the Ror- 
schach and the TAT, both administered by the inter- 
viewer. The verbatim record of the 20 TAT stories of 
the adult female series was duplicated so that each 
judge had his own copy. Similarly the Rorschach was 
reproduced in detail along with an appropriately 
marked location chart. This report is primarily con- 
cerned with postdictions based on the blind interpreta- 
tion of these protocols. 

Clinical judges. The Linda test, consisting of 24
multiple-choice items, was administered to three dif-
ferent groups. The first, hereafter referred to as Group
N or “novices,” consisted initially of 42 graduate stu-
dents taking their first course in TAT interpretation, a
one-quarter, regularly scheduled graduate course which
met for 12 two-hour sessions. This group worked only
with the TAT. Group Estereo consisted of 12 advanced 
trainees and three supervisory-level clinical psychol- 
ogists; Group Eproj consisted of three advanced trainees 
at a large mental hospital and two supervising psychol- 
ogists in the hospital’s diagnostic service. Individuals 
in the E groups had from two to five or more years of 
experience in use of projective tests, whereas the 
majority of students in Group N had no previous ex- 
perience or training in the use of projective tests. 

Procedure for Group N. At the conclusion of the 
second meeting of the introductory TAT course Group 
N was given a general explanation of the nature of the 
investigation, i.e., that it concerned a study of post- 
diction of specific behaviors by means of blind in- 
terpretation of projective devices. Then copies of the 
Linda test were circulated and student judges were 
provided the following basic information about S: that 
she had had some college training, that her father was 
a moderately successful businessman, that she was a
white married woman, 26 years of age, mother of two
children, etc. This information, incidentally, with the
exception of her age, was also stated as facts contained
in the headings of one or another of the test items. Given
only this basic information (before they were permitted
to see Linda’s TAT protocol), the judges were asked to
study the entire postdiction test carefully and to try to
guess the correct answers. The purpose of this procedure
was threefold: (a) to have every individual thoroughly
familiar with the specific postdictions he would be
called on to make after studying the TAT, (b) to deter-
mine this group's pretest stereotype for a person like
Linda, and (c) to obtain from the stereotype profile an 
additional base besides the true answers against which 
to evaluate the postdictions. 

After completing the stereotype scoring (hereafter
called stage S), judges were given the TAT protocol
and asked to rescore the postdiction test after careful
study of the TAT. (This stage is hereafter referred to
as T-1.) Neither the judges nor the instructor of the
course was given any information about the true
answers at any stage of the investigation. At the con-
clusion of the course the test was administered once
more, although this time no pretest stereotype was
asked for. (This stage is hereafter referred to as T-2.)

Procedure for E Group. Group Eproj participants
took the test individually. No attempt was made to
determine the stereotype of this group. Each person
who volunteered to participate in the investigation was
provided with a packet of materials which included (a)
a copy of the Linda test, (b) a sealed envelope contain-
ing the Rorschach protocol, summary, location chart
and a standard IBM answer sheet, and (c) a second
sealed envelope containing a typed copy of the 20
TAT stories, and a second answer sheet.

Judges were provided with both written and oral
directions explaining that they were first to answer the
Linda test on the basis of the Rorschach alone. Then,
after returning their answer sheets to the supervisor,
they were to open the second envelope, study the TAT
protocol, and complete the test a second time using the
combined resources of the Rorschach and TAT.

Judges were permitted to do this work on paid time,
and those who cared to do so were invited to code the
answer sheets so the test scores could be made available
at a later time.

Group Estereo had no projective test data; they were
provided only with the basic information initially
available to Group N, and were asked to guess the cor-
rect answers.


RESULTS 


The procedure followed with Group N
yielded three different sets of scores, one for
each of the conditions under which the Linda
test was taken. The first of these, stage S, was
the set of answers selected when the judges
were asked merely to guess the correct answers
before having seen the TAT. The second, T-1,
was the set of answers selected after the first
exposure to Linda’s TAT protocol, at the end
of the second meeting of the course. The third,
T-2, was the final set, the responses made at
the conclusion of the course.

The general character of the performance of
Group N under each condition is summarized
in Table 1. For stages S and T-1 only those
judges were studied who completed all items
satisfactorily on both administrations. For
stage T-2 only those judges were studied who
completed all items on all three stages.

The comparison of before-and-after accu-
racy scores suggests very slight changes from
the first to the third test administration; while
this is undoubtedly interesting in so far as
number of correct postdictions is concerned,
the comparison fails to suggest at all one of
the more striking findings of the investigation.
As a crude measure of the amount of shift that
occurred from S to T-1 to T-2, the 24 items
were ranked for each stage in terms of the fre-
quency with which the item was answered cor-




TABLE 1
MEAN AND RANGE OF SCORES OBTAINED BY GROUP N
ON THREE DIFFERENT ADMINISTRATIONS
OF POSTDICTION TEST

STAGE    N     MEAN    RANGE
S        24    7.32    4-16
T-1      24    8.48    5-12
T-2      19    9.42    6-14

TABLE 2
RHO BETWEEN RANKS OF POSTDICTION TEST ITEMS
ORDERED IN TERMS OF DIFFICULTY
AT EACH STAGE

STAGE          RHO
S, T-1        −.23
S, T-2        −.15
T-1, T-2       .90


rectly. gre rho for each of the several combina- 
tions iS'reported in Table 2, where it will be 
seen that the relative difficulty of items in the 
pre-TAT stage (S) is negatively correlated 
with the relative difficulty of these same items 
in the two post-TAT stages (T-1 and T-2). 
From inspection of the data it is evident that 
items which were correctly answered by a con- 
siderable number of judges on the basis of only 
basic information were less often answered cor- 
rectly when the TAT was made available ; con- 
versely, those items most often answered cor- 
rectly with the aid of the TAT were relatively 
difficult to answer correctly with the basic 
information alone. The "negative correlation 
between S on the one hand and T4 afid T-2 
on the other suggests that a marked shift in 
orientation occurred after initial study of the 
TAT, a shift which did not alter substantially 
even at the conclusion of the course. 
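The rank correlations of Table 2 can be illustrated with a minimal sketch. It assumes rho is the Spearman coefficient computed on the number of judges answering each item correctly at each stage; the counts below are hypothetical, not the study's data, and the function names are illustrative only.

```python
def average_ranks(values):
    """Rank values from 1 (smallest) to n, giving tied values their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        mean_rank = (pos + end) / 2 + 1
        for k in range(pos, end + 1):
            ranks[order[k]] = mean_rank
        pos = end + 1
    return ranks


def spearman_rho(x, y):
    """Spearman rho: the Pearson correlation computed on the two sets of ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Hypothetical counts of judges answering each of six items correctly.
correct_at_s = [20, 15, 12, 9, 5, 2]
correct_at_t1 = [3, 6, 10, 11, 16, 19]
rho_s_t1 = spearman_rho(correct_at_s, correct_at_t1)
```

In this invented example the easiest items at stage S are the hardest at T-1, so rho takes its most negative value; the reported coefficients of -.23 and -.15 indicate a much weaker reversal of the same kind.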

Examination of individual records helped 
reveal the nature of this shift. The characteri- 
zation of Linda that emerged from a compila- 
tion of most-frequently-chosen alternatives 
under condition S suggested a pre-TAT stereo- 
typing of Linda as a well-adjusted, fun-loving 
young woman, a happy wife, and a good 
mother. By contrast, the stereotype suggested
by most-frequently-chosen alternatives in the 
two post-TAT administrations characterized 
her as severely maladjusted, an irritable, 
moody person, one dissatisfied with her lot 
both as wife and as mother, etc. 


In order to obtain a better definition of
this shift, weights on an adjustment-malad-
justment continuum were determined for each
of the alternatives in the test. Three persons
not involved in the postdiction tasks inde-
pendently ranked the four alternatives for
each item on the continuum, and the mean of
the three ranks was assigned as the weight for
a particular alternative. (Several items which
did not lend themselves to such ranking were
omitted from further consideration.) The ster-
eotype profile for each condition, determined
by computing the mean weight for each item, 
is presented in Figure 1. The items are ar- 
ranged in terms of the mean weight assigned 
to the correct answer: items at the top of the 
figure are those for which the correct answer 
was judged to constitute relatively adjustive 
behavior; those at the bottom are items for 
which the correct answer was judged to con- 
stitute relatively maladjustive behavior. The 
fact that there is no overlap between the S 
profile and the profiles obtained from T-1 and 
T-2 is strikingly evident. 
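The weighting procedure just described can be sketched as follows. The ranks and choices are hypothetical, the names are illustrative, and the direction of the scale (4 as most adjustive) is an assumption, since the text does not state it.

```python
from statistics import mean

# Hypothetical ranks given by three independent raters to the four
# alternatives of one item. Treating 4 as the most adjustive rank is an
# assumption; the text does not state the direction of the scale.
RATER_RANKS = {
    "a": [4, 4, 3],
    "b": [3, 3, 4],
    "c": [2, 2, 2],
    "d": [1, 1, 1],
}


def alternative_weights(ranks_by_alternative):
    """Weight of each alternative = mean of the ranks the raters gave it."""
    return {alt: mean(ranks) for alt, ranks in ranks_by_alternative.items()}


def profile_point(weights, chosen_alternatives):
    """Mean weight of the alternatives a group of judges chose on one item."""
    return mean(weights[alt] for alt in chosen_alternatives)


weights = alternative_weights(RATER_RANKS)
# Hypothetical choices of five judges on this item under one condition.
point = profile_point(weights, ["c", "d", "d", "c", "b"])
```

Repeating `profile_point` for every retained item under each condition would give one profile per condition, of the kind plotted in Figure 1.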

[Figure 1: Group N profiles for stages S, T-1, and T-2, item number plotted against adjustment weight (high to low).]

FIG. 1. SHIFT IN GROUP N MEAN BY ITEM FOR STAGES
S, T-1, AND T-2 WHEN ALTERNATIVES ARE WEIGHTED
ON ADJUSTMENT-MALADJUSTMENT SCALE

To test the significance of the shifts revealed
in Figure 1, a score was determined for each
judge by summing the adjustment weights as-
signed to each alternative he selected in the
test, and t tests were performed between
groups of judges for the several administra-
tions of the test. Between S and T-1, t = 2.807
(df = 46); between S and T-2, t = 3.617 (df =
36). Both are significant beyond the .01 level.
The t of .708 (df = 36) between T-1 and T-2
is not significant. It is clear that a rather
marked change in stereotype developed as a
consequence of exposure to the TAT protocol;
in effect, the group as a whole tended to shift
from a stereotype of an essentially healthy
person to that of a quite maladjusted one. Fur-
thermore, the shift was somewhat extreme
within the limits permitted by the nature of
the alternatives. Although Linda, as known to
the investigator and to his informants, was
neither perfectly adjusted nor unusually dis-
turbed emotionally, the tendency among stu-
dents was to see her first as one, then as the
other.
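A sketch of such a comparison is given below. It assumes an independent-groups t test with pooled variance, which the text does not specify; the df of 46 reported for S against T-1 is at least what 24 + 24 - 2 gives under that assumption. The scores are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def pooled_t(sample_a, sample_b):
    """Two-sample t with pooled variance; returns (t, degrees of freedom)."""
    n_a, n_b = len(sample_a), len(sample_b)
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * variance(sample_a)
                  + (n_b - 1) * variance(sample_b)) / df
    t = (mean(sample_a) - mean(sample_b)) / sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return t, df


# Hypothetical summed adjustment-weight scores for two small groups of judges.
scores_stage_s = [62.0, 58.5, 60.0, 65.5, 59.0]
scores_stage_t1 = [51.0, 55.5, 49.0, 53.5, 50.0]
t_value, degrees_of_freedom = pooled_t(scores_stage_s, scores_stage_t1)
```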

No claim can be made for the representative- 
ness of the items sampled. It is not known, for 
example, what proportion of the total of 
Linda's behaviors over a three- or four-month 
period might be judged as adjustive or malad- 
justive. But the items were not intentionally 
selected to give the balance suggested by the 
profile of correct answers in Figure 1. In this 
study all behavior or events which seemed
amenable to clear statement in the form of a 
multiple-choice item were utilized. The weights 
for correct answers were not determined until 
after the judges’ responses had been collected. 

The negative rho reported in Table 2 be- 
tween items ranked for difficulty under condi- 
tion S and under conditions T-1 and T-2 sug- 
gested what was verified by inspection, viz., 
that although mean accuracy scores changed 
very little before and after TAT, the items 
answered correctly by Group N judges under 
the two circumstances were by and large dif-
ferent sets of items. Under condition S, since
the group initially entertained a stereotype of
a well-adjusted person, they appear to have 
guessed correctly items for which the correct 
answer conformed to the stereotype and erred 
on those items in which the correct answer did 
not conform to that stereotype. Conversely, 
when, after studying the TAT, the group’s 
stereotype shifted to a conception of an ex- 
tremely maladjusted person and selections 
tended rather consistently to favor the alterna- 
tive most characteristic of maladjustment, the 


WILLIAM F. SOSKIN


group tended to err on those items in which, 
so far as is known, Linda's behavior was actu- 
ally characteristic of good adjustment. 
Effect of experience. To explore the possible
effects of further training and experience in the
use of projective techniques on ability to make
accurate postdictions in this type of setting,
and also to study the contributions of different
kinds of projective information to postdiction,
the Linda test was administered to the two E
groups, persons more experienced than the
students in Group N. Essentially two kinds of
comparisons were desired, viz., comparison of
stereotype scores and comparison of scores
based on analysis of projective test data.

Unfortunately, circumstances under which
the cooperation of the more experienced work-
ers was secured did not permit the collec-
tion from a single group of the kinds of data
necessary for the two comparisons mentioned
above. Rather, the stereotype data were col-
lected from one group and the postdiction data
from another. Group Estereo, as was mentioned
earlier, was comprised of 12 advanced trainees
in the clinical psychology program of a large
Midwestern university and three supervisory-
level clinical psychologists, whereas Group
Eproj consisted of three advanced trainees and
two supervisory-level psychologists at a large
mental hospital.

The stereotype was based on essentially the
same procedure as that utilized with Group N.
Group Estereo was given the same basic in-
formation regarding Linda's age, marital sta-
tus, etc., and was then asked to read through
the entire set of questions and alternatives
before attempting to guess the answers. Scores
were weighted in the manner described earlier.
The comparison of weighted means for each
item is presented in Figure 2, along with the
stereotype curve of Group N. It is evident
from the figure that the more experienced indi-
viduals, even though they had no different
or additional data about her, tended at the
very outset to select a higher proportion of
maladjustive responses than did the novices.
In 18 out of 21 items Group Estereo deviated
from Group N in the direction of the malad-
justive pole. By the sign test the direction of
difference is significant at the .01 level. How-
ever, the t between adjustment-weight scores
of the two groups is only 1.719 (df = 37; .05 <
p < .10).
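The sign test on 18 of 21 items can be checked with a short sketch. It assumes the usual exact binomial form of the test with no tied items; the function name is illustrative.

```python
from math import comb


def sign_test_p(successes, n, two_sided=True):
    """Exact binomial sign test under p = 0.5.

    Assumes `successes` is the larger of the two counts (at least n / 2).
    """
    upper_tail = sum(comb(n, k) for k in range(successes, n + 1)) / 2 ** n
    return min(1.0, 2 * upper_tail) if two_sided else upper_tail


# 18 of the 21 items deviated toward the maladjustive pole.
p_one_sided = sign_test_p(18, 21, two_sided=False)
p_two_sided = sign_test_p(18, 21)
```

Both values fall below .01, in agreement with the significance level reported in the text.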


[Figure 2: profiles by item number against adjustment weight (high to low) for Group N (S), Group Estereo, and the mean weight of the correct answer.]

FIG. 2. PROFILE OF MEAN WEIGHT FOR THE STEREOTYPE
SCORES OF NOVICES (GROUP N) AND FOR MORE
EXPERIENCED USERS OF PROJECTIVE TESTS (GROUP
Estereo) AS COMPARED WITH THE PROFILE OF
WEIGHTED CORRECT ANSWERS


TABLE 3

COMPARISON OF SCORES OF GROUP N USING TAT AND
OF GROUP Eproj USING BOTH
RORSCHACH AND TAT

GROUP                                  N     MEAN     RANGE
Group N on T-2                         19     9.42    6-14
Group Eproj on Rorschach                5    10.90    7-13
Group Eproj on Rorschach plus TAT       5     9.60    6-13


A second comparison was made between the 
Group N score with the TAT at the conclusion 
of its course of training (T-2) and the scores of 
Group Eproj first with the Rorschach alone,
then with Rorschach plus TAT. Group Eproj
was provided with the same basic information
initially available to Group N and, in addition,
had available a typed Rorschach protocol with
a rather detailed inquiry and location chart,
and, later, typed manuscripts of the entire 20 
TAT stories. 

As Table 3 indicates, there was no signifi-
cant difference between mean scores of novice
judges and this small sample of more experi-
enced judges. Furthermore, two different tests
were not significantly better than one test
alone as aids to this type of postdiction. The
group means of adjustment-weight scores for
Group N at T-2 and Group Eproj with Rorschach
plus TAT, or between the two Eproj
groups, are not significantly different according
to t tests. This is best revealed in Figure 3
where the stereotype profiles for Group Eproj
using Rorschach alone and then Rorschach
plus TAT are compared with the profiles ob-
tained for Group N under condition T-2 (TAT
at end of course). With minor variations, the
three profiles show a striking similarity. They
suggest that within the limits of variation per-
mitted by the Linda test items, individuals
with from two to five years more experience in
the use of projective techniques tended, for
the purpose of postdiction, to conceptualize
the subject in more or less the same way as did
the novices. Both Group Eproj and Group N,
after study of the projective devices, tended in
a majority of items to pick alternatives more
suggestive of maladjustment than adjustment.

[Figure 3: profiles by item number against adjustment weight (high to low) for Group N TAT (T-2), Group E Rorschach, Group E Rorschach and TAT, and the correct answer.]

FIG. 3. GROUP WEIGHTED MEAN BY ITEM FOR GROUP
N (T-2) AND FOR GROUP E (RORSCHACH) AND
GROUP E (RORSCHACH AND TAT)


Discussion 


While the results are in certain respects
quite clear, their limitations should be made
explicit, since both the accuracy of postdictions
and the tendency toward bias are here studied
within the context of a difficult and somewhat
uncommon task. For one thing, users of pro- 
jective tests are not often called upon to post- 
dict specific situational behaviors. Further- 
more, it may be argued that projective devices 
of the sort employed in this study were not 
designed to supply the information requisite 
for such postdictions. Nevertheless, it would 
appear that neither unfamiliarity with the 
task, nor shortcomings in the criterion, nor 
claims for the specialized nature of projective 
devices should vitiate certain of the results. 
When novices and more experienced persons 
used the Rorschach and/or TAT as the basis 
for conceptualizing a subject they appear to 
have been predisposed to anticipate more mal- 
adjustive tendencies and, hence, in fitting this 
conceptualization to the alternatives of the 
criterion test, were led to select alternatives 
more indicative of maladjustment. It seems 
reasonable to assume that if the conceptualiza- 
tion were translated into a diagnostic report 
rather than postdictions, a similar distortion 
would have maintained, although in less meas- 
urable form. 

Whether this tendency toward distortion is 
a function of the selective information con- 
veyed by the projective instruments or is at- 
tributable to characteristics of the users of the 
information is not unequivocally established 
in this study. It might well be, for example, 
that novices characteristically develop dis- 
torted conceptualizations, whereas among more 
experienced persons some continue to be rela- 
tively undiscriminating while others gradually 
learn more accurately to integrate signs of ad- 
justment with those of maladjustment. The 
population available for this study was too
small to permit investigation of this possibility. 
The results indicate merely that both experts 
and novices are subject to this distortion. 

As the foregoing implies, these results can- 
not be taken to indicate that there is no differ- 
ence between novices and more experienced 
persons in their ability to use projective de- 
vices accurately. It seems sounder to assume 
that which is most in accord with observation, 
viz., that some people learn with experience 
while others do not, and that among more ex- 
perienced users of these instruments there will 
be both "good" and "poor" predictors. Neither
the size of the population nor the sophistica-
tion of the criterion test warrants attempting
to investigate this question from the available
data, however.


SUMMARY 


A postdiction test consisting of 24 multiple- 
choice items pertaining to behaviors deemed 
characteristic of an adult female by the in- 
vestigator and by informants well acquainted 
with the S was administered for comparative 
purposes to a group of graduate students tak- 
ing their first course in TAT interpretation and 
to two groups of workers more experienced in 
the use of projective techniques. The following 
trends were observed:

1. Novices, at the beginning of the course,
provided with only a few basic items of infor-
mation such as age, race, marital status, etc.,
tended to characterize the S as well-adjusted.

2. Although novices held an initially favor-
able stereotype of S, a marked change occurred
immediately after reading her TAT, before
training in interpretation had begun, and the
group's characterization shifted in the direc-
tion of maladjustment.

3. After completion of the training course,
the novices' characterization of S remained es-
sentially as it stood at the beginning of the
course, on their first exposure to the TAT pro-
tocol, and there was no significant improve-
ment in accuracy scores.

4. A group of more experienced users of pro-
jective techniques using either Rorschach
alone or Rorschach plus TAT earned postdic-
tion accuracy scores not significantly different
from those earned by the novices at the end
of their course, and showed the same tendency
to postdict preponderantly maladjustive be-
haviors after study of the projective tests.

These results, within the limitations imposed
by the particular array of test items and by
the generalizability of results obtained from
the study of a single person, suggest that the
data of projective tests predispose toward an
overestimation of maladjustive trends in post-
diction situations.


REFERENCES 


1. KELLY, E. L., & FISKE, D. W. The prediction of
performance in clinical psychology. Ann Arbor:
Univer. of Michigan Press, 1951.

2. SOSKIN, W. F. A study of personality ratings based
on brief observation of behavior in standard
situations. Unpublished doctor's dissertation,
Univer. of Michigan, 1948.


Received October 16, 1952.




RORSCHACH CORRELATES OF RESPONSE TO SUGGESTION¹


HARRIET B. LINTON 


Yale University 


A STUDY that relates experimental meas-
ures of behavior to the results of a
personality test may have two aims: 
to use the experimental measures as a stand- 
ard for evaluating the test, or to use the test 
as a standard for interpreting the experimental 
results. The same study may do both; in any 
event, the discovery of significant correlations 
between responses in situations superficially so 
different (experiment and test) can lead to 
fruitful ideas about the bases underlying the 
correlations. Similarly, the discovery that a 
personality test fails to correlate with be- 
havioral measures which it is supposed to 
represent does not necessarily damn the test, 
but should rather be used to gain a clearer 
picture of the kinds of interpretations that 
can validly be made of the test variables. 
Since even the most enthusiastic supporters of 
a test like the Rorschach do not claim that the 
test does everything, it seems less useful to 
"prove" or “disprove” the test than to find 
out what it can and what it cannot do. 

This study investigates relationships be-
tween 11 Rorschach variables and change of
judgments of autokinetic movement under the
influence of planted "judgments" given by the
experimenter's confederate. It tries to use the
Rorschach to learn about personality factors
underlying change of judgments as well as to
test the ability of the Rorschach to predict such
behavior. Previous studies in this area have
produced conflicting results. Schumer (10)
found significant relationships between the
M response and response to suggestion, as
well as other experimental measures. Steisel
(12), however, found no significant relation-
ships among 12 Rorschach measures and 5
measures of suggestibility.

¹ This study was carried out while the writer was a
Public Health Research Fellow of the National Institute
of Mental Health. The autokinetic data were gathered
as part of the writer's dissertation experiment and the
sponsors of that experiment, Drs. L. W. Doob, C. I.
Hovland, and S. Sarason, were of much help in planning
that portion of the experiment. Special thanks are due
to Dr. H. A. Witkin for the use of the facilities of his
laboratory at Brooklyn College, as well as for many
[illegible].

One reason for the failure of many studies 
to find reliable relationships between Ror- 
schach measures and overt behavior is that 
they tap different levels of the person. Ex- 
periments usually elicit single acts of overt 
behavior that may have a variety of deter- 
minants. Rorschach workers unanimously 
state that in order to predict behavior each 
test variable must be considered in relation to 
the total test pattern, and that a piecemeal 
interpretation of a protocol is not valid. If this 
is so, it is to be expected that predictions based 
on the interpretations of isolated Rorschach 
measures will often turn out to be invalid. It 
is, however, difficult to make predictions based 
on more complex Rorschach patterns when the 
meaning of the single variables is not suf- 
ficiently understood. 

The present study attempts to deal with 
this dilemma by obtaining supplementary in- 
formation about each subject’s feelings and 
reactions in the experimental situation, with 
the aim of making the experimental measure 
less crude. Since different factors may produce 
the same overt response in different subjects, 
this technique is intended to provide informa- 
tion about the specific factors that contribute 
to the overt behavior of each subject. If this 
can be done, it should be easier to develop 
hypotheses about personality characteristics 
and test scores associated with each specific 
factor than to predict the personality cor- 
relates of a single measure that summarizes 
the subject’s reaction to the total situation. 

Each subject was interviewed to learn more 
about his reactions to the autokinetic situation 
than could be learned from his overt behavior 
alone; in addition, one behavioral observa-
tion was made. From this, three distinct
variables could be isolated which predispose a
subject to change or not to change his judg- 
ments; they are described below. The nature 
of these variables then led to the choice of 
Rorschach variables that might be expected to 
be relevant. 


EXPERIMENTAL PROCEDURE 


Subjects. Ss were 45 male freshmen at Brooklyn Col- 
lege. All were volunteers and all were naive about the 
Rorschach test and the nature of the autokinetic 
situation. The age range was from 16 to 19¾ years,
with a mean of 17.5. 

Autokinetic apparatus and procedure. To save space, 
the apparatus and procedure will not be described fully 
here (cf. 7 for fuller description); only a few significant 
features will be noted. The procedure follows Kelman’s 
(5), with the difference that Ss were brought blindfolded 
into the lightproof room and their blindfolds were 
replaced between groups of trials. They did not see 
the room at all and saw no light except the autokinetic 
light during the experiment. The same confederate²
was used for all Ss; he was a college student and tried 
to resemble Ss as much as possible in dress and manner. 
He was presented as another S. 

There were three groups of trials. The first consisted 
of 20 trials in which S wrote his responses in a booklet. 
The E then averaged S's estimates of the light’s move- 
ment for trials 11 to 20. Before the next series, E asked 
for S's class and major. All were lower freshmen; the 
confederate replied that he was a lower senior psy-
chology major. This was done to endow him with po-
tential prestige to which S might or might not react.
Then followed a series of 40 trials in which estimates
were given orally, with the confederate speaking first 
on each trial; his responses were formed by adding to 
the S's initial mean (to the nearest integer) a series of 
numbers averaging five. A series of 20 written trials 
followed the oral series. 
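The construction of the confederate's oral responses can be sketched as below. The text states only that a series of numbers averaging five was added to S's initial mean, rounded to the nearest integer; the particular offsets, the rounding of halves upward, and all names here are hypothetical.

```python
from itertools import cycle, islice


def initial_mean(first_series):
    """Mean of trials 11 to 20 of the first (written) series of 20 estimates."""
    window = first_series[10:20]
    return sum(window) / len(window)


def confederate_responses(first_series, offsets):
    """Add each offset to S's initial mean, taken to the nearest integer."""
    # Halves are rounded upward here; estimates are assumed to be positive.
    base = int(initial_mean(first_series) + 0.5)
    return [base + offset for offset in offsets]


# Hypothetical offsets: 40 values (one per oral trial) averaging exactly five.
HYPOTHETICAL_OFFSETS = list(islice(cycle([3, 4, 5, 6, 7]), 40))

# Hypothetical written estimates for the first 20 trials.
written_estimates = [2] * 10 + [4, 6] * 5
oral_script = confederate_responses(written_estimates, HYPOTHETICAL_OFFSETS)
```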

The S was then interviewed, with the following 
points covered: how easy or difficult judgments had
been, factors that helped or hindered him, which
method he preferred (written or oral) and why, whether 
he felt that the presence of another “S” interfered with 
his accuracy, how far away he thought the light was, 
direction(s) of movement seen, and whether he felt 
that the average movement varied in the three series. 

Rorschach test administration. The time between the 
autokinetic experiment and the Rorschach test ranged 
from 3.9 to 7.2 months, with a mean of 5.5 months. 
The test was administered individually, according to 
the procedure outlined by Klopfer and Kelley (6). In all 
cases the actual test was completed in one session; in a
few cases the inquiry had to be completed in a later
session.


AUTOKINETIC SITUATION 


Change of judgments. The t ratio of the mean 
difference between the oral series and trials 
11 to 20 of the initial series was computed for 
each S. Fourteen Ss with t ≥ 4.50 were classed
as "changers" and the remaining 31 Ss as
"nonchangers." The reasons for dichotomizing
are too lengthy to present here, but may be
found elsewhere (7). This criterion has an r of
.91 with Ss’ own statements that they were or 
were not influenced. 


² The writer would like to thank Mr. Arthur Shaw
for his skillful performance in the role of confederate.


Supplementary variables. Certain responses 
occurred often enough in the interview to
be categorized; in addition, E noted each S's 
ability to handle himself while blindfolded. It 
was thus possible to score each S for three 
major variables. 

1. Coping is a measure of the ability to deal 
with the impact of an unfamiliar and difficult 
task in which S is not only without help, but 
is deprived of the visual field. It is based only 
on reactions occurring before the confederate's 
responses were available to S. 

The Ss were rated as "coping" if they re- 
ported activity directed toward achieving a 
stable basis of judgments (e.g., visualizing a 
ruler, estimating speed and time), did not have 
outstanding difficulty in handling their bodies 
(walking, finding seat) while blindfolded, and 
did not report being upset by the darkness or 
strangeness of the situation. In addition, one 
S who reported effort to reach a basis of judg- 
ment and reported being upset, but showed 
outstandingly good handling of his body, was 
rated as coping. The Ss were rated as “non- 
coping" if they did not report effort to reach a 
basis of judgment, or if they did, but in addi- 
tion either reported being disturbed or showed 
very poor handling of themselves while blind- 
folded. There were 25 copers and 20 non- 
copers. 

2. Responsiveness to confederate refers to
reactions occurring after the confederate's
response became available to S. The Ss were
classified as "responsive" (17 Ss) if, when they
realized that the other person's judgments
differed from their own, they felt that they
must be wrong, or that he must know more
about it, possibly because of his superior 
status, or they felt impelled to follow him. 

3. Negativism, like responsiveness, is a reac-
tion to the other person, but in the opposite
direction. Twelve Ss reported deliberate at-
tempts to avoid being influenced, such as
making up their own minds before they heard
the confederate. Three of this group were also
responsive.


Autokinetic interrelations. All of the supple-
mentary measures are highly related to change
of judgments; changers tend not to cope
(r = -.93), to be responsive (r = .90), and not
to be negativistic (r = -.68). The supple-
mentary measures also relate to each other,
but not as much. The only substantial corre-
lation among them is -.61 between coping
and responsiveness; negativism correlates .30
with coping and -.24 with responsiveness.

The correlations support the notion that 
the supplementary measures, while not them-
selves measures of influencibility, reflect
aspects of the person that predispose him to be
more or less easily influenced, since each cor-
relates highly with change of judgment, but to
a lesser extent with the others. If lack of cop-
ing, responsiveness, and lack of negativism
are considered as attributes predisposing to
change of judgments, all 10 Ss with three pre-
disposing factors do change, 4 of the 12 Ss
with two factors change, and none of the 23 
Ss with one or no factors changes. Since it 
appears that these factors represent relatively 
distinct aspects of the person, each of which 
plays a role in determining susceptibility to 
influence, investigation of Rorschach corre- 
lates of each of them should contribute to a 
fuller understanding of the meaning of what- 
ever relationships are found between change 
of judgments and Rorschach measures. 
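The tally of predisposing factors can be sketched as follows. The counts are those reported in the paragraph above; the function and variable names are illustrative only.

```python
def predisposing_factor_count(coping, responsive, negativistic):
    """Number of factors predisposing to change: lack of coping,
    responsiveness to the confederate, and lack of negativism."""
    return int(not coping) + int(responsive) + int(not negativistic)


# Counts reported in the text: factor group -> (number of Ss, number who changed)
REPORTED = {
    "three factors": (10, 10),
    "two factors": (12, 4),
    "one or no factors": (23, 0),
}

total_ss = sum(n for n, _ in REPORTED.values())              # 45
total_changers = sum(ch for _, ch in REPORTED.values())      # 14
```

The totals agree with the 45 Ss and 14 changers reported earlier.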


RORSCHACH MEASURES 


The analysis of reactions to the auto-
kinetic situation suggests that relevant aspects
of personality are (a) adequacy and manner of
coping with new, unstructured tasks; (b) self-
attitudes, such as self-esteem, richness of in-
ternal resources, self-assertion, and acceptance
of own impulses; and (c) responsiveness to
external stimuli, including its magnitude, ade-
quacy of control, and relationship to S's
responsiveness to feelings originating within
the self. To evaluate these aspects, a series of
indicators, or signs, is used.³ The specific
criteria used here are not necessarily applicable
to another sample, but are based on the dis- 
tributions of responses in the present sample. 

³ The Rorschach signs used in this experiment are
based on the signs developed by Hertzman for use in a
study of personality and perception (14). The W, Hd,
and P signs were used unchanged and the F%, M
Quantity, and C Control signs were used with slightly
modified criteria. The M Type, Animal Type, Sum C,
M:Sum C, and C Trend signs were added for the present
study. The writer is grateful to Dr. Hertzman for his
advice and for making available unpublished material.

W sign. A good whole response, since it
entails dealing with the impact of the blots as
a whole, reflects ability to cope successfully
with a complex, unstructured stimulus. The
sign is given to 18 Ss with three or fewer ade-
quate W, or with more than three adequate
but an equal or greater number of inadequate
W. Adequate W are those based primarily on
form (F, M, FC, etc.) and have good form
quality. Inadequate W are those determined
primarily by color or shading, or those with
poor use of form (F-, as evaluated by Beck's
tables, 1). Cut-off W is included.
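The W criterion reduces to a simple rule on two counts, sketched here. The function name is illustrative, and the classification of each W as adequate or inadequate is assumed to have been made beforehand.

```python
def w_sign(adequate_w, inadequate_w):
    """True when S has three or fewer adequate W, or more than three
    adequate W matched or outnumbered by inadequate W."""
    return adequate_w <= 3 or inadequate_w >= adequate_w
```

For example, `w_sign(5, 5)` gives the sign, whereas `w_sign(4, 2)` does not.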

P sign. Popular responses, since they are 
seen by a large proportion of any normal 
population, represent the most easily seen 
images. Seeing a popular figure represents an 
ability to make use of the most obvious or 
readily available features of the environment 
in an everyday, common sense manner. The 
sign isolates Ss who fail to do this to a normal 
degree, and is not concerned with the sig- 
nificance of an excessive number of P. The 
sign is given to 18 Ss with five or fewer P, 
based on a list of 20 P responses that includes 
all those given by Beck (1) and by Klopfer (6) 
with the exception of "clouds" to Card VII. 
Near-populars and those added in the inquiry 
are given a value of one-half. 

M Quantity sign. The number of human 
movement responses is used as a measure of 
inner resources and responsiveness to prompt- 
ings originating within the self. The sign is 
given to 18 Ss with one or no M; animals in 
human-like action are scored as M. 

M Type sign. Since it is likely that in M 
responses the S projects more of his self-image 
than he does in other responses (4), the nature 
and quantity of the M content are significant. 
The sign deals with the degree of self-assertion 
shown; this corresponds roughly, though not 
exactly, to Rorschach's distinction between 
extensor and flexor M (9). In all, 89 M re- 
sponses were rated as “assertive”; these con- 
sist primarily of extensor M plus a few that are 
not extensor but involve aggression (e.g., 
standing ready for a fight); all M involving 
violent activity were included. The "unasser-
tive" category included 16 cases of flexor M
(crouching, holding onto something for sup- 
port, etc.), 9 cases of oral gratification (kiss- 
ing, sucking, eating, smoking), and 12 re- 
sponses of a neutral nature (talking, walking). 
The sign was given to 18 Ss whose unassertive 
M responses were equal to or greater than the 
number of assertive M, including Ss with no 
M at all. 

Hd sign. A high ratio of parts of human
figures to whole figures may be taken to re-
flect a low level of self-acceptance or self-es-
teem. While M also relates to self-attitudes, it
represents more closely the awareness of ten- 
dencies originating within the self, which need 
not imply acceptance of these tendencies. In 
some cases, Hd may reflect active self-critical 
tendencies. The sign was given to 19 Ss with a 
number of Hd equal to or greater than the 
number of H; three Ss with no human content 
at all were excluded from the analysis. 

Animal Type sign. This sign was developed 
after testing a few Ss. It soon appeared that, 
while changers and nonchangers did not dif- 
fer in total A%, they did see different kinds 
of animals. Changers not only saw few human 
images, but their animal content concentrated 
on insects, sea life, birds, and bats. Non- 
changers saw few such animals; some saw much 
human and little animal content while others 
saw little human content, but their animals 
were those that lend themselves most easily 
to the projection of human emotions and 
situations, such as dogs, rabbits, elephants,
and bears (cf. children's stories). It is sug-
gested that such human-like animals reflect
relative acceptance of S's own emotions, while
a high proportion of distant animals reflects a
tendency to depersonalize such feelings. Ani-
mals were divided into mammal and non-
mammal, and the sign was given to 23 Ss for
whom nonmammals formed 40 per cent or
more of the total human plus animal content;
whole as well as part figures are included. The
one departure from correct zoological classifica-
tion was to place bats in the nonmammal
category, on the hypothesis that they are
generally felt to be alien and do not elicit the
same identification that other mammals do.
Nonmammals include insects, spiders, marine
and amphibian life, reptiles, birds, and bats;
mammals include humans and all other mam-
mals except bats (the question of whales did
not arise).
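The Animal Type criterion can be sketched as below. The content labels are hypothetical shorthand for the categories named in the text, and the handling of a record with no human or animal content is an assumption.

```python
# Hypothetical labels for the nonmammal category described in the text
# (bats are deliberately placed here, as the author did).
NONMAMMAL = {"insect", "spider", "marine", "amphibian", "reptile", "bird", "bat"}


def animal_type_sign(content_labels):
    """content_labels: one label per human or animal response, whole or part.

    True when nonmammals form 40 per cent or more of that content.
    """
    if not content_labels:
        return False  # assumption: no human or animal content, no sign
    nonmammal = sum(1 for label in content_labels if label in NONMAMMAL)
    # Integer comparison avoids floating-point error at exactly 40 per cent.
    return nonmammal * 100 >= 40 * len(content_labels)
```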

F% sign. While all of the above signs meas- 
ure various forms of constriction, the best 
measure of over-all constriction is the propor- 
tion of all responses based on form alone, un- 
enlivened by color, shading, or movement— 
responses in which S limits himself to the most 
“rational” interpretation of the figures. The 
sign is given to 16 Ss with an F% of 50 or 
more. 




Sum C sign. The total response to color 
stimulation is used as a measure of emotional
responsiveness to external stimuli. The usual
weights of 0.5 for FC, 1.0 for CF, and 1.5 for C
are used; all responses using color were in- 
cluded, even when the primary scoring was 
M or FM, or when color was used for only part 
of the concept. The sign is given to 22 Ss with 
Sum C of 2.5 or more. 
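The weighting just described amounts to a simple weighted count. The following is a minimal sketch, assuming each color response has already been scored as FC, CF, or C; the function names and the list-of-strings input are illustrative assumptions, not part of the original procedure.

```python
# Weights as given in the text: 0.5 for FC, 1.0 for CF, 1.5 for C.
COLOR_WEIGHTS = {"FC": 0.5, "CF": 1.0, "C": 1.5}


def sum_c(color_scores):
    """Weighted total of color responses (Sum C)."""
    return sum(COLOR_WEIGHTS[score] for score in color_scores)


def sum_c_sign(color_scores, cutoff=2.5):
    """True when Sum C reaches the cutoff of 2.5 used for the sign."""
    return sum_c(color_scores) >= cutoff
```

A record scored `["FC", "FC", "CF", "C"]` gives 0.5 + 0.5 + 1.0 + 1.5 = 3.5 and so meets the criterion.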

M:Sum C sign. While a high Sum C is
taken to reflect a high emotional responsiveness,
its effect may be counterbalanced by a
high M score. The sign was given to 20 Ss with
Sum C equal to or greater than M.

C Control sign. The extent to which color
responses are controlled by rational considerations,
i.e., form, is important. If two Ss have
the same Sum C, but one concentrates on FC
and the other on CF and C, the latter may be
less discriminating in his emotional reactions
and perhaps more easily influenced as a result.
While a high F% also indicates control, it is
control by constriction; in contrast, a large
amount of FC suggests a spontaneity that is
under rational control. The sign is given to 12
Ss whose total number of CF and C equals or
exceeds the number of FC; symbolic CF and C
are excluded since they involve a measure of
control.

C Trend sign. This sign was also added after
inspection of the data, when it appeared that
some Ss, over the sequence of five cards that
include color, showed an increase in the control
of color responses by form, while others
showed a loss of control. Responses to each
card are “controlled” if FC outnumbers CF
and C, and “uncontrolled” if CF and C equal
or outnumber FC, with symbolic CF and C
excluded. The sign was given to 18 Ss whose
control deteriorated, i.e., progressed from
controlled or no color to uncontrolled color, or
from controlled to no color. In dubious cases
more weight was given to Cards VIII, IX, and
X; 3 Ss with no color responses were excluded
from the analysis.
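The per-card rule and the definition of deterioration can be expressed compactly. The sketch below is an assumption-laden illustration: the state names and function names are hypothetical, and it covers only the comparison of two states. The judgment over the whole sequence of five cards, including the extra weight given to Cards VIII, IX, and X in dubious cases, is not reproduced here.

```python
def card_state(fc, cf, c):
    """Classify the color responses to one card.

    Counts exclude symbolic CF and C, as in the text. A card with no color
    responses is treated as its own state, "none".
    """
    if fc + cf + c == 0:
        return "none"
    return "controlled" if fc > cf + c else "uncontrolled"


def is_deterioration(earlier, later):
    """True for the transitions the text counts as loss of control:
    controlled or no color -> uncontrolled, or controlled -> no color."""
    if later == "uncontrolled":
        return earlier in ("controlled", "none")
    return earlier == "controlled" and later == "none"
```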


RESULTS

Table 1 shows the tetrachoric correlations
of the Rorschach signs with the four autokinetic
measures. The prediction is that the
signs will correlate positively with change of
judgments and responsiveness and negatively
with coping and negativism. Table 1 is arranged
so that a positive correlation confirms
the predictions.


TABLE 1
TETRACHORIC CORRELATIONS BETWEEN RORSCHACH SIGNS AND AUTOKINETIC MEASURES

                                  FACTORS THAT PREDISPOSE TO CHANGE
                   CHANGE OF      COPING                         NEGATIVISM
RORSCHACH SIGNS    JUDGMENTS      (lack of)     RESPONSIVENESS   (lack of)

W                  [illegible]    .29           .03              [illegible]
P                  .66**          .42*          .46*             -.04
M Quantity         .39†           .29           -.12             [illegible]
M Type             .66**          .66**         .18              .76**
Hd                 [illegible]    .42*          .12              -.94
Animal Type        .45*           .39†          .19              [illegible]
F%                 .49*           .29           -.01             .47*
Sum C              .34            .17           [illegible]      .15
M:Sum C            [illegible]    .43*          .50*             .42†
C Control          .06            -.24          .26              .30
C Trend            .16            [illegible]   .48*             .35†

Note. The direction of all measures is such that a positive correlation supports the predictions. Since predictions were made, probabilities
are based on one tail of the probability distribution.
* Significant at the .05 level.
** Significant at the .01 level.
† Significant at the .10 level.
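The text does not say how the tetrachoric coefficients were computed. As a rough illustration of what such a coefficient measures for two dichotomized variables (sign present or absent, behavior present or absent), the sketch below uses the classical cosine-pi approximation. This is a swapped-in approximation chosen for brevity, not the author's stated method, and the cell names are assumptions.

```python
import math


def tetrachoric_cos_pi(a, b, c, d):
    """Cosine-pi approximation to the tetrachoric correlation.

    The 2 x 2 table is laid out as
        a = both present      b = first present only
        c = second present only   d = both absent
    so a and d are the concordant cells and b and c the discordant ones.
    """
    if b * c == 0:
        # No discordant product: perfect agreement if concordant cells exist,
        # otherwise the coefficient is undefined.
        return 1.0 if a * d > 0 else float("nan")
    ratio = (a * d) / (b * c)
    return math.cos(math.pi / (1.0 + math.sqrt(ratio)))
```

With hypothetical counts a = 15, b = 5, c = 5, d = 15 the odds ratio is 9, and the approximation gives cos(pi / 4), about .71. Equal cells give a value of zero.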

In general the predictions are supported. 
Of a total of 44 correlations, 39 are in the ex- 
pected direction; five are significant at the .10 
level, 13 at the .05 level, and 5 at the .01 level. 
(Since the direction of correlation was pre- 
dicted, significance levels are based on one tail 
of the probability distribution.) Of the ex- 
perimental measures, change of judgments has 
the highest group of correlations with the 
Rorschach signs; all are in the expected direction,
two at the .10 level, four at the .05 level,
and three at the .01 level. It is not surprising
that change correlates most highly with the
Rorschach signs, since it is a function of the
three predisposing factors, each of which has
a different pattern of correlation with the
Rorschach measures. The possible meanings
of these patterns will be discussed below.


DISCUSSION


Change of autokinetic judgments under social
influence relates significantly to most of the 
Rorschach measures. The highest r is with the 
M:Sum C sign, i.e., Ss who adopt the con- 
federate's responses as their own show, in the 
Rorschach, a responsiveness to external 
stimuli that is not countered by the self- 
awareness represented by M. Change is also 
highly associated with a lack of self-assertion
in human movement responses, i.e., Ss with a
relatively unassertive self-image tend to assume
a passive role in the experiment, while
Ss with an assertive self-image tend to form
their own judgments and stick to them. While
change correlates to differing degrees with the 
other Rorschach signs, the implications of 
each sign can best be understood by examining 
its correlations with each of the predisposing 
factors before returning to the relationships 
with change. 

Coping, or the person’s ability to handle the 
task on his own before help is available from 
the confederate, seems to be most closely re- 
lated to self-attitudes, since it relates most 
highly to the M Type sign and appreciably 
with the Hd, Animal Type, and P signs. The 
correlations with the W, F%, and M Quantity 
signs, although small, are noteworthy because 
they are in the expected direction, in spite of 
the fact that these signs do not correlate with 
responsiveness, which is highly associated 
(r = .61) with lack of coping. 

The Ss who do not cope successfully with 
the autokinetic task tend to have little spontaneity
(F%); the difference between the r’s
of the M Quantity and Sum C signs, as well
as the r with the M:Sum C sign, suggests that
their constriction applies particularly to 
tendencies originating within the self. Not 
only is their human content relatively static, 
but it has a high proportion of part to whole 
figures, indicating little self-acceptance. The
preponderance of nonmammalian content
further indicates an inability to accept their 
own impulses freely, since such animals do 
not easily convey human feelings. 

The most significant Rorschach sign is M 
Type, which gives a more direct picture of S's 
conception of himself. Coping Ss see much 
strong, assertive activity, while noncoping Ss, 
when they see movement at all, tend to reveal 
a passive or dependent role in relation to the 
environment (bending, holding on, eating, 
sucking). This corresponds to their behavior 
in the experiment, where they fail to take the 
initiative in dealing with a new and difficult 
task. In the Rorschach their failure to cope 
actively with a new, unstructured task is also 
reflected in their relative inability to produce 
good W responses. They also show a reduced 
ability to recognize the most easily seen forms 
(P), indicating that their inadequacy extends 
to more commonplace dealings with the en- 
vironment, as well as more demanding tasks 
(W).

The fact that Sum C, C Control, and C
Trend are unrelated to coping suggests that 
color responses reflect responsiveness to other 
people (cf. discussion of responsiveness) 
rather than reactions to external stimuli per se. 

In summary, noncoping Ss show a lack of 
spontaneity, low self-awareness and low self- 
acceptance, lack of self-assertion, and poor 
ability to cope with new, unstructured tasks. 

Responsiveness, while it is related to fewer 
Rorschach signs than coping, is related to dif- 
ferent signs. Its strongest relationship is to 
Sum C, resulting in a strong relationship with 
the M:Sum C sign as well. It also relates significantly
to the C Trend sign, and its low
correlation with C Control becomes more mean- 
ingful when the negative r of C Control and 
noncoping is considered. The tendency for 
responsive Ss to have deteriorating control as 
the successive impact of color stimulation in- 
creases is highlighted by an examination of 
responses to Cards IX and X, in which the 
color stimulation is especially difficult to 
handle. The production of color responses to 
both IX and X, with the response to X un- 
controlled, is found in 47 per cent of the respon- 
sive Ss and only 4 per cent of the unresponsive 
Ss; the r of this pattern with responsiveness is 
.86 and is significant at the .01 level. 

Responsive Ss, then, not only show Rorschach
patterns that reflect a high degree of
emotional responsiveness to external stimuli,
unbalanced by inner resources, but their emotional
reactions tend to be uncontrolled by
rational considerations, with the control deteriorating
as the impact of the external
stimulation builds up with successive exposure.
Furthermore, the lack of adequate popular
content shown by these Ss indicates less ability
to produce a common sense reaction to unstructured
situations. An inability to deal with
a situation in a common sense, practical manner,
combined with high emotional responsiveness
and a tendency to lose emotional control
under pressure, would be expected to make a
person prone to accept someone else's judgment
when a concrete decision is required, as in the
autokinetic situation.

Responsiveness is unrelated to the signs 
dealing with constriction (F%), self-aware- 
ness, or self-acceptance (M Quantity, M Type, 
Hd, Animal Type). It also fails to relate to W; 
while it relates to the common sense form of
coping reflected in P, it seems unrelated to the 
more demanding and creative form of coping 
reflected in W. 

Negativism. Those who actively try to resist
influence tend to project themselves actively
into the Rorschach, showing little constriction
(F%), highly assertive M content, and a predominance
of mammals in their animal-human
content. While negativism is unrelated to Sum
C or to M Quantity per se, the correlation with
the M:Sum C sign indicates that, although the
degree of over-all responsiveness to external
stimuli may vary, whatever such response
negativistic Ss do show tends to be subject to
inner controls. Furthermore, their color responses
tend to be under rational control
(C Control), with an improvement in ability
to respond to emotional stimuli in a rational
manner as the test proceeds (C Trend).

The failure of negativism to correlate with
the W, P, or M Quantity signs is noteworthy,
since the M Type sign is so strongly related. In
the general sample the M Type sign correlates
with W (.56), P (.32), and M Quantity (.63).
A clue to this discrepancy may lie in the one
r that reverses prediction; negativistic Ss have
more Hd than H. Of Ss who combine assertive
M with high Hd, 58 per cent are negativistic,
while only 13 per cent of Ss not showing this
pattern are negativistic; the r of this pattern
with negativism is .77 (p < .01). In other
words, negativistic Ss have difficulty in self-acceptance,
but, unlike noncoping Ss, they
do not react to this difficulty by constriction;
instead, they show a heightened awareness of
their own feelings (Animal Type) and are self-assertive,
as they are in the autokinetic situation.

A negativistic reaction is in itself a sign of 
insecurity; a person who feels secure about his 
own judgments should feel no need to deliberately
resist influence. The heightened awareness
that these Ss have of the "danger" of
being influenced suggests that unsureness 
about their own judgments makes them feel 
that they may be susceptible to influence. 
Negativistic Ss, however, react to their in- 
security very differently from the way that 
noncoping Ss do. Noncoping Ss accept outside
support passively and show constriction
and little acceptance of their own impulses, 
which suggests that they deal with the low 
self-acceptance indicated by Hd with a repres- 
sion of their own drives. Negativistic Ss, on the 
other hand, seem to overcompensate; they 
remain aware of their own drives, have a 
highly assertive self-image, and “fight back" 
in the autokinetic situation when the con- 
federate’s responses are introduced; it seems 
likely that their high Hd content reflects an 
actively self-critical attitude, not a repression 
of self. 

Negativistic Ss, then, while they assert 
themselves rather than repress their own 
drives, do show conflict in their attitudes about 
themselves. It may be tentatively suggested 
that it is this conflict which makes them less 
able to cope creatively (W, M Quantity) or 
even adequately (P) with the Rorschach task 
than they might otherwise be expected to 
on the basis of their active M; their coping is
not, however, poorer than average. 

Change of judgments. The correlation of 
change with each of the Rorschach signs can, 
in most cases, be seen as the resultant of the
correlations of the three predisposing factors
with the signs. The one clear exception is the
W sign, whose r with change is higher than
would be predicted on the basis of the predisposing
factors. That is, while the ability to
form more complex integrations from the Rorschach
figures reflects a tendency that reduces
the likelihood of S’s being influenced,
this tendency does not emerge clearly in the
interview.

Other studies. As noted above, other studies of 
the relationship between Rorschach test results
and behavioral measures of influencibility
have led to equivocal results. A recent
example is Steisel's study (12), which failed to
find any significant correlations between five
measures of suggestibility, including three
autokinetic measures, and twelve Rorschach
measures. Investigation of some differences
between his study and the present one may 
help to explain the discrepant results; only his 
results dealing with the autokinetic situation 
will be considered. 

1. Steisel uses reaction time for seeing move- 
ment and distances reported as measures of 
suggestibility, assuming that more suggestible 
Ss may see more movement and see it more 
quickly. It is well established, however, that 
the perception of autokinetic movement does 
not depend on suggestion (2, 3). Furthermore, 
the work of Voth (13) and Sexton (11) indi- 
cates that Ss who see much movement have 
personality characteristics that are associated 
with resistance to suggestion (e.g. self-ab- 
sorbed, withdrawn). The present study sup- 
ports the notion that more suggestible Ss 
would have longer reaction times and see less 
movement; the multiple R of change of judg- 
ments with these two measures is .63 (p < .01). 

2. In the present study influence came from 
a student who was presented as another S, 
while Steisel gave the planted judgments him- 
self, presenting them as the “correct dis- 
tances.” When a fellow student is used, S's 
own predispositions determine whether or not 
he accepts the other fellow's judgments as 
more authoritative than his own; this would 
seem to be an important aspect of suggesti- 
bility. In Steisel’s study the relationship is not 
between peers; obviously E knows the correct
answer. In the present study the interviews 
showed that influenced Ss did not see more 
movement, but concluded that their scale 
had been wrong and therefore changed it. 
The E's report of the “correct distances" would 
have provided an objective, rational basis for 
changing scale without necessarily involving 
suggestibility. 

3. Steisel uses a measure of change of judg- 
ments alone, and does not try to learn, by 
interview or other means, the different bases
for change. This may lead to the conclusion
that a measure is unrelated to change, when
it may actually be related to personality fac- 
tors that make a person susceptible to in- 
fluence (cf. Table 1, C Trend). Even more 
important is the effect of such information on 
the choice of Rorschach factors. In the present 
study, the interview data highlighted the 
importance of S’s attitudes toward himself and 
his own drives as well as his ability to cope 
with a new, unstructured situation unaided. 
This knowledge led to the addition of relevant 
Rorschach measures that might otherwise 
have been overlooked (W, P, M Type, Hd, 
Animal Type). 

4. Steisel kept the Rorschach measures as
continuous variables, while the present study 
dichotomized them, scoring each S for the 
presence or absence of certain signs. It seems 
more significant to know how many Ss in a 
given group fail to reach a criterion (e.g., 2 
M), than to know the total M in the group, 
since the surplus of M shown by one S does 
not add to the inner resources of the others 
in his group.

Another reason for using signs is that the 
interpretation of some Rorschach measures is 
nonlinear; use of a sign isolates those Ss show- 
ing the particular aspect of the variable that 
is relevant. For example, Steisel uses F% as a 
measure of conscious control. While it is true 
that a very low F% may indicate poor control, 
a very high F% suggests rigidity and con- 
striction (6). The F% sign used in the present 
study was designed to isolate constricted Ss 
and did not deal with the issue of poor vs. 
adequate control. Another example of a cut- 
off point that isolates one aspect of a variable 
whose interpretation is nonlinear is given 
above in the description of the P sign. 
5. In other cases Steisel also considers only
one interpretation of a variable which has 
other accepted interpretations. For example, 
he suggests that a large number of responses 
may reflect suggestibility; a large R, however, 
may also be interpreted as evidence of crea- 
tivity or of compulsiveness, both tendencies 
that would work against suggestibility. 
He uses A% as a measure of constriction.
Although this is an accepted interpretation, it
is also considered a sign of immaturity, a factor
that cannot be overlooked when testing college
students. The Animal Type sign was designed
to overcome this problem. Although
uninfluenced Ss concentrate on mammals
rather than nonmammals, the mammalian
content of some was primarily human whereas
for others (presumably less mature but, on
other evidence, no more constricted) it was
primarily animal.

6. It is likely that potential correlations in
Steisel’s study were obscured by his use of
male and female Ss. Significant sex differences
are often found (8, 14), and the relationships
between suggestibility measures and personality
data are sometimes different for the
two sexes (14). In an unpublished study made
by the writer, a measure that seems to involve
suggestibility (influence of attitudes toward
conclusions on ability to solve syllogisms) was
significantly associated with low self-esteem in
male college students and (not significantly)
with high self-esteem in female students. Since
our culture places very different values on
self-sufficiency in the rearing of boys and girls,
it is likely that a lack of self-sufficiency, or
suggestibility, is related to different personality
patterns in the two sexes. In Steisel's
study there is the further question of the different
implications that influence exerted by
a male E would have for male and female Ss.


CONCLUSION 


The findings of the present study indicate
that, while the relationships are not simple,
certain Rorschach measures are clearly related
to reactions to the autokinetic situation. The
three factors found through the interview, each
of which is highly related to actual change of
judgments but has its own distinct pattern of
relationships with the Rorschach measures,
suggest three personality variables that are
important determinants of the single behavioral
variable of change of judgments.

Of the three, responsiveness seems
closest to what is usually meant by suggestibility,
and further investigation might prove
it to play a part in a wide variety of suggestibility
measures, particularly those involving
interpersonal relationships. Negativism would
probably also prove relevant in a large number
of situations. It may, however, be limited to
instances where the S is conscious that influence
is being exerted, or to ego-involving
situations (the interview indicated that Ss
were highly ego-involved in their attempts to
judge movement "correctly"). The fact that
negativism seems to be an overcompensation 
may mean that such Ss would be more likely 
to be swayed by certain kinds of influence. 
“Coping,” although highly related to change of 
autokinetic judgments, may be unrelated to 
influencibility in other situations that do not 
involve the S's skill in dealing with a task 
without help. There are undoubtedly other 
personality factors that play a part in this as 
well as other experimental situations. 

The existence of different predisposing fac- 
tors may explain why attempts to find a general 
trait of suggestibility have had such varied 
success (8). The lack of correlation often found 
between different suggestibility measures has 
led some experimenters to conclude that there 
is no general factor underlying response to 
suggestion in different situations. If a preliminary
analysis of the more specific personality
factors that play a part in each situation
were made, it might be possible to predict
more accurately which suggestibility measures 
would intercorrelate and which would not. 

The results of the present study indicate 
the importance of obtaining supplementary 
qualitative data to clarify the different mean- 
ings that a particular response may have for 
different individuals, i.e., to make it possible
to approach a genotypical rather than a 
phenotypical classification of behavior. This is 
probably especially important in studies that 
attempt to relate behavioral measures to 
personality data, since the personality de- 
terminants of any single act are more likely to 
be complex than simple.

SUMMARY

The purpose of the present study was to
discover Rorschach measures associated with
change of autokinetic judgments under the
influence of planted judgments given by a
confederate, and to generalize about personality
correlates of response to suggestion.
Forty-five male freshmen at Brooklyn College
were tested in the autokinetic situation and
then given the Rorschach. In the autokinetic
situation, in addition to change, three factors
predisposing Ss to change were ascertained:
adequacy of coping with the initial task,
responsiveness to the confederate, and negativism.
After that, eleven Rorschach indicators
were chosen.

Change of judgments was found to relate
highly to the Rorschach measures; each of
the predisposing factors had a distinct pattern
of relationship with the Rorschach, in all
cases well beyond chance.


Possible interpretations of the obtained pat- 
terns of relationships are discussed. Possible 
reasons for the failure of other studies to find 
such relationships are suggested. 


REFERENCES 


1. BECK, S. J. Rorschach’s test. Vol. I: Basic processes.
New York: Grune & Stratton, 1944.

2. CRUTCHFIELD, R. S., & EDWARDS, W. The effect of
a fixated figure on autokinetic movement. J.
exp. Psychol., 1949, 39, 561-568.

3. GRAYBIEL, A., & CLARK, B. The autokinetic illusion
and its significance in night flying. J. aviat.
Med., 1945, 16, 111-151.

4. HERTZMAN, M., & PEARCE, JANE. The personal meaning
of the human figure in the Rorschach.
Psychiatry, 1947, 10, 413-422.

5. KELMAN, H. C. Effects of success and failure on
"suggestibility" in the autokinetic situation.
J. abnorm. soc. Psychol., 1950, 45, 267-285.

6. KLOPFER, B., & KELLEY, D. M. The Rorschach
technique. Yonkers, N. Y.: World Book Co.,
1946.

7. LINTON, HARRIET B. Relations between mode of
perception and the tendency to conform. Unpublished
doctor's dissertation, Yale Univer., 1952.

8. MURPHY, G., MURPHY, LOIS, & NEWCOMB, T. M.
Experimental social psychology. New York:
Harper, 1937.

9. RORSCHACH, H. Psychodiagnostics. New York:
Grune & Stratton, 1942.

10. SCHUMER, FLORENCE C. Some behavioral correlates
of Rorschach human movement responses.
Unpublished doctor's dissertation, Yale
Univer., 1949.

11. SEXTON, M. C. The autokinetic test: its value in
psychiatric diagnosis and prognosis. Amer. J.
Psychiat., 1945, 102, 339-402.

12. STEISEL, I. M. The Rorschach test and suggestibility.
J. abnorm. soc. Psychol., 1952, 47, 607-614.

13. VOTH, A. C. Individual differences in the autokinetic
phenomenon. J. exp. Psychol., 1941, 29,
306-322.

14. WITKIN, H. A., HERTZMAN, M., LEWIS, HELEN B.,
MACHOVER, KAREN, MEISSNER, PEARL B., &
WAPNER, S. Perception and personality. New
York: Harper, in press.


Received November 5, 1952. 


LEVEL-OF-ASPIRATION BEHAVIOR AND FEELINGS OF ADEQUACY 
AND SELF-ACCEPTANCE


LOUIS D. COHEN 


Duke University 


IN THE process of determining level of
aspiration, considerable information has
been made available about the complicated
behavior involved in goal-level setting,
methods of goal modification following success
and failure, and persistence in seeking to attain
goals (1). In several of these studies investigators
have reported stable relationships
between the height of goal-level setting and
certain personality variables, notably feelings
of insecurity (2, 6, 9, 10).

The present study was undertaken in an
attempt to pursue the studies of the variables
underlying explicit goal statements and level
of aspiration. Using as a basic variable the
concept of self-esteem or self-regard which has
been described previously (4, 7), and redefining
certain aspects of this broad variable, we denoted
two of them, feelings of adequacy and
self-acceptance. As a means of attaining an
estimate of these variables within the personality
of subjects, we used data from Rorschach
examination. It was our hope that further
understanding of the implicit factors in
goal setting might be uncovered in this process.


METHOD 


Subjects. The subjects (Ss) were 50 patients selected 
from the medical and surgical wards of Duke Hospital. 
There were 23 males and 27 females. The mean age of 
the men was 29.5, and of the women, 34.5 years. The 
mean age of the group was 32.2 years. The predominant 
diagnoses were hypertension, asthma, and psycho- 
neurosis mixed. 

Procedure. Each S was given a Rorschach and a 
series of trials on the Rotter aspiration board (8). The 
Rorschach protocols were scored according to Klopfer 
and Kelley (5), and the Rotter aspiration board results 
were analyzed according to a procedure previously 
described by Cohen (1). 

The two variables, feelings of adequacy and self- 
acceptance, were denoted in detail, and a rating scale 
with detailed description of each point in the continuum 
was outlined. Feelings of adequacy had five steps, as 
follows: (a) feels adequate to all situations, (b) feels 
adequate to some situations, (c) feels inadequate to 
most situations, (d) feels inadequate in almost every 
situation, (e) an unreal feeling of adequacy. Self-acceptance
had four steps: (a) accepts self unequivocally,
(b) accepts self more than rejects self, (c) rejects self
more than accepts self, (d) rejects self unequivocally.

Each of the Rorschach records, with identifying
data removed, was rated on these two variables by the
present reporter. From the 50 records, two samples of
ten protocols selected at random were given to two
experienced psychologists.¹

Correlation studies between the ratings of the two
examiners and those of the present reporter gave a
contingency coefficient of .53 for feelings of adequacy,
reliable at the .05 level of confidence, and of .75 for the
second variable, self-acceptance, reliable at the .01
level. These results are strongly suggestive of the reliability
of the ratings of the two variables.

Level of aspiration was limited for this study to the
average goal discrepancy score. This score is the average
of the differences between performance and subsequent
aspiration bid for all the test trials. The range
of average goal discrepancy scores on the level-of-aspiration
task was divided into seven step intervals,
with highly positive scores at one extreme and highly
negative ones at the other. The middle range was a
plus/minus score. The entire range read 3 plus, 2 plus,
1 plus, plus/minus, 1 minus, 2 minus, 3 minus.
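The average goal discrepancy score defined above is a mean of paired differences. The sketch below is illustrative only: the function name and list inputs are assumptions, and the sign convention (bid minus preceding performance, so that bidding above one's last score is positive) is assumed rather than stated in the text. The cutting points for the seven intervals are not given in the source and are therefore not modeled.

```python
def average_goal_discrepancy(performances, next_bids):
    """Mean difference between each performance and the aspiration bid
    that follows it.

    performances[i] is the score on trial i; next_bids[i] is the bid made
    for the following trial. A positive result means bids ran above the
    preceding performance (assumed sign convention).
    """
    if not performances or len(performances) != len(next_bids):
        raise ValueError("need equally long, non-empty score and bid lists")
    differences = [bid - score for score, bid in zip(performances, next_bids)]
    return sum(differences) / len(differences)
```

For instance, scores of 10, 12, and 8 followed by bids of 12, 13, and 10 give differences of 2, 1, and 2, for an average of about 1.67.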

RESULTS

As may be noted from Table 1, the linear
relationships between goal-level setting and
feelings of adequacy and self-acceptance were
not significant. However, the data were
examined by the use of eta, and while no
reliable relationship was found between feelings
of adequacy and goal-level setting, a
reliable relationship (better than .05 level of
confidence) was found between self-acceptance
and goal-level setting. In examining the scatter
diagrams of the relationship of goal-level
setting to self-acceptance we noted a parabolic-like
curve, which suggests that (a) both very
high goal setting and very low goal setting are
significantly related to a high degree of self-rejection;
(b) a lesser degree of self-rejection
is related to both medium-high and medium-low
goal setting; and (c) self-acceptance is
related to low-positive or low-negative goal
setting.
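The contrast between a near-zero linear correlation and a sizable eta can be seen in a small sketch. The code below is a generic illustration of the correlation ratio, not the author's computation; the function names and the toy data are assumptions, with goal-level intervals coded as hypothetical integers and self-rejection as a hypothetical rating.

```python
import math
from collections import defaultdict


def pearson_r(xs, ys):
    """Ordinary product-moment correlation."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def correlation_ratio(categories, values):
    """Eta: square root of between-category over total sum of squares
    of `values`, grouped by `categories`."""
    groups = defaultdict(list)
    for category, value in zip(categories, values):
        groups[category].append(value)
    grand_mean = sum(values) / len(values)
    ss_total = sum((v - grand_mean) ** 2 for v in values)
    if ss_total == 0:
        return 0.0
    ss_between = sum(
        len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups.values()
    )
    return math.sqrt(ss_between / ss_total)


# Toy U-shaped data: extreme goal levels go with high self-rejection.
goal_level = [-1, -1, 0, 0, 1, 1]
self_rejection = [2, 2, 0, 0, 2, 2]
```

On these toy data the linear r is zero while eta is at its maximum, which is the kind of pattern a parabolic-like scatter diagram produces.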


¹ Grateful acknowledgment is made of the assistance
of Dr. Burke M. Smith, Chief Psychologist, VA Hospital,
Roanoke, Virginia, and Dr. William C. Hall,
Chief Psychologist, VA Hospital, Mt. Alto, Washington,
D. C.


TABLE 1
RELATIONSHIP BETWEEN FEELINGS OF ADEQUACY AND
SELF-ACCEPTANCE RATINGS (RORSCHACH) WITH
GOAL-LEVEL RATINGS (LEVEL OF ASPIRATION)
(N = 50)


GOAL-LEVEL RATINGS 
(LEVEL OF ASPIRATION) 


r Nuz Ney € 
Feelings of Ade- {—.037 076 0 
quacy 275 0 
Self-Acceptance .298  .408 .493 dA 
* p < .05.
DISCUSSION 


Why should we have failed to find a linear 
relationship between our variables and goal- 
level setting? We believe that this result might 
have been anticipated on the basis of previous 
investigations. If, as Holt (3) has indicated, 
goal statements are defensive reactions rather 
than indications of motivation to achieve, 
then extremes in goal-level setting should 
suggest extremes of defensive behavior. Sears 
(9, 10) has indicated that in her group of 
children the insecure protected themselves 
by the use of very high or very low goal-level 
setting. Only the very secure children could 
state goals that were closely related to their 
actual achievement. It might therefore be 
assumed that a third variable, such as realism 
or objectivity of judgment, influenced goal 
statements in the direction of adherence to 
the actual performance. Now, if a third and
important variable was involved, then indeed
a straight line could not represent the relationship.

We did find a significant curvilinear relationship
between goal-level setting and self-acceptance,
but none with feelings of adequacy.
These results indicate that the attitude
toward the self is in fact an important correlate
to high or low goal setting. The difficulty
in this conception is that feeling of adequacy
as a variable was presumed to have the
same general properties as self-acceptance, a 
representative of the attitude toward the self. 
It may be important, therefore, to look at our 
description of these variables to see where the 
difficulties may have arisen. 

Feelings of adequacy presumably permitted
a rating of self-confidence, willingness to un-
dertake new tasks, and expectations of success
and failure. One might expect that this would 
reveal information relevant to goal-level set- 
ting. That it did not may be due to the diffi- 
culty of drawing inferences about this variable 
from Rorschach data. We note that the reli- 
ability of this variable was lower than that of 
self-acceptance, suggesting that raters had 
greater difficulty in agreeing about subjects 
in this area. Our self-acceptance variable was 
described primarily in terms of inner conflicts 
in which discrepancies between goals and ideal 
goals were the major determinant of the rating. 
It would seem that ratings of this variable
from Rorschach data were more successful. 
We wonder then: is it the definition; is it the 
method we used; or is there no relationship 
between feelings of adequacy and goal-setting 
behavior? 

Perhaps the most outstanding finding in 
this study is the relationship between self-
rejection and the height of goal-level setting.
Both very high and very low goal setting re- 
flect self-rejection. Keeping in mind the find- 
ings of Sears, we may note that she delineated 
a group of students in her study who set very 
high goals and for whom the statement of high 
goal was the goal itself, and another group in 
which very low goal setting was a means of 
self-protection. Both of these groups consisted 
of children who were insecure and uncertain of 
their own abilities. This comes out quite 
clearly in our material. 

The question that arises as a result of this 
analysis is, “What determines whether a basic 
feeling of insecurity and of self-rejection is 
dealt with by high goal setting or by low goal 
setting?” There is little that can be said from 
these data to help clarify this point. It would 
certainly seem to warrant further investigation.

SUMMARY 


The purpose of this study was to investigate 
goal-level setting within a level-of-aspiration 
framework by studying two variables, feelings 
of adequacy and self-acceptance, which were 
presumed to relate to the implicit factors 
determining goal-level setting. These were 
studied through data secured through the 
Rorschach. It was found that goal-level setting 
was not related to feelings of adequacy as
defined in this study. However, a curvilinear
relationship was discovered to exist between 
goal-level setting and self-acceptance. The 
results indicated that both very high goal 
setting and very low goal setting were related 
to self-rejection, and that only those who could 
accept themselves were able to use low positive 
goal setting.


REFERENCES 


1. COHEN, L. D. Patterns of response in level of
aspiration tasks. Educ. psychol. Measmt, 1950,
10, 664-684.

2. GRUEN, E. W. Level of aspiration in relation to
personality factors in adolescents. Child Develpm.,
1945, 16, 181-188.

3. HOLT, R. R. Level of aspiration: ambition or
defense? J. exp. Psychol., 1946, 36, 398-416.

4. JAMES, W. The principles of psychology. New York:
Holt, 1923.

5. KLOPFER, B., & KELLEY, D. M. The Rorschach
technique. Yonkers, N. Y.: World Book Co.,
1942.

6. KLUGMAN, S. F. Emotional stability and level of
aspiration. J. genet. Psychol., 1948, 38, 101-119.

7. MCDOUGALL, W. Outline of psychology. New York:
Scribner’s, 1923.

8. ROTTER, J. B. Level of aspiration as a method of
studying personality: II. Development and evalu-
ation of a controlled method. J. exp. Psychol.,
1942, 31, 410-422.

9. SEARS, PAULINE S. Level of aspiration in aca-
demically successful and unsuccessful children.
J. abnorm. soc. Psychol., 1940, 35, 498-536.

10. SEARS, PAULINE S. Level of aspiration in relation
to some variables of personality: clinical studies.
J. soc. Psychol., 1941, 14, 311-336.


Received November 10, 1952. 


CRITICAL FLICKER FREQUENCY FOR PARETICS 
AND SCHIZOPHRENICS 


R. P. IRVINE¹


Stanford University 


SEVERAL experiments have been published
in the past five years dealing with the
relationship of brain pathology and
CFF. Results of experiments stemming from
the hypothesis that CFF is in part a function
of organic cerebral pathology are conflicting.
It was the purpose of the present study to 
compare the CFF of two groups of hospitalized 
psychotic patients: general paretics with 
known organic pathology and schizophrenics 
with no known neural disorder. 


METHOD 


Apparatus. A General Radio Company Strobotac, 
Type 631-B, provided the flickering light for this 
study. To reduce auditory cues, a special black sound- 
proof container was constructed for the instrument. 
It consisted essentially of two boxes, one within the 
other and separated by insulating material. The fre-
quency control, a geared knob and dial arrangement,
was removed from the original unit and extended by 
means of an electrical cable to the examiner’s table 
approximately 8 feet away. This made remote control 
of the device possible. 

The window of the outer container was covered with 
a sheet of black paper in which a hole 1 7/8 inches in
diameter had been cut. One piece of frosted glass was 
placed immediately behind this aperture. As it was 
desired to use foveal stimulation only, a Wratten No. 
29 red gelatin filter was taped to the rear surface of the 
frosted glass. This filter is virtually opaque at fre-
quencies below 580 mμ but allows free transmission of
light at the red end of the spectrum. Three sheets of 
lightweight white typing paper, placed between the 
filter and the light source, aided in diffusion and served 
to reduce the brightness of the light. The intensity of 
the resulting red stimulus was .08 millilambert when 
measured with a Luckiesh-Taylor brightness meter. 

The visual angle subtended was approximately one 
degree, obtained by placing the subject approximately 
nine feet from the aperture. This distance allowed the 
subject freedom of movement within a limited area 
without significantly affecting the size of the visual 
angle. Binocular vision was used. 
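
The geometry of this arrangement can be checked with a short calculation. The sketch below is illustrative only: it assumes an aperture diameter of 1 7/8 inches (one reading of the figure given above, not a confirmed value) and takes "approximately nine feet" as exactly 108 inches. The function and constant names are hypothetical.

```python
import math


def visual_angle_deg(size, distance):
    """Visual angle in degrees subtended by an object of the given size
    viewed head-on from the given distance (same units for both)."""
    return math.degrees(2 * math.atan(size / (2 * distance)))


# Assumed values, not confirmed by the report:
APERTURE_IN = 1.875    # assumed aperture diameter, in inches
DISTANCE_IN = 9 * 12   # "approximately nine feet", in inches

angle = visual_angle_deg(APERTURE_IN, DISTANCE_IN)
print(f"visual angle at 9 ft: {angle:.2f} degrees")

# A subject shifting a foot nearer or farther changes the angle only a little.
for distance in (8 * 12, 10 * 12):
    print(distance, "in:", round(visual_angle_deg(APERTURE_IN, distance), 2))
```

Under these assumptions the angle comes out very close to one degree, and a shift of a foot in either direction alters it by roughly a tenth of a degree, which agrees with the statement that limited movement did not significantly affect the visual angle.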


1 The author is indebted to Quinn McNemar and
C. L. Winder of Stanford University, and to Paul 
McReynolds and the staff of the Veterans Administra- 
tion Hospital at Palo Alto for their kind assistance and 
valuable suggestions. This paper has been abstracted 
from a thesis submitted in partial fulfillment of the 
M.A. degree at Stanford University. 

This study was made while the author was a VA 
Trainee in the Veterans Administration, Palo Alto. 




A small pilot light mounted behind the Strobotac 
dial was the only source of illumination in the experi- 
mental room. This light was directed away from the 
subject and toward the examiner, who was seated at a 
table slightly in front and to the side of the subject;
thus it was possible for the subject to see the examiner 
at all times during the experiment. 

Subjects and procedure. Fifty-two male, hospitalized 
patients at the Palo Alto Veterans Administration 
Hospital were subjects (Ss) for this study. Twenty-six 
paretics and 26 schizophrenics were matched as closely 
as possible in age, length of hospitalization, and degree 
of psychotic involvement as subjectively rated by ward 
psychiatrist, nurse, and head attendant. The Ss came 
from three wards with each ward contributing an 
equal number of schizophrenic and paretic patients. 

All testing was done between 8:30 A.M. and 10:00 
A.M. Each S was brought individually into the lighted
experimental room by the examiner (E) and seated in
a comfortable chair facing the Strobotac. After sufficient 
rapport was established, S was directed to look at the 
flickering light. 

The room light was then turned off, E suggesting 
that the red light would be easier to see in the dark. 
The frequency of the flickering light was then slowly 
increased until fusion was reached, and S was asked to 
describe what he saw. 

When it was certain that he experienced the phe- 
nomenon of fusion, S was given four practice trials, 
two ascending followed by two descending, and was 
asked to tell E when the light stopped blinking on 
ascending trials, or when the light first began to flicker 
on descending trials. 

Since some Ss were distracted at the beginning of 
the session as a result of their hallucinations, fantasies, 
or inability to concentrate, it became necessary in 
these cases to maintain their attention by constantly
repeating the directions. As the frequency was slowly
increased from flicker to fusion, statements were peri-
odically made by E such as, “You tell me when it
stops moving,” or “Remember, let me know when it
quits flickering.” Some Ss indicated at suspiciously low
frequencies that they were experiencing fusion. Checks 
were made on these judgments by raising the frequency 
still higher and then asking the "slanted" question, 
“Now it’s flickering again, isn't it?” When it was found 
that S was confused, the instructions and the demon-
stration were repeated. The criterion of adequate com-
prehension of the task was low variability of the prac-
tice trials.

To avoid biasing errors, E did not look at the dial 
except to read the value at the termination of each 
trial. Because of this “blind” frequency control, each
new trial was begun at a different dial setting. 

Both the dark adaptation period and the adaptation
to the flickering light took place during the demonstra-
tion and the practice period. On the average, approxi-
mately five minutes were devoted to this purpose.

TABLE 1
PARETIC AND SCHIZOPHRENIC GROUP COMPARISONS OF
CFF MEAN, VARIABILITY, AND RELIABILITY OF
MEASUREMENT, TOGETHER WITH AGE DATA USED
IN MATCHING

               AGE                       CFF
                                      GROUP     WITHIN-INDIVIDUAL
GROUP      RANGE   MEAN     MEAN     VARIANCE       VARIANCE        RELIABILITY
Paretic    50-60   55.3     17.4      15.21           1.42             .981
Schizo.    51-60   56.1     21.4       9.12        [illegible]         .982

Following the practice trials, three ascending and 
three descending experimental trials were given al- 
ternately. CFF was computed as the average of these 
six. 


RESULTS AND DISCUSSION 


The results of the experiment, together with 
the age data for Ss, are recorded in Table 1. 




The reliability coefficients given in the last 
column were computed by a variance tech-
nique.²

Inspection of the table shows a difference 
of four cycles per second between the mean 
CFF of the paretic and the schizophrenic 
groups. This difference is statistically signifi- 
cant (p < .001). Neither the CFF group 
variance nor the within-individual variance 
differs significantly for the two groups. 
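
The reported significance level can be roughly checked from the figures in Table 1. The sketch below is not the author's computation: the report does not say which test was used, and since the groups were matched a paired test may well have been applied. It assumes an independent-groups t ratio with 26 Ss per group and treats the tabled group variances as unbiased estimates. The function name is illustrative.

```python
import math


def t_independent(mean1, var1, n1, mean2, var2, n2):
    """t ratio for the difference between two independent group means,
    using the standard error built from each group's variance."""
    standard_error = math.sqrt(var1 / n1 + var2 / n2)
    return (mean2 - mean1) / standard_error


N_PER_GROUP = 26  # 26 paretics and 26 schizophrenics

# Means and group variances as given in Table 1.
t = t_independent(17.4, 15.21, N_PER_GROUP, 21.4, 9.12, N_PER_GROUP)

# Ratio of the larger to the smaller group variance.
f_ratio = 15.21 / 9.12

print(f"t = {t:.2f}, variance ratio = {f_ratio:.2f}")
```

Under these assumptions t is a little over 4 with 50 degrees of freedom, which exceeds the two-tailed .001 critical value of roughly 3.5 and so agrees with the reported p < .001. The variance ratio of about 1.7 is modest, in keeping with the statement that the group variances do not differ significantly.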

In conclusion, the results of this experiment 
tend to verify the hypothesis that CFF is in 
part a function of organic pathology. It has 
been shown that lower CFF is associated with 
cerebral neural malfunctioning resulting from 
paresis. 


Received November 17, 1952. 


2 McNemar, OLGA W. The ordering of individuals 
in critical flicker frequency under different measure- 
ment conditions. J. Psychol., 1951, 32, 3-24. 




RIGIDITY OF ATTITUDES REGARDING PERSONAL HABITS AND 
ITS IDEOLOGICAL CORRELATES 


ROBERT MERESKO, MANDEL RUBIN, FRANKLIN C. SHONTZ 
Western Reserve University 


AND WILLIAM R. MORROW 
Bard College 


PREVIOUS studies (1, 2, 8) have shown that
fascist or antidemocratic trends represent
a total pattern of ideological and
other personal tendencies with respect to which
people in modern society show a degree of inner
consistency. The present study is concerned
with one aspect of that pattern, namely rigidity.
Specifically, we have sought to measure
rigidity as manifested in attitudes regarding
personal habits, and to determine the relationship
between such rigidity and antidemocratic
ideology.

Frenkel-Brunswik (1, ch. 9-13; 6) reports
that in interviews dealing with major personal
relationships, vocational adjustment, and
outlook, etc., people scoring in the most
antidemocratic quartile on the Levinson
Ethnocentrism (E) Scale (1, ch. 4; 10) revealed
significantly greater rigidity of thinking than
subjects scoring in the extreme democratic
quartile. Rokeach (16) demonstrated that
individuals with a rigid, antidemocratic approach
to social problems (as measured by
the E Scale) also tend to solve nonsocial problems
more rigidly than persons with a more
democratic social outlook.

The present study is designed to test two
hypotheses: (a) Under present social conditions,
urban, middle-class American college
students show substantial individual consistency
with respect to rigidity of attitudes
regarding personal habits. (b) Such rigidity is
significantly associated with antidemocratic
ideology.


WHAT IS “PSYCHOLOGICAL RIGIDITY?”


By psychological rigidity we mean a per-
son's “resistance, or lack of readiness, to be
influenced by motivationally relevant stimula- 
tion in such a way as to adjust to his environ- 
ment as effectively as his behavior-repertory 
permits.” This definition is meant to include 
both defensive inhibition (resistance) and
organic deficit (lack of readiness); both re-
actions to external stimulation and reactions
to inner processes; both rigid cognitive func- 
tioning and/or rigid action. The definition 
excludes purposeful ignoring of distractions 
that are “motivationally irrelevant,” as well 
as what Kounin (9) describes as “behavior 
rigidity” based on limited “differentiation” 
of “behavior-repertory” (including both knowl- 
edge and abilities). 

We believe that the above formulation has 
greater unifying generality than other con- 
ceptions. It embraces "inappropriate per- 
severation of a set” as "resistance, or lack of 
readiness, to be influenced" by a change in the 
situation. It also embraces "resistance, or lack 
of readiness, to be influenced" by relevant 
features of one situation considered by itself: 
“intolerance of ambiguity" (7) and “narrow- 
minded" concept formation (17, 18) based on 
resistance to surveying freely a whole that 
contains threatening features; cognitive 
“flight” from the crux of a situation into focus 
on isolated details, or into vagueness, formalis- 
tic overgeneralities, or mysticism, and so on. 

The above formulation is also, we believe, 
more field theoretical and unifying than the 
Kounin-Lewin (9, 11) conception of "inner 
rigidity" as degree of "impermeability" of 
“boundaries” of intrapersonal “regions” (even 
if their conception is extended to include de- 
fensive impermeability, as well as the organic 
deficit of feeblemindedness): (a) Whereas the 
Kounin-Lewin conception tends to statically 
reify psychological processes into a structure 
of “storehouses” (“regions”), our definition 
is formulated in process terms as such. (b)
Whereas the Kounin-Lewin conception seems 
to involve an atomistic tendency to assign 
“local determination” to the “boundaries”
of assumed parts of the person, our formula-
tion refers holistically to the person’s adjust-
ment to his environment. That the difference
in formulation is not merely verbal is suggested
by applying it to Kounin’s (9) ingenious 
experiment in which feebleminded subjects 
made fewer errors of perseveration than 
equated normal children when a simple task 
was altered in a very nonessential feature 
(raising instead of depressing a lever to secure 
colored marbles which were then to be placed 
in like-colored compartments). The normal 
children, more than the feebleminded subjects, 
apparently saw the change as a “motivation-
ally irrelevant” one which they tended to
“forget.” Our formulation would lead to an 
additional prediction not deducible from the 
Kounin-Lewin conceptions, viz., that as the 
changed task requirement continues, the 
normal children would show greater capacity 
to modify their indifference and correct their 
“errors.” Kounin’s own findings seem to 
support this expectation.
METHOD 
The Questionnaire 
The method used in the present study to measure 
rigidity of attitudes regarding personal habits was a 
Likert-type (12) attitude scale (the RAPH Scale).
The scale consists of a series of statements which the
subject is asked to rate on a six-point scale ranging 
from complete agreement to complete disagreement. 
In scoring, responses are converted into a seven-point 
scale by assuming a hypothetical middle category, 
which is used when an S omits rating a given item. 
On the basis of item analysis of a preliminary scale of 
32 items, a final scale of 20 items was constructed.
It should be emphasized that we are here measuring
verbal ideology regarding personal habits, not overt 
action. While we hypothesize that overt action shows a 
significant degree of consistency with the corresponding 
ideology, we also recognize that there is probably a 
range of inconsistency between ideas and action.
The 20 items of the revised RAPH Scale, presented 
below, are grouped according to which aspect of rigid 
attitudes regarding personal habits is primarily reflec-
ted in the given item. The two main groupings are (a)
opposition to change as such, and (b) intolerance of 
ambiguity. Many items, of course, reflect more than 
one aspect in their content. All but two items are nega-
tive, i.e., agreement is scored high or rigid. Items 1 and
14 are positive and are scored oppositely. 
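
The scoring rule just described can be sketched in code. This is a minimal illustration under stated assumptions, not the authors' scoring procedure: the text specifies only a six-point rating converted to a seven-point scale with a hypothetical middle category for omissions, so the -3 to +3 response coding and the 1-to-7 score range used here are assumptions, and all names are hypothetical.

```python
# Items keyed in the reverse direction, per the text (items 1 and 14).
POSITIVE_ITEMS = {1, 14}
MIDDLE = 4  # hypothetical middle category, assigned to omitted items


def score_item(item_no, response):
    """Convert one response to a 1-7 score on which 7 is most rigid.

    `response` is assumed to be coded -3..-1 (disagreement) or +1..+3
    (agreement); None marks an item the subject omitted.
    """
    if response is None:
        return MIDDLE
    if response not in (-3, -2, -1, 1, 2, 3):
        raise ValueError(f"invalid response: {response!r}")
    score = response + MIDDLE      # -3..+3 maps onto 1..7, skipping 4
    if item_no in POSITIVE_ITEMS:
        score = 8 - score          # reverse-keyed: agreement scores low
    return score


def score_scale(responses):
    """Total scale score from a dict of {item_no: response or None}."""
    return sum(score_item(item, resp) for item, resp in responses.items())


# Example: strong agreement with every item, except two omitted items.
answers = {item: 3 for item in range(1, 21)}
answers[9] = None
answers[12] = None
print(score_scale(answers))
```

In this sketch an omitted item always receives the middle score, whichever way the item is keyed, so omissions neither raise nor lower a subject's standing relative to the scale midpoint.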
The majority of the items imply opposition to change 


1 The preliminary form of the scale was administered
to 60 undergraduate college students in a pilot study 
conducted by the authors and Mr. Walter Valeski as 
part of the work for a graduate class conducted by 
Dr. Morrow. The corrected odd-even reliability co- 
efficient was .75. The scale correlated .82 with a 20- 
item form of the F Scale (1, ch. 7). 




as such, expressed in four forms indicated in the follow- 
ing presentation. 

(a) Traditionalism, i.e., tradition for tradition's 
sake. This category includes only one item: 

6. In whatever one does, the “tried and true" ways 
are always the best. 

By its unqualified generalization this motto would
seem to imply resistance to be influenced by stimula-
tion, suggesting possibilities of new and more effective
ways of doing things; it thus fits our definition of psy- 
chological rigidity given above. 

(b) Rule-riddenness, i.e., rules for rules’ sake. This
category includes the largest number of items:

5. A self-respecting person should never permit
himself to relax his vigilance over personal habits;
seemingly minor lapses can easily grow into complete
breakdown of self-discipline.

9. The rules of logic are the rules of life.

10. If I had a new car, I'd always keep it nicely
cleaned and polished.

11. “A place for everything and everything in its
place” is a pretty good motto to live by.

15. One of the major aims of education should be to
give us a few simple rules of behavior to apply in every
situation.

16. The biggest advantage man possesses over lower
animals is his ability to regulate himself and live by
definite and unchanging rules of conduct.

19. Every person should live by a few good and un-
changing rules of conduct; that way he can never go
wrong.

Running through these items is an emphasis on
unchanging rules, with an implied resistance to be
influenced by varying circumstances.

(c) Opposition to any change of judgment. Three items
are included here which formalistically exalt unchange-
ableness of judgment or viewpoint as such, and thereby
imply resistance to be influenced by other viewpoints,
new facts, or changing circumstances.

8. It’s a good idea to have a strong point of view
about things because that makes it easier to decide
what’s wrong or right.

17. A person who seldom changes his mind can usu-
ally be depended upon to have sound and reliable
judgment on matters of importance.

18. Once a person makes up his mind about some-
thing he should stick to his conclusion instead of re-
peatedly rehashing the question.

(d) Opposition to any change of plans. With the
exception of item 1, each of the following items ex-
presses an unqualified opposition to any change of
plans regardless of circumstances or of one’s mood,
thus again implying a resistance to be influenced by
(changes in) stimulation. Disagreement with item 1,
a positive item, has less clear-cut implications. How-
ever, it was hypothesized that within the particular
subculture of contemporary American university
students, dislike of “friends dropping in unexpectedly”
would more often than not imply some rigidity. It
was therefore predicted that this item, too, would cor-
relate with the total scale.

1. I rather like the idea of having friends drop in un-
expectedly at odd hours.

2. I dislike doing anything just on the spur of the
moment.




4. Few things are more upsetting than a sudden
unexpected change of plans.

7. I never start anything I can't finish.

13. Once a person starts going off his budget, even 
by small amounts, he's on the road to financial difficulty. 

20. The only way to make sure that things get done 
right is to set up a definite and fixed schedule and 
never depart from it. 

Several items seem to express primarily “intolerance 
of ambiguity,” manifested in two forms: 

(a) In specific situations. Only one item is included 
here.

3. I don't much like the kind of painting that doesn’t 
tell a story or doesn’t portray something in a clear, 
unambiguous fashion. 

(b) Regarding expectations or plans for the future. 
Of the two items under this heading, special comment is 
called for on item 14. It is similar to item 1 in being a 
positive item and in its specific content and its relation 
to the subculture of the particular population sampled. 

12. The best way to enjoy a vacation is to plan every 
detail carefully before you leave. 

14. I rather like the idea of having my meals at odd
hours and of going to bed when the mood strikes me. 

A point implicit in the discussion of item content 
deserves emphasis here. It is not assumed that a given 
individual’s agreement with any single negative item, 
or disagreement with any single positive item, neces- 
sarily reflects a rigid attitude. What is assumed, how- 
ever, is that a cumulation of responses scored in the 
same direction will tend to reflect what the content of 
the items has in common, i.e., rigidity versus non- 
rigidity of attitudes regarding personal habits. 

It will be observed that six items, 1, 2, 3, 7, 10, and 
14, have a first person singular form, whereas the other 
14 items are grammatically impersonal expressions of 
attitude. This might be thought to reduce the homo-
geneity of the scale as an attitude scale. Closer examina-
tion reveals, however, that four of the “self-descrip-
tion” items, 1, 2, 3, and 14, directly express a like or
dislike as such; hence the self-description is really an 
expression of attitude. And whereas the factual truth 
of the other two items, 7 and 10, is uncertain, the 
phraseology leaves little doubt as to the subject's atti-
tude. Thus we conclude that the total scale can justi-
fiably be treated as an attitude scale.

In order to test our second hypothesis, regarding the 
association between antidemocratic ideology and 
rigidity of attitudes regarding personal habits, a second 
attitude scale was used. This was a 20-item form of the 
F Scale (1, ch. 7) developed by the California Attitude 
Study. The F Scale is designed to measure a series of 
interrelated fascist or antidemocratic ideological 
trends, such as punitive-rejective attitudes toward
underdogs and nonconformists, cynical contempt for
“human nature,” pseudo toughness, submissiveness
and overidentification with power and authority, rigid
conformity to conventional values, submissiveness
toward “fate,” mysticism, etc. The F Scale has been
demonstrated to have satisfactory reliability and to
correlate substantially with a variety of meaningfully
related variables (1, esp. ch. 7).²


2 A series of as yet unpublished studies carried out at
Western Reserve University has yielded significant


Procedure 


Our procedure was to administer the two Scales in 
three different sequences, as alternate forms of a 
questionnaire: Form A presented the F Scale first, 
followed by the RAPH Scale. Form B presented the
RAPH Scale first, followed by the F Scale. Form C
presented the RAPH and F Scale items intermingled 
alternately. 

In each form the attitude scales were preceded by a
set of questions asking the respondent to indicate his 
sex, age, class in school, and similar facts, but not his 
name. 


Subjects 

The three forms were randomly distributed to 188 
college students in each of two beginning courses in 
psychology at one institution and in one at another 
institution. Of these Ss, 31 were freshmen, 89 sopho-
mores, 27 juniors, 6 seniors, and 2 graduate students;
33 Ss did not report their educational level. The age 
range of the total sample was 17 to 51 years; the mean 
age was 22.5 years; the median, 20; the mode, 19. One 
hundred and fifteen Ss were female, and 73 were male; 
40 were married while 148 were not. The major re- 
ligions were represented as follows: Catholic, 40.9 per 
cent; Jewish, 9.6 per cent; Protestant, 45.7 per cent; 
the balance were unspecified. Of the 174 Ss reporting 
family income, 50 per cent reported incomes in the 
$4000-7500 bracket; 20 per cent reported higher in- 
come than that; 30 per cent reported lower incomes, 
though only 6 per cent fell below $3000. 

It is felt that the sample used is probably repre- 
sentative of contemporary urban American college 
students. 


Predictions 

Our predictions are: (a) The Ss’ reactions to the 
RAPH Scale will show substantial individual consist-
ency, as measured by the odd-even reliability coefficient
and by item discriminatory powers. (b) The RAPH Scale 
will correlate significantly with the F Scale. Our analy- 
sis of the data will also throw light on the issue of 
whether varying the order of presentation of such 
attitude scales influences Ss' reactions. 


RESULTS AND DISCUSSION 


Analysis of variance for each scale considered 
separately revealed that the sample could 
be considered homogeneous with respect to 
college class and institution attended, as 
well as form of administration. In the further 
statistical analysis, all Ss were treated as a 


correlations between the E Scale and/or the F Scale 
on the one hand, and, on the other hand, attitude 
scales designed to measure religious conventionalism 
(D. J. Levinson and P. Lichtenberg), religious funda- 
mentalism (W. Alven), authoritarian family-ideology 
(Phyllis Huffman), male-supremacist ideology (A. 
Jansic; G. Nadler and W. R. Morrow), and magical 
thinking (C. Stenger). 


TABLE 1
RAPH SCALE: ITEM DISCRIMINATORY POWERS

No.  ITEM                          MEAN (HIGH)  MEAN (LOW)   DP*    RANK
 1   Friends drop in                  4.60         2.34      2.26    10
 2   Spur of moment                   3.42         1.53      1.89    14.5
 3   Painting unambiguous             5.23         3.32      1.91    13
 4   Sudden change of plans           4.96         3.26      1.70    18
 5   Vigilance over habits            5.40         3.11      2.29     9
 6   “Tried and true” ways            4.40         1.72      2.68     3
 7   Never start, can't finish        4.60         2.72      1.88    16
 8   Strong point of view             5.33         2.00      2.71     2
 9   Rules of logic                   4.58         3.13      1.45    19
10   Car always polished              5.2          4.11      1.18    20
11   Everything in its place          6.27         3.83      2.44     6
12   Vacation: plan every detail      3.67         1.4       1.73    17
13   Going off budget                 4.79         2.75      2.04    12
14   Odd hours                        5.29         3.3       2.18    11
15   Simple rules of behavior         5.54         2.28      3.26     1
16   Advantage: unchanging rules      4.40         1.83      2.57     5
17   Seldom change mind               3.44         1.55      1.89    14.5
18   Stick to conclusion              5.21         2.60      2.61     4
19   Live by unchanging rules         4.94         2.60      2.34     8
20   Fixed schedule                   4.52         2.09      2.43     7


* All DP’s are significant beyond the .01 level. 


single sample from a common population. 
It is a finding of some interest in itself that, 
under the conditions of this study, the se- 
quence of scales had no measurable effect on 
responses. 

The obtained odd-even reliability coefficient 
of the RAPH Scale, when corrected by the 
Spearman-Brown prophecy formula, is .78. 
The item discriminatory powers are reported 
in Table 1; all of them are significant beyond 
the .01 level. Twelve of the 20 discriminatory 
powers are above 2.0. These results are re- 
garded as confirming hypothesis 1. 
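
The two statistics used here can be sketched as follows. This is a minimal illustration, not the authors' computation: the uncorrected odd-even correlation is not reported, so it is back-calculated from the corrected value of .78, and the discriminatory power is assumed to be the difference between the item means of the highest- and lowest-scoring quarters of the sample on the total scale. The function names and the small data set are hypothetical.

```python
from statistics import mean


def spearman_brown(r_half):
    """Step a half-test (odd-even) correlation up to full-test length."""
    return 2 * r_half / (1 + r_half)


def half_r_from_corrected(r_full):
    """Invert the Spearman-Brown formula for a doubled test."""
    return r_full / (2 - r_full)


def discriminatory_power(item_scores, total_scores, fraction=0.25):
    """Mean item score of the top scorers minus that of the bottom scorers.

    Subjects are ranked by total scale score; `fraction` sets the size of
    each extreme group (quartiles are assumed here). Ties in total score
    are broken by original order.
    """
    order = sorted(range(len(total_scores)), key=total_scores.__getitem__)
    k = max(1, int(len(order) * fraction))
    low = [item_scores[i] for i in order[:k]]
    high = [item_scores[i] for i in order[-k:]]
    return mean(high) - mean(low)


# Uncorrected odd-even correlation implied by the reported .78.
r_half = half_r_from_corrected(0.78)
print(round(r_half, 2), round(spearman_brown(r_half), 2))

# Hypothetical data for eight subjects: one item and the total score.
item = [1, 2, 3, 4, 5, 6, 7, 7]
total = [10, 20, 30, 40, 50, 60, 70, 80]
print(discriminatory_power(item, total))
```

On this reading, the tabled means for item 15 (5.54 and 2.28) differ by 3.26, which matches the DP reported for that item.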

The obtained correlation between the RAPH 
Scale and the F Scale is .62, which is significant 
far beyond the .01 level. This result is regarded 
as confirming hypothesis 2. 
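
The strength of this result can be gauged by converting the correlation to a t ratio. The sketch below is an illustrative check, not the authors' reported test; it assumes the standard t transformation of a product-moment correlation with N = 188, and the function name is hypothetical.

```python
import math


def t_from_r(r, n):
    """t ratio for testing a product-moment correlation against zero,
    with n - 2 degrees of freedom."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r * r)


t = t_from_r(0.62, 188)
print(f"t = {t:.2f} with {188 - 2} degrees of freedom")
```

Under these assumptions t is well above 10, whereas a value of about 2.6 would suffice at the .01 level with this many degrees of freedom, so the description "far beyond the .01 level" is apt.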

Thus, our study suggests that urban Ameri- 
can college students show substantial inner 
consistency as individuals in their degree of 
rigidity of attitudes regarding personal habits, 
and that such rigidity tends to be associated 
with fascist or antidemocratic ideology. These 
findings are consistent with other evidence 
(1, 8, cf. also 2) that fascist attitudes tend 
to form a general ideological approach to life 
which is rigid and in a broad sense maladap- 
tive. 

What adaptive function is served by rigidity
of attitudes regarding personal habits as
measured by the RAPH Scale? It is our
hypothesis that such rigidity serves a defen-
sive function in coping with certain threats.
A number of the items directly imply an expec-
tation of threatening consequences for not
following a rigid pattern in one’s personal
habits. Thus, a sudden change of plans is
very “upsetting” (item 4); even minor devia-
tions from “vigilant” strictness in personal
habits contain the threat of “complete
breakdown” (item 5) or of “financial difficulty”
(item 13); without rigidly adhering to a fixed
schedule, one can never be sure that “things
get done right” (item 20). Many other items
indirectly imply such a threat orientation.
It is as if the person holding these attitudes
feels unsure of his capacity to cope with daily
problems in a way that involves more spon-
taneity, more flexible adjustment to circum-
stances, more reliance on his own judgment
instead of on rigid rules. This may be because
the rules have been adopted under pressure
so that any deviation arouses temptations to
throw the rules over altogether and indulge in
“complete breakdown of self-discipline.” Fur-
thermore, such uncertainty may well be based
on insecurity regarding one's social-class status,
with resulting drivenness to maintain “middle-
class respectability” by adherence to a rigid
pattern in one’s personal habits. These hy-
potheses are consistent with the findings of
the California Attitude Study (1), with whose
F Scale our RAPH correlates significantly;
and with major studies of the American social-
class status system (5, 13, 14, 15, 19). They
are also consistent with Brown's (3) recent
demonstration that Ss with a rigid, anti-
democratic ideology approach Rokeach’s (16)
arithmetic problems more rigidly than do
democratically-minded Ss under conditions
of status threat but not under more relaxed
conditions.

Further research might take two directions:
(a) investigating the relationship between
rigidity regarding personal habits in ideology
and in overt actions; (b) investigating the
relationship between rigidity of attitudes
regarding personal habits and class-status
mindedness or class-status anxiety.




SUMMARY 


It was predicted that under present social
conditions, urban American college students
would show substantial individual consistency
in the degree to which they manifest rigidity
of attitudes regarding personal habits (hy- 
pothesis 1). Further, on the basis of previous 
research findings, it was predicted that rigidity 
of attitudes regarding personal habits would 
be significantly associated with acceptance 
of fascist or antidemocratic ideology (hypoth- 
esis 2). 

In order to measure rigidity of attitudes 
regarding personal habits, a Likert-type at- 
titude scale (RAPH Scale) was constructed. 
A revised 20-item form of this scale was ad- 
ministered in a questionnaire, together with a 
20-item form of the F Scale, to 188 under- 
graduate college students, mostly freshmen 
and sophomores. The corrected odd-even 
reliability coefficient of the RAPH Scale was 
.78, and every item showed a significant 
correlation with total scale scores. These 
results were regarded as supporting hypothesis 
1. The RAPH Scale correlated .62 with the F 
Scale. This result was held to confirm hypothe- 
sis 2. 
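
The "corrected" odd-even coefficient reported above can be illustrated with a short sketch. It assumes the correction was the conventional Spearman-Brown step-up for a test of doubled length; the article does not name the formula, and the function names below are illustrative only.

```python
def spearman_brown(r_half: float) -> float:
    """Step up a split-half (odd-even) correlation to full-test reliability.

    Assumes the standard Spearman-Brown formula for doubling test length.
    """
    return 2.0 * r_half / (1.0 + r_half)


def half_from_full(r_full: float) -> float:
    """Invert the formula: the raw odd-even correlation implied by a corrected value."""
    return r_full / (2.0 - r_full)


# The corrected reliability of the RAPH Scale is reported as .78.
# Under the assumption above, the uncorrected odd-even correlation
# would have been roughly .64.
raw = half_from_full(0.78)
print(round(raw, 2))                  # 0.64
print(round(spearman_brown(raw), 2))  # 0.78
```

Under that assumption, the two 10-item halves of the 20-item scale would have correlated at about .64 before correction.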


REFERENCES 


1. ADORNO, T., FRENKEL-BRUNSWIK, ELSE, LEVINSON, D. J., & SANFORD, R. N. The authoritarian personality. New York: Harper, 1950.

2. BRADY, R. A. The spirit and structure of German fascism. New York: Viking, 1937.

3. BROWN, R. Some determinants of the relationship between rigidity and authoritarianism. Paper read at Midwest. Psychol. Ass., Cleveland, April, 1952.

4. CHEIN, I., DEUTSCH, M., HYMAN, H., & JAHODA, MARIE (Eds.) Consistency and inconsistency in intergroup relations. J. soc. Issues, 1949, 5(3), 1-64.

5. DAVIS, A., GARDNER, B. B., & GARDNER, M. R. Deep south. Chicago: Univer. of Chicago Press, 1941.

6. FRENKEL-BRUNSWIK, ELSE. Dynamic and cognitive characterization of qualitative material: II. Application to interviews with the ethnically prejudiced. J. Psychol., 1948, 25, 261-277.

7. FRENKEL-BRUNSWIK, ELSE. Intolerance of ambiguity as an emotional and perceptual personality variable. J. Pers., 1949, 18, 108-143.

8. HARTLEY, E. L. Problems in prejudice. New York: Kings Crown Press, 1946.

9. KOUNIN, J. S. Intellectual development and rigidity. In R. G. Barker, J. S. Kounin, & H. F. Wright (Eds.), Child behavior and development. New York: McGraw-Hill, 1943. Pp. 179-197.

10. LEVINSON, D. J. An approach to the theory and measurement of ethnocentric ideology. J. Psychol., 1949, 28, 19-39.

11. LEWIN, K. A dynamic theory of personality. New York: McGraw-Hill, 1935.

12. LIKERT, R. A technique for the measurement of attitudes. Arch. Psychol., 1932, 140, 1-55.

13. LYND, R. S., & LYND, HELEN M. Middletown. New York: Harcourt, Brace, 1929.

14. LYND, R. S., & LYND, HELEN M. Middletown in transition. New York: Harcourt, Brace, 1937.

15. MILLS, C. W. White collar. New York: Oxford Univer. Press, 1951.

16. ROKEACH, M. Generalized mental rigidity as a factor in ethnocentrism. J. abnorm. soc. Psychol., 1948, 43, 259-278.

17. ROKEACH, M. A method for studying individual differences in “narrow-mindedness.” J. Pers., 1951, 20, 219-233.

18. ROKEACH, M. “Narrow-mindedness” and personality. J. Pers., 1951, 20, 234-251.

19. WARNER, W. L., & LUNT, P. S. The social life of a modern community. New Haven: Yale Univer. Press, 1941.

Received November 13, 1952. 




AN EXPERIMENTAL REUNION OF PSYCHOANALYTIC THEORY
WITH PERCEPTUAL VIGILANCE AND DEFENSE¹,²


GERALD S. BLUM 
University of Michigan 




PSYCHOANALYTIC theory has long since
grown accustomed to adolescent re-
belliousness. Like the parent who
conceives and cares for his offspring only to
find one day that all past efforts are now met 
with ingratitude and desertion, psychoanalysis 
has spawned many concepts in psychology 
which somehow stray from home and even- 
tually lose their parental identity. Some cur- 
rent emphases in the field of perception do not 
deviate from this familiar pattern. 

The notions of perceptual vigilance and de- 
fense, for example, have their direct counter- 
parts in psychoanalytic theory. Though the 
link between the two still receives some 
acknowledgment (we seem to be in the “early
straying” stage now), the portents are omi-
nous. The critics are once again assailing the
untestability of the original psychoanalytic
formulations—a certain cue for a quick fade-
out. For example, Postman and Brown, in a
recent comment on the mechanism of repres-
sion, say: “It seems to us that theoretical
arguments relating perception to such general
[illegible] factors hold out little promise for
the precise specification and experimental
manipulation of the conditions under which
variations in perceptual behavior occur” (11,
p. 219).

Having advocated the testability of psycho-
analytic hypotheses on other fronts (1, 4),
the present writer undertook the following
experimental investigation in the area of per-
ception.


HYPOTHESES

Psychoanalytic theory yields two
hypotheses concerning the
perception of psychosexual stimuli. First,
sexual and aggressive impulses, denied con-
scious expression by the ego, still continue to
strive actively to break through into conscious-
ness. In other words, repressed psychosexual
impulses, of which the individual is not
consciously aware, are always pushing and
seeking for an outlet in conscious behavior.
If this theoretical formulation is correct, then
everyone should, at the unconscious level, be
sensitive and responsive to cues relevant to
these potentially threatening impulses, a
process which is now familiarly labeled “sub-
ception” (9), “selective vigilance” (5), “selec-
tive sensitization” (8), etc.

¹ Based on a paper presented in a symposium on
“Measuring Human Motivation” at the annual meeting
of the American Psychological Association, Washing-
ton, D. C., 1952.

² This study was made possible by a Rackham Faculty
Fellowship at the University of Michigan
for the summer of 1952.

This process, however, is expected to operate
only at a level below conscious awareness,
for when the impulses do begin to approach the
surface, we have a second process, ego de-
fense. Now, that which the organism basically
desires must be warded off because of its
threatening quality. At this point, rather than
being vigilant for psychosexual cues, the
individual seeks devious ways not to perceive
them—a mechanism currently labeled “per-
ceptual defense.”

In sum, we are led by psychoanalytic theory
to postulate two opposing tendencies—vigi-
lance and defense—and to predict the
operation of each by controlling the level of
awareness.


METHOD 


Investigation of the perception of psychosexual
stimuli, upon which psychoanalytic theory focuses,
required an initial choice of techniques. To tap per-
ception we used a tachistoscope; to provide psycho-
sexual material we chose the Blacky Pictures (2).
The experimental design grew out of an
earlier study by Clapp (6), who presented pairs of
Blacky Pictures tachistoscopically and asked
subjects to judge their relative clarity. In accordance
with his predictions, the more traumatic picture in a
pair tended to be judged as clearer at very fast [...]
however, pointed strongly to the feasibility of
continuing to explore perceptual approaches to psycho-
sexual motivation by means of the Blacky Pictures.

Subjects. Subjects (Ss) in the present study were 
seven men and seven women, ranging in age from 19 to 
37, with a mean of 24. Their ranks included secretaries 
and undergraduate and graduate students in a variety 
of fields. The sole criteria for selection were no prior 
knowledge of the Blacky Pictures and a willingness
to participate. Approximately 30 other individuals of 
similar pedigree had been employed earlier in pretesting 
the experimental conditions. 

Apparatus. The tachistoscope, constructed for use
by one S at a time, was designed by R. Gerbrands, of
Arlington, Massachusetts. The speeds of perception in
the first and second parts of the experiment (described
below) were .03 sec., assumed to represent a low level
of awareness, and .20 sec., assumed to represent a
relatively higher level of awareness, respectively. Il-
lumination of the adapting field was 1.87 foot-lamberts,
of the test field 1.15 foot-lamberts.

Procedure. The individual S was brought into the
laboratory room, and after having been familiarized 
with the tachistoscope, was given these instructions: 

“Now I'm going to flash some pictures very quickly, 
at a fraction of a second. There will be four pictures 
shown simultaneously at each flash—one at the Left, 
Right, Top, and Bottom. What I want you to do is 
simply to say which one of the four stands out the most. 
Obviously with these very fast speeds you won’t be 
able to get any real idea even of what the pictures are, 
so you may feel quite uncertain of your judgments.
But in every case make a guess. All I am interested in, 
for this part of the experiment, is your immediate 
impression. You will see different patterns or combina- 
tions of pictures during the series of flashes. Remember 
to say just Left, Right, Top, or Bottom, according to 
which picture appears to you to stand out the most. 
When I say Ready, focus on the dark spot in the center 
of the screen. That will give you the best chance to see
all four pictures at once. I will flash the pictures right 
after the ready signal.” 

Then a pattern of Blacky Pictures was flashed at
.03 sec. and S immediately indicated the position of
the one which seemed to him to stand out most. Each
pattern consisted of the following four Blacky Pictures:
Oral Sadism (II) and Masturbation Guilt (V), which
were the two key experimental stimuli; and Oral
Eroticism (I) and Identification Process (VII), which
were slightly altered to serve as effective distractors.
The four pictures were actually mounted on each of
six cards, so that the position of every picture was
systematically rotated.³ The six cards were flashed
for a total of 54 trials at the same speed,
with short breaks after each series of 18 trials. After the
first exposure S invariably expressed amazement that
any one could see anything at all at speeds like that,
but with minor reassurance settled down to the routine
task of saying Left, Right, Top, or Bottom.

³ Photographic reproductions of these four stimulus
patterns and tables of the raw data from this experi-
ment have been deposited with the ADI. Order Docu-
ment 4031 from the ADI Auxiliary Publications
Project, Photoduplication Service, Library of Congress,
Washington 25, D. C., remitting in advance $1.25 for
35mm. microfilm or $1.25 for photostats readable
without optical aid. Make checks payable to Chief,
Photoduplication Service, Library of Congress.

At the conclusion of the 54 trials the experimenter 
(E) asked S to guess what he had been looking at, 
since he had never seen or heard of the Blacky Pictures 
before. The almost universal reply was “elephants.”
In the beginning, a suspicion existed that this response 
might have been contaminated by the fact that the early 
Ss were run during the Republican convention, but 
there were still lots of elephant reports and only one or 
two donkeys two weeks later. The parsimonious inter- 
pretation seems to be that at .03 sec. the various pro- 
jections (limbs, tails, etc.) in the pictures make Blacky 
resemble an elephant. 

The purpose of these first 54 trials was to establish a 
base line of S’s choices made solely from the physical 
properties of the pictures. Next came the sensitization
experience, in which S was injected with an appropriate 
“type and dosage of psychosexual serum.” This was 
accomplished by showing him the two experimental 
pictures placed adjacently (positions systematically 
varied) and saying: 

“Now I’m going to show you two of the pictures 
from among the several which you just saw at fast 
speeds. They belong to a psychological test which con- 
sists of a series of cartoons portraying the adventures of 
a dog named Blacky. I'll tell you what is 
in each picture when I show them to you.” (Pictures 
then presented in full view.) 

For the male Ss the Masturbation picture (V) was 
structured as the traumatic one by stating, “Over here, 
Blacky can’t keep from licking his sexual parts even 
though he’s been scolded for it,” and the other stimulus 
picture (II) neutralized by the comment, “Over here, 
Blacky loves the free feeling of romping outdoors and 
retrieving things like this belt or collar.” Also, Blacky's
angry expression was deleted from the latter and the
name “Mama” taken off the collar. For the females, it
[...] her collar,” (the fierce look and “Mama” were restored),
and the other picture (V) was neutralized [...]
original Blacky monograph (1) females were suggested
to be more oral sadistic than males, and in the pre-
testing it was found to be more difficult to disguise
Oral Sadism for the women and Masturbation Guilt
for the men.

The third phase of the experiment consisted of an 
exact replication of the first—another 54 trials at .03 
sec. with the same instructions. The only difference 
between the phases was that S had now seen two of the 
pictures and had, presumably, also thought privately 
about his own experiences in those areas. The theoretical 
prediction was that he should now be sensitized to the 
traumatic picture and consequently choose it as stand- 
ing out the most, at this brief exposure of .03 sec., 
relatively more often than he had originally. This then 
was a test of the first hypothesis, dealing with the 
striving of repressed impulses for expression, since S 
still had no opportunity for conscious recognition of 
the stimuli. 

The second hypothesis, dealing with perceptual 
defense against psychosexual stimuli presented at close 
to the conscious level of awareness, was tested as 
follows. After the vigilance series was completed, S 
was told: 

“This time we're going to do something slightly 
different. The picture where Blacky is (holding the 
collar) (angry at Mama) (masturbating) (chasing flies) 
will be present in each one of the combinations. It will 
appear either at the left, the right, the top, or the 
bottom. And it will move around among all the posi- 
tions—it won’t be in the same position all the time. 
Also, it will appear in only one of the four positions 
at each flash. Your job will be to locate the correct 
position. Focus on the dark spot at the ready signal.” 

The S was asked to look alternately for one picture 
for six trials and then the other for six trials, up to 36 
trials on each one. The designation of each picture con- 
formed, of course, to its earlier structure in the sensitiza- 
tion experience. 

The ego was brought into play in two ways: first, 
the time of exposure was increased from .03 sec. to 
.20 sec.—a speed at which preliminary studies had 
shown that S very frequently recognized one or more 
of the pictures; and secondly, his task was now a 
conscious, ego-involving one of trying to locate a 
certain picture. The prediction in this case was the 
opposite of what it had been before: the traumatic 
picture, which now assumes an ego-threatening quality, 
should be correctly located less frequently than the 
neutral one. Because of S’s emotional disturbance con- 
nected with the traumatic picture, he should be more 
anxious and blocked in his attempts to find it, even 
though earlier he had unconsciously been more respon- 
sive to it in the vigilance series. 


RESULTS AND DISCUSSION 


Vigilance series. The sensitization experi-
ence, in which S was exposed to relatively
traumatic and neutral psychosexual stimuli,
was intended to activate experimentally an
underlying area of conflict already existing in
his personality structure. In other words, the
male is theoretically expected to be predis-
posed, in some degree, to masturbation guilt
and the female to conflict over oral sadism.
The experimental sensitization served to link
this “natural” predisposition with a specific
picture stimulus, to which responses could then
be predicted. In terms of the vigilance hy-
pothesis, S should shift from pre- to post-
sensitization in the direction of being more
responsive to the traumatic picture during
the later of the first two series.
This prediction was tested by noting for
each S the relative proportions of choices of
Pictures II and V before and after sensitiza-
tion. An S who chose the traumatic picture
relatively more often than the neutral one
in the postsensitization condition than he had
done earlier was behaving as predicted. By
this sign test (set up in advance as the appro-
priate method of statistical analysis), the
results show that 11 of the 14 Ss shifted in the
predicted direction (p < .05).⁴ Breaking down
the data by sexes, we find that six of the seven
males shifted toward the traumatic picture
(Masturbation Guilt) and five of the seven
females shifted toward the traumatic one
(Oral Sadism). Further examination of the
extent of the shift in each individual reveals
statistical significance beyond the .10 level
of confidence in 6 of the 11 positive cases.
Thus the vigilance hypothesis is supported by
the results.
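
The sign test reduces to a binomial tail probability, which can be sketched as follows. The sketch assumes a one-tailed test with a chance probability of .5 for each S; the function name is illustrative, and the exact figure it yields need not coincide with a value read from the tables of the cited procedure.

```python
from math import comb


def sign_test_p(successes: int, n: int) -> float:
    """Exact one-tailed sign-test probability.

    Returns P(X >= successes) for X ~ Binomial(n, 0.5), i.e. the chance of
    at least this many Ss shifting in the predicted direction if shifts in
    either direction were equally likely.
    """
    if not 0 <= successes <= n:
        raise ValueError("successes must lie between 0 and n")
    tail = sum(comb(n, k) for k in range(successes, n + 1))
    return tail / 2 ** n


# 11 of the 14 Ss shifted in the predicted direction.
p = sign_test_p(11, 14)
print(round(p, 4))  # 0.0287, i.e. below the .05 level
```

The same function applies to any other count of Ss in the predicted direction out of any group size.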
Defense series. The ego-defense hypothesis
maintains that S should, at the close-to-
conscious level, block more in response to the
traumatic stimulus and therefore perform
less effectively in locating it. The test here
involves a comparison within each S of the
number of correct locations of the two pictures
during the 36 trials on each. Twelve of the 14
Ss performed as predicted (p < .006), i.e.,
made fewer correct locations of the traumatic
than of the neutral picture. The breakdown
by sexes shows that all seven females did
more poorly looking for the traumatic stim-
ulus (Oral Sadism), and five of the seven
males were poorer on the traumatic one
(Masturbation Guilt). Statistical significance
of individual shifts exceeded the .10 level in
five of the 12 cases. It should be noted that
these positive results occurred despite the
previously greater responsiveness to the trau-
matic picture in the vigilance trials.

⁴ Level of significance computed according to the
procedure outlined in Dixon, W. J., & Mood, A. M.
The statistical sign test. J. Amer. stat. Ass., 1946, 41,
557-566. This nonparametric test is based on the bi-
nomial distribution and does not involve any assump-
tion of continuity of data.

The above findings contain implications 
which go beyond their specific support of 
hypotheses dealing with vigilance and defense. 
At the most general level we now have addi- 
tional experimental evidence pointing to the 
testability of psychoanalytic hypotheses. 
Even such a seemingly elusive concept as the 
unconscious striving of repressed impulses 
has, in this experiment, lent itself to opera- 
tional definition within a perceptual frame-
work. Also, by harnessing the predictive power
of this potent theory of personality, despite
its limitations, we can study behavior in a
more appropriate setting.⁵ Previous experi-
ments in this area (9, 10) have contented 
themselves with nonsense syllables, PGR 
equipment, and the like. The very same 
phenomenon of subception, now analyzed 
from the standpoint of strong, underlying 
impulses assumed to be operating commonly 
in personality structures, becomes demon- 
strable at the psychological as well as the 
physiological level of behavior. A psychosexual 
context, combining well-developed theory 
with translation into the form of Blacky Pic- 
tures, thus seems to provide the exploration 
of human personality with an easy entry into 
the experimental laboratory. 

The next research steps suggested by the 
data concern the problem of individual dif- 
ferences in perceptual response. The few ex- 
ceptions in the present study, plus the differen- 
tial degree of vigilance and defense effects 


noted in those Ss who did conform to the gen-
eral pattern of behavior, point to the desir-
ability of such an approach. Prior assessment
of S's personality dynamics, particularly with
reference to his characteristic use of defense
mechanisms, seems most plausible in this re-
gard. Theoretically, the accuracy of predic-
tion should be augmented by knowledge of
type and degree of psychosexual conflict and
its accompanying defensive reactions. The
feasibility of considering perceptual defense
in the light of ego-defense mechanisms has
already been shown by Eriksen and Lazarus
(7, 8). Further application to the process of
selective vigilance follows directly from psy-
choanalytic theory. For example, the person
who projects his unacceptable impulses is
said to be extraordinarily sensitive to the
existence of similar impulses in other individ-
uals. Whether he will oblige in his tachisto-
scopic performance remains to be seen.

⁵ The importance of an appropriate setting is em-
phasized by the results of a supplementary study under-
taken in conjunction with the present research. The
reader will recall that the “natural” sets in the experi-
ment proper were deemed to be Masturbation Guilt
as more traumatic for the male S and Oral Sadism
for the female S. In order to test this assumption, the
experiment was repeated on another group of
Ss with the traumatic and neutral sets reversed, i.e.,
Oral Sadism structured as the traumatic picture for
the men and Masturbation Guilt for the women. The
vigilance and defense effects were somewhat
weaker in the case of the men than they had been in
the experiment proper (five out of seven in the pre-
dicted direction for both vigilance and defense), and
disappeared completely in the case of the women (one
out of seven on vigilance and two out
of seven on defense).


SUMMARY


This experiment was designed to test, within 
the framework of perceptual behavior, two 
psychoanalytic hypotheses: (a) the uncon- 
scious striving for expression of underlying 
psychosexual impulses (vigilance); and (b)
the warding off of these threatening impulses 
as they begin to approach conscious awareness 
(defense). In the vigilance series, patterns of 
Blacky Pictures were flashed tachistoscopi- 
cally at .03 sec., and Ss (seven men and 
seven women) were asked to indicate the 
position of the one picture "standing out the 
most.” In the defense series, the ego was 
brought more into play by slowing the speed 
to .20 sec. and instructing Ss to try to locate 
the correct positions of selected pictures. 

The results were: 

1. The vigilance hypothesis was supported in 
11 of the 14 individual cases (p < .05). After a 
brief sensitization experience, most Ss re- 
sponded relatively more often to a traumatic 
psychosexual stimulus (Masturbation Guilt 
for men, Oral Sadism for women) than they 
had done in presensitization trials, despite 
the absence of any conscious recognition of 
the pictures throughout. 

2. The defense hypothesis was supported in 
12 of the 14 cases (p < .006). At the close-to- 
conscious level, most Ss experienced more 
difficulty in trying to locate a traumatic 
psychosexual stimulus picture than a relatively
neutral one, in contrast to a previously greater
sensitivity to the traumatic picture during
the vigilance series.

The experimental findings were interpreted
as further evidence for the testability of psy-
choanalytic formulations. In addition, the use
of psychosexual content was considered to 
facilitate a more appropriate setting in the 
laboratory study of human personality. 


REFERENCES 


1. BLUM, G. S. A study of the psychoanalytic theory of psychosexual development. Genet. Psychol. Monogr., 1949, 39, 3-99.

2. BLUM, G. S. The Blacky Pictures: a technique for the exploration of personality dynamics. New York: Psychological Corp., 1950.

3. BLUM, G. S., & HUNT, H. F. The validity of the Blacky Pictures. Psychol. Bull., 1952, 49, 238-250.

4. BLUM, G. S., & MILLER, D. R. Exploring the psychoanalytic theory of the “oral character.” J. Pers., 1952, 20, 287-304.

5. BRUNER, J. S., & POSTMAN, L. Tension and tension-release as organizing factors in perception. J. Pers., 1947, 15, 300-308.

6. CLAPP, C. D. Two levels of unconscious awareness. Unpublished doctor's dissertation, Univer. of Michigan, 1951.

7. ERIKSEN, C. W. Perceptual defense as a function of unacceptable needs. J. abnorm. soc. Psychol., 1951, 46, 557-564.

8. LAZARUS, R. S., ERIKSEN, C. W., & FONDA, C. P. Personality dynamics and auditory perceptual recognition. J. Pers., 1951, 19, 471-482.

9. LAZARUS, R. S., & MCCLEARY, R. A. Autonomic discrimination without awareness: a study of subception. Psychol. Rev., 1951, 58, 113-122.

10. MCGINNIES, E. Emotionality and perceptual defense. Psychol. Rev., 1949, 56, 244-251.

11. POSTMAN, L., & BROWN, D. R. The perceptual consequences of success and failure. J. abnorm. soc. Psychol., 1952, 47, 213-221.


Received December 8, 1952.


AUTHORITARIAN PERSONALITY STUDIED BY A NEW VARIATION
OF THE SENTENCE COMPLETION TECHNIQUE¹


RONALD J. DORRIS 
Cornell University 


DANIEL J. LEVINSON
Harvard University 


AND EUGENIA HANFMANN 


Brandeis University 


THIS study was initiated on the hunch
that a particular recent variation of the 
sentence completion technique (SCT) 
is singularly well adapted for investigating 
some aspects of the theory of the authoritarian 
personality (3). In this variation of the SCT, 
the subject, after completing the test, is asked 
to indicate for each item whether the item, as 
completed by him, is true of himself or not. 
In introducing this self-reference technique, 
the authors assumed that the denial of self- 
reference may be indicative of the subject's 
lack of awareness of the personal tendency 
expressed in the completion. In an exploratory 
study the average percentage of admitted self-
reference was found to be quite high (70 per
cent of all items completed), suggesting that 
the sentences used elicited mainly conscious 
or preconscious material; however, for com- 
pletions with ego-threatening content, self- 
reference was denied significantly more 
frequently than for completions with ego- 
enhancing content. This finding lends some 
support to the assumption that denial of self-
reference reflects defensive unawareness of
one's unaccepted tendencies. Furthermore, it
was found that completions of items using
the third person elicited significantly more
denials of self-reference than did completions
of first-person items. This finding is con-
sistent with the frequently made assumption
that third-person items facilitate the expression
(projection) of tendencies that the subject does
not recognize in himself. Finally, the explor-
atory study of the self-reference technique
revealed a wide range of individual variation
which suggested that the percentage of self-
reference responses might be used as a rough
measure of the degree of the subject’s self- 
awareness or insight. 
It is apparent that the variables provided
by the self-reference technique in conjunction
with the utilization of the first- and third-per-
son items in the SCT are directly relevant to
some of the personality traits that occupy a
central position in the theory of the author-
itarian personality. These are the traits of
projectivity and of anti-intraception (op-
position to concern with inner life), both of
which lead to limitations and distortions of
self-awareness. According to the current
theory, authoritarians do not greatly differ
from equalitarians in their basic tendencies
and needs, or in amount of conflict and anx-
iety. One would not therefore expect the
members of the two groups to present
strikingly different pictures in the SCT if
the results are analyzed by the usual methods
of categorizing the content. The authori-
tarians, however, have a lower awareness of
their own unaccepted tendencies; and in the
self-reference inquiry they may be expected
to deny the self-reference of such responses
more often than the equalitarians.

¹ This study was carried out at the Department of
Social Relations, Harvard University.


In the present study we shall test these 
propositions by comparing the test results of 
two groups selected for high and low authori- 
tarianism. The study may be considered 
either as a test of the authoritarian personality 
theory (1), done with a fairly simple new
method, or as an attempt to validate an 
innovation in the SCT, using the known 
characteristics of authoritarians as a validating 
criterion. Whichever formulation one chooses, 
the testing of specific hypotheses generated by 
the interaction of the theory of authoritarian- 
ism and the theory of projective techniques 
should contribute to our knowledge of the 
dynamics of different levels of consciousness, 
an area that both theories have in common. 


METHOD 


A questionnaire containing several authoritarianism 
scales (TAP Battery, Form FERPT, Harvard Univer- 
sity) and a specially constructed form of the Sentence 
Completion Test, including the self-reference technique, 
were administered to a group of 54 men, freshmen of a
metropolitan university. The questionnaire was given
before the SCT. The three scales whose intercorrela-
tions were highest were used to rank Ss on authoritar-
ianism. These were: Traditional Family Ideology,
Authoritarianism, and Ethnocentrism (1, 5). Every S
was assigned a standard score on each of these subscales.
The three scores were added, and the population was
placed on a continuum according to these summated 
scores. In following the natural breaks between scores, 
11 Ss from one end of the continuum and 10 from the 
other were selected to represent the two extremes of high 
and low authoritarianism. These 21 Ss (10 highs and 11 
lows on authoritarianism) form the comparison groups 
of the study. 
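
The ranking procedure just described can be sketched in a few lines. The scores below are hypothetical, the function names are illustrative, and two simplifications are assumed: standard scores are computed with the population standard deviation, and the extreme groups are cut at fixed counts rather than at the "natural breaks" the authors followed.

```python
from statistics import mean, pstdev


def z_scores(values):
    """Convert raw scale scores to standard scores (mean 0, SD 1)."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]


def summated_standard_scores(scales):
    """Add each S's standard scores across scales.

    `scales` maps a scale name to a list of raw scores, one per S,
    with Ss in the same order in every list.
    """
    per_scale = [z_scores(scores) for scores in scales.values()]
    return [sum(column) for column in zip(*per_scale)]


def extreme_groups(totals, n_low, n_high):
    """Return indices of the n_low lowest and n_high highest Ss."""
    order = sorted(range(len(totals)), key=lambda i: totals[i])
    return order[:n_low], order[-n_high:]


# Hypothetical raw scores for six Ss on the three scales.
scales = {
    "Traditional Family Ideology": [1, 2, 3, 4, 5, 6],
    "Authoritarianism": [2, 4, 6, 8, 10, 12],
    "Ethnocentrism": [10, 11, 12, 13, 14, 15],
}
totals = summated_standard_scores(scales)
lows, highs = extreme_groups(totals, n_low=2, n_high=2)
print(lows, highs)  # [0, 1] [4, 5]
```

Standardizing before summing keeps a scale with a wide raw-score range from dominating the composite.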

The 50 items of the SCT administered to these Ss 
covered a wide range of content, such as family, sex, 
social pressures, fears, etc. Twenty-five fragments 
used the first-person pronoun; 25 used a man’s name 
or the third-person pronoun. The first- and the third- 
person items were matched in content; in 10 pairs the 
content was identical ("I am very proud of..."; 
“He is very proud of ...”); in 15 pairs it was similar 
("When no one paid attention to me I..."; “When 
he was neglected he..."). Three of the 25 pairs of
fragments were not included in the calculations. These
items dealt with pleasant feelings ("Henry was glad
when . . ."). They were interspersed with other items in
order to make the test appear less threatening. All the 
data below are, therefore, based on 22 pairs of items. 

The test was disguised as a verbal speed test and 
was administered as a group test. After Ss had finished 
writing they were given the following additional 
instructions: 

"Now that you have completed the sentences, go 
back and reread your completions. Consider for each 
completed sentence whether or not it applies in any 
way to yourself. If it refers to some personal ex- 
perience, or reflects the way you feel and act, or would 
feel and act under the given circumstances, put a (+)
to the left of the number of the sentence. If the sentence 
has no reference to yourself, put a (—) to the left of 
the number. As your name does not appear on these 
papers, you may be quite frank. Go over each sentence 
quickly, and once you mark it do not return. Leave no 
blanks, please.” 

The completions marked with + will be called 
“self” responses, those marked with — “nonself” 
responses.²
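
As a minimal sketch of how the self-reference marks might be tallied, the percentage of “self” responses can be computed separately for first- and third-person items. The marks below are hypothetical and the function name is illustrative; the article does not describe its tallying procedure at this level of detail.

```python
def percent_self(marks):
    """Percentage of completions marked '+' (self-reference admitted)."""
    if not marks:
        raise ValueError("no marked completions")
    return 100.0 * marks.count("+") / len(marks)


# Hypothetical marks for one S, split by item type.
first_person = ["+", "+", "-", "+"]
third_person = ["+", "-", "-", "+"]
print(percent_self(first_person))  # 75.0
print(percent_self(third_person))  # 50.0
```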

The Ss’ responses to the SCT were classified under 
various categories. The present report makes use of 
three of them: Ego-Threatening character of response 
(ET), Tnadequacy-Passivity (IP), and Masculine 
Ideal (MI). 

Each single response was scored as either ego- 
threatening or not ego-threatening. A score of ET was 


? The Sentence Completion Test employed in this 
study has been deposited with the ADI. Order Docu- 
ment 3979 from the ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Congress, 
Washington 25, D. C., remitting in advance $1.25 for 
photoprints or $1.25 for 35mm. microfilm. Make 
checks payable to Chief, Photoduplication Service, 
Library of Congress. 




R. J. DORRIS, D. J. LEVINSON, AND E. HANFMANN


based on the scorer’s judgment that a given response, 
if it were considered as a statement about oneself, 
would be felt to be threatening by the majority of 
people in our culture. Thus socially unacceptable 
responses, including expression of fears, admission of 
inferiority or incompetence, aggressive rebellion 
against parents, and extreme aggression in general, 
were scored as ego-threatening.³

The two mutually exclusive categories of 
Inadequacy-Passivity and Masculine Ideal tendency
were not applied to all the responses, but only to those 
to which they were clearly appropriate. Inadequacy- 
Passivity score was given to responses containing 
admissions of submissiveness, indecision, passive or 
sulking response to frustration or failure, and ad- 
missions of difficulties with sex. This category is a 
narrower one than that of Ego-Threatening responses, 
the specific threat in this case being the violation of 
the masculine ideal that is widely held in our culture.
The Masculine Ideal score was given to all responses 
indicating competence, superiority over others, and 
prowess in work, social, athletic, and sexual spheres. 
Specific criteria for both categories were worked out 
for each particular item scored. 

Precautions were taken to insure reliability and to 
avoid bias in scoring. Responses of each S to each 
given pair of items were copied on separate cards, S’s 
identifying number being placed on the reverse side of 
each card. While scoring the responses of all Ss to a 
given pair of items, the scorer was thus unable to 
identify the high and the low Ss; the use of cards also
eliminated the halo effect that would have been 
created by reading the total record. For each category 
the scoring was done by two scorers. For Ego-
Threatening, the correlation coefficient between the
two sets of scores was .89; for Inadequacy-Passivity,
.93; for Masculine Ideal, .93.⁴

The scored data provide us with the following 
measures for each S or each group of Ss: (a) degree of 
self-reference, determined by the percentage of self 
responses yielded by the self-reference technique; 
(b) strength of experienced ego threat, determined by 
the number of Ego-Threatening responses (ET); 
(c) strength of Inadequacy-Passivity tendencies (IP
score); and (d) strength of the Masculine Ideal
tendencies (MI score). Variables b, c, and d can be
designated as content variables. In addition we shall
consider the differentiation, in degree of self-reference
and in content, between the responses given to first-
person and to third-person items. All our hypotheses
about the expected differences between Ss high and low
in authoritarianism are formulated in terms of these
variables and of their interrelationships.
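
The four measures can be illustrated with a minimal sketch. It assumes each scored completion is stored as a small record; the record layout and the names `Response`, `subject_measures`, and `measures_by_person` are hypothetical and are not taken from the study's scoring materials.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Response:
    """One scored completion (hypothetical record layout)."""
    person: int               # 1 = first-person (P-1) item, 3 = third-person (P-3) item
    self_ref: bool            # True if S marked "+" (self), False if "-" (nonself)
    ego_threat: bool          # True if the completion was scored ET
    category: Optional[str]   # "IP", "MI", or None; IP and MI are mutually exclusive


def subject_measures(responses: List[Response]) -> dict:
    """Return measures (a) to (d) for one S over the given completions."""
    n = len(responses)
    if n == 0:
        raise ValueError("no responses to score")
    return {
        # (a) degree of self-reference: percentage of self responses
        "self_pct": 100.0 * sum(r.self_ref for r in responses) / n,
        # (b) number of Ego-Threatening responses
        "ET": sum(r.ego_threat for r in responses),
        # (c) number of Inadequacy-Passivity responses
        "IP": sum(r.category == "IP" for r in responses),
        # (d) number of Masculine Ideal responses
        "MI": sum(r.category == "MI" for r in responses),
    }


def measures_by_person(responses: List[Response], person: int) -> dict:
    """Same measures restricted to P-1 (person=1) or P-3 (person=3) items."""
    return subject_measures([r for r in responses if r.person == person])
```

Computing the measures separately for P-1 and P-3 items gives the first-person versus third-person differentiation referred to above.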


³ The nature of the fragment codetermines the
character of response. However, only a few of the 44
items included in the calculations were such that they
could not be completed to make either an ego-
threatening or a nonthreatening statement.

⁴ The formula used was:
$r = \dfrac{2\sum XY - (\sum X + \sum Y)^2 / 2N}{\sum X^2 + \sum Y^2 - (\sum X + \sum Y)^2 / 2N}$,
where X indicates that the category in question was
scored as present by rater 1, and Y refers to rater 2.
See Johnson (4, p. 127).
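
As a rough illustration of the inter-scorer coefficient, the sketch below evaluates r = [2ΣXY − (ΣX + ΣY)²/2N] / [ΣX² + ΣY² − (ΣX + ΣY)²/2N] for two lists of presence/absence scores. The function name `interscorer_r` and the 0/1 coding are assumptions made for the example.

```python
from typing import Sequence


def interscorer_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Agreement coefficient between two scorers.

    x[i] and y[i] are the scores given to response i by rater 1 and
    rater 2 (assumed here: 1 = category present, 0 = absent).
    """
    if len(x) != len(y) or len(x) == 0:
        raise ValueError("both raters must score the same non-empty set")
    n = len(x)
    sum_xy = sum(a * b for a, b in zip(x, y))
    sum_x2 = sum(a * a for a in x)
    sum_y2 = sum(b * b for b in y)
    # Correction term shared by numerator and denominator.
    correction = (sum(x) + sum(y)) ** 2 / (2 * n)
    denominator = sum_x2 + sum_y2 - correction
    if denominator == 0:
        raise ValueError("scores have no variance; r is undefined")
    return (2 * sum_xy - correction) / denominator
```

With identical score lists the numerator and denominator coincide and r is 1; with completely opposed 0/1 lists of equal frequency r is −1.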




AUTHORITARIAN PERSONALITY BY SENTENCE COMPLETION TECHNIQUE


HYPOTHESES AND RESULTS 


Several sets of hypotheses were formulated 
at the start of the research. The hypotheses 
and the relevant findings are as follows. 


Differences between First-Person and Third- 
Person Items in the Elicitation of “Nonself” 
and of “Ego-Threatening” Responses


Hypothesis 1. For the sample as a whole, 
third-person (P-3) items will elicit more non-
self (NS) responses in the self-reference
inquiry than will first-person (P-1) items. This 
hypothesis is based on the assumption that 
the P-3 items, having as they do an “actor” 
who is explicitly not the person taking the 
test, will facilitate the expression of tendencies 
that S does not recognize in himself. 

Hypothesis 2. For the sample as a whole, the 
P-3 items will elicit more Ego-Threatening 
responses than will the P-1 items. This hypoth- 
esis, like the first, is based on the idea that 
the P-3 items allow the attribution to others of 
impulses or fears too threatening to be ad- 
mitted in oneself. 

Both hypotheses are supported by the data, 
and the findings of the former studies are con- 
firmed. In marking the 22 pairs of items, our 
21 Ss gave an average of 9.38 NS responses 
on the third-person items, and only 5.67 NS 
responses on the first-person items. The dif- 
ference of 3.71 is significant well beyond the 
.05 level of confidence.⁵ In the case of the
Ego-Threatening responses, the means were 
12.13 and 10.18 for the P-3 and P-1 items 


⁵ Note on statistical procedure: Fisher's t test was
used in all determinations of significance of difference
in this study. The .05 level of confidence was used as
the criterion for rejection of the null hypothesis in all
cases. Specific confidence levels will not be given for
each hypothesis, although it may be noted that for
most of them p was less than .01. Since the hypotheses
were made in advance, a one-tailed significance test
was used. For differences between two measures on the
total sample (having 20 df), a minimum t of 1.73 was
required. For differences on a given measure between
the “high-authoritarian” and “low-authoritarian”
subgroups, there were 19 df and a minimum t of 1.73.
For differences between two measures on the high
group (df = 9), the minimum t required is 1.83, and
for the low group (df = 10), it is 1.81. In computing
the SE of the difference, where two measures on the
same Ss were being compared, the following formula
was used: $SE_D = \sqrt{\dfrac{\sum D^2 - (\sum D)^2 / N}{N(N - 1)}}$
(from McNemar [6]).




respectively, and the difference of 1.96, 
though not large in an absolute sense, was
statistically significant. At the same time, it 
should be noted that our group was able to 
give, on the average, almost 50 per cent 
Ego-Threatening responses on the first-person
items. In addition, our Ss were ready on 
almost 60 per cent of the P-3 items to attribute 
to themselves the wishes or actions ascribed 
initially to others; that is, they marked as true 
of themselves the responses they gave on 
almost 12 of the 22 P-3 items. These findings 
suggest that the college students under 
conditions of anonymity are well able, on the 
average, to admit in themselves characteristics 
that a highly defensive person might be in- 
clined to deny and project. Further normative 
data on this score are in order. 
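
The paired comparison described in footnote 5 can be sketched as follows. The sketch assumes the usual standard error of a mean difference, sqrt{[ΣD² − (ΣD)²/N] / [N(N − 1)]}; the function name `paired_t` is hypothetical, and any counts passed to it in practice would have to come from the individual records, which are not reproduced in this report.

```python
import math
from typing import Sequence, Tuple


def paired_t(first: Sequence[float], second: Sequence[float]) -> Tuple[float, float, float]:
    """Return (mean difference, SE of the difference, t) for paired measures.

    first[i] and second[i] are two measures on the same S, for example
    the number of NS responses on P-1 items and on P-3 items.
    """
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need at least two paired observations")
    diffs = [b - a for a, b in zip(first, second)]
    n = len(diffs)
    sum_d = sum(diffs)
    sum_d2 = sum(d * d for d in diffs)
    # Sum of squared deviations of D, divided by N(N - 1), gives SE squared.
    se = math.sqrt((sum_d2 - sum_d ** 2 / n) / (n * (n - 1)))
    if se == 0:
        raise ValueError("all differences are equal; t is undefined")
    mean_d = sum_d / n
    return mean_d, se, mean_d / se
```

With 21 Ss there are 20 df, so the resulting t would be compared with the one-tailed minimum of 1.73 given in footnote 5.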


Differences between the High and Low Groups 
(as Determined by Standing on the Authori- 
tarianism Scales) in the Incidence of Nonself 
(NS) Responses on the Sentence Completion 
Test

Hypothesis 3. The high group will give more
NS responses on the total SCT battery than
will the low group.

Hypothesis 4. The high group will give 
significantly more NS responses on the third- 
person than on the first-person items; the 
low group will also have more NS on P-3 than 
on P-1, but the difference will not be sig- 
nificant. 

These hypotheses follow from the traits of 
projectivity and anti-intraception which have 
been found in previous research (1) to be 
associated with authoritarianism. 

The findings, presented in Table 1, confirm 
hypotheses 3 and 4. For the test as a whole, 
the highs give an average of 18.2 NS responses 
as compared with 12.2 for the lows. This
difference is produced mainly by the P-3 
items. The lows give only slightly fewer NS 
responses than the highs on the P-1 items. 
However, the lows maintain their low level of 
NS responses from P-1 to P-3, whereas the 
highs give almost twice as many NS responses 
on P-3 as on P-1 items. These results sug- 
gest, therefore, that either the number of 
NS responses on P-3 items, or the “P-3 minus
P-1” differential, may be taken as an approxi- 
mate index of self-deceptiveness. 
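
As a worked check on the proposed index, the group means reported in Table 1 can be subtracted directly. The dictionary layout and the function name `p3_minus_p1` are illustrative assumptions; because the published means are rounded, the low-group result (1.63) differs in the last digit from the tabled value of 1.64.

```python
# Mean number of NS responses by group and item type, as reported in Table 1.
TABLE_1_MEANS = {
    "low": {"P1": 5.27, "P3": 6.90},
    "high": {"P1": 6.10, "P3": 12.10},
}


def p3_minus_p1(group_means: dict) -> float:
    """The "P-3 minus P-1" differential in NS responses for one group."""
    return round(group_means["P3"] - group_means["P1"], 2)


def p3_to_p1_ratio(group_means: dict) -> float:
    """How many times more NS responses were given on P-3 than on P-1."""
    return group_means["P3"] / group_means["P1"]


differentials = {name: p3_minus_p1(m) for name, m in TABLE_1_MEANS.items()}
```

The high-group ratio of P-3 to P-1 NS responses comes out just under 2, which corresponds to the statement that the highs give almost twice as many NS responses on P-3 as on P-1 items.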


TABLE 1
DIFFERENCES BETWEEN HIGH AND LOW GROUPS IN FREQUENCY OF DENIAL OF THE SELF-REFERENCE OF THEIR SCT RESPONSES

                                  MEAN NO. OF NS
                       TOTAL      P-1 ITEMS   P-3 ITEMS
                       TEST       (22)        (22)        P₁ − P₃
Low group (N = 11)     12.18       5.27        6.90       −1.64**
High group (N = 10)    18.20       6.10       12.10       −6.00*
Diff. (L − H)          −6.02*     −0.83**     −5.19*      +4.36

Note. Significance or lack of significance noted only in cells that bear on a hypothesis.
* Statistically significant at the .05 level or better (see footnote 5).
** Not significant.


TABLE 2
DIFFERENCES BETWEEN HIGH AND LOW GROUPS IN THE INCIDENCE AND AWARENESS OF EGO-THREATENING TENDENCIES ON THE SCT

                      MEAN NO. OF ET RESPONSES              MEAN ET-AWARENESS SCORE (S%)
ITEM                  LOW GROUP   HIGH GROUP   DIFF.        LOW GROUP   HIGH GROUP
Total test            20.90       23.70        −2.80**      71*         46**
P-1 items              9.45       10.90        −1.45**      77*         65*
P-3 items             11.45       12.80        −1.35**      66*         30*
Shift, P-1 to P-3      2.00*       1.90*        0.10        −11**       −35*

Note. Significance or lack of significance noted only in cells that bear on a hypothesis.
* Significant at .05 level or better (see footnote 5); in the case of the ET-Awareness scores, significantly greater or less than 50 (%).
** Not significant.


Differences between the High and Low Groups
in the Incidence and Awareness of Ego-
Threatening (ET) Material


Hypothesis 5. The high and low groups will
not differ significantly in average number of
ET responses on the SCT as a whole. This
hypothesis is based on the idea that these two
groups do not differ, on the average, in total
amount of anxiety over inner wishes or sensed
external threats. They differ, rather, in their
ways of dealing with conflict and fear, which
will be considered in the hypotheses that
follow.

Hypothesis 6. The “P-3 minus P-1” dif-
ferential: the highs will express their ego-
threatening ideas and feelings on the first-
person items significantly less frequently than
they will on third-person items, where the
self-reference can more easily be denied. The
lows, on the other hand, will make less use of
P-3 items for purposes of self-deception;
consequently, although the lows may give
slightly more ET responses on P-3 than on
P-1, the difference will not be significant.

A corollary prediction is that the difference
between highs and lows will be significant on
P-3 but not on P-1.

The results, shown in Table 2, confirm
hypothesis 5; for the test as a whole, the low
and high groups do not differ in average
number of ET responses, though the high
group mean is slightly greater. Hypothesis 6
is only partially confirmed. The high group
does, as predicted, give significantly more ET
responses on P-3 than on P-1, but so also
does the low group. Moreover, contrary to the
prediction, the two groups do not differ
significantly on P-3. In short, the P-3 items
evoke more ET material than do the P-1
items for both high and low authoritarian
groupings.

It will be recalled that the P-1 and P-3
items were matched, i.e., for every item in the
first-person form there is an identical or
equivalent one in the third-person form. If one
compares the responses of each S to the P-1
and P-3 items of the same pair, one can find
differences in degree of ego threat that are not
covered by the rough categories of presence
or absence of ET. As an example we may take
the following pair of completions given by the
same S: “The boss makes me feel . . . in-
ferior”; “The boss makes Henry feel . . .
nothing.” Both completions are ego threaten-
ing according to our definition of ET, but
the one given to P-3 is more drastically so
than that to P-1. Such greater threateningness
of responses to P-3 items was more frequent in
the high than in the low group. Therefore,
although the data obtained with our present
rough scoring do not confirm hypothesis 6, it
is possible that more refined scoring would
bring out high-low differences of the sort
proposed in it.

We have seen in Table 2 that the high and
low groups do not differ significantly in the 
number of ET responses ascribed to the SCT 
characters. Our next question concerns the 
differential readiness of the two groups to 
admit, or to deny, these tendencies in them- 
selves. Operationally, the readiness to admit 
or to deny is represented in the marking of 
the ET responses as S or NS. We obtain a 
rough index of each S's readiness to admit ET 
tendencies in himself by determining the 
percentage of ET responses marked S. This 
measure, to be called the ET-Awareness (or 
ET-A) score, is obtained by the formula 
$\text{ET-A} = \dfrac{\sum (\text{ET-S})}{\text{Total ET}} \times 100$.
ET-A scores of greater
than 50 indicate a predominant tendency to 
admit the self-relevance of ET responses 
(more S than NS). Scores of less than 50 
indicate that the self-reference of more than 
half of the ET responses is denied. The follow- 
ing two hypotheses concern high vs. low group 
differences in ET-A scores. 
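
The ET-A index is a simple percentage and can be sketched as below. The function names and the list-of-booleans input are assumptions made for the example; each entry stands for one response scored ET, with True meaning that S marked it as a self (S) response.

```python
from typing import Sequence


def awareness_score(marked_self: Sequence[bool]) -> float:
    """Percentage of the scored responses that S marked as self (S).

    marked_self holds one entry per response scored in the category
    (for ET-A, one entry per ET response): True = S, False = NS.
    """
    if len(marked_self) == 0:
        raise ValueError("no responses scored in this category")
    return 100.0 * sum(marked_self) / len(marked_self)


def predominantly_admits(score: float) -> bool:
    """True when more than half of the responses were marked S (score > 50)."""
    return score > 50.0
```

For instance, an S with seven ET responses, five of them marked S, would have an ET-A score of about 71 and would fall on the side of admitting self-relevance.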

Hypothesis 7. The low-authoritarian group
will show a significantly greater awareness of 
ego-threatening tendencies than will the high- 
authoritarian group on the SCT as a whole. 
More specifically, the mean ET-A score of the 
lows will be significantly above 50, whereas 
the mean of the highs will not be significantly 
above 50 and may even be below it. 

Hypothesis 8. (a) The low group will have a 
greater ET-A mean than the high group on 
both the first-person and third-person items; 
the group difference may not achieve sig- 
nificance on the P-1 items, but it will definitely 
be significant on P-3. 

(b) The ET-A score will drop for both the
low and high groups from P-1 to P-3, that is, 
both will have a “P-3 minus P-1” differential. 
But this differential will be significant only in 
the case of the high group. In short, the highs 
will tend predominantly to deny the self- 
relevance of their ET responses on the third- 
person items, while the lows will continue to 
see their P-3 responses, like those on P-1 items, 
as possibly true of themselves. 

Data bearing on hypotheses 7 and 8 are 
given in Table 2. Hypothesis 7 is confirmed. 
On the total test the lows have an ET-A mean
of 71; that is, they admit that 71 per cent of
their ET responses have some reference to 
their own wishes or acts. This value is sig- 
nificantly greater than 50. The highs, on the 
other hand, deny slightly more than they 
admit; their ET-A mean is 46. These group 
differences are produced almost entirely by the 
P-3 items, and thus hypothesis 8 is also con- 
firmed. The low group has, on the average, a 
positive ET-A score (significantly greater than 
50) on both the P-1 and P-3 items, and the 
drop of 11 from P-1 to P-3 is insignificant. 
The high group, which has a significant positive 
ET-A mean of 65 on the P-1 items, changes to 
a mean of only 30 on P-3 (significantly less 
than 50). The “P-3 minus P-1" differential for 
the highs is 35, a significant decrease in self- 
awareness. As predicted, moreover, the 
difference in ET-A mean between highs and 
lows is significant for the P-3 items, but not 
for P-1. 


Differences between the High and Low Groups
in the Incidence and Awareness of Inadequacy-
Passivity Tendencies

Hypothesis 9. The high and low groups will
not differ significantly in average number of
Inadequacy-Passivity responses. This hy- 
pothesis is a direct reflection of the general 
postulate that individuals standing at opposite 
extremes on the authoritarian-equalitarian 
continuum differ not in the amount of any 
generalized disposition (such as passivity, 
aggression, and the like), but in their manner 
of dealing with it (defense, awareness, cognitive 
and affective differentiation, etc.). 

Hypothesis 10. The “P-3 minus P-1" 
differential: for the lows, the number of IP 
scores will be roughly the same on the P-1 
and P-3 items; for the highs, on the other 
hand, IP tendencies will be projected and will 
accordingly be more readily expressed on P-3 
than on P-1 items. As a corollary of this, we 
predict that the difference between the low 
and high groups will be insignificant on P-1 
but significant on P-3 items. 

TABLE 3
DIFFERENCES BETWEEN HIGH AND LOW GROUPS IN THE INCIDENCE AND AWARENESS OF INADEQUACY-PASSIVITY TENDENCIES ON THE SCT

                      MEAN NO. OF IP RESPONSES              MEAN IP-AWARENESS SCORE (S%)
ITEM                  LOW GROUP   HIGH GROUP   DIFF.        LOW GROUP   HIGH GROUP   DIFF.
Total test             8.90       10.90        −2.00**      72          40           32*
P-1 items              4.36        4.80        −0.44**      77          58           19*
P-3 items              4.55        6.10        −1.55**      68          24           44*
Shift, P-1 to P-3      0.19**      1.30**      −1.11        −9**        −34*         25

Note. Significance or lack of significance noted only in cells that bear on a hypothesis.
* Significant at .05 level or better (see footnote 5); in the case of the IP-Awareness scores, significantly greater or less than 50 (%).
** Not significant.


The relevant findings are contained in Table
3. On the total test the highs give an average
of 10.9 IP responses, as compared with 8.9 for
the lows. The difference does not attain
statistical significance and hypothesis 9 is
thus supported: the two groups do not differ
in frequency of IP response. As for hypothesis
10, the results are in the expected direction
but do not achieve the required significance. 
In accordance with the prediction, the lows 
give almost exactly the same number of IP 
responses on the P-1 and P-3 items. However, 
although the highs have more IP responses 
on P-3 than on P-1, the differential of 1.3 is 
not significantly greater than zero. To sum- 
marize, the highs and lows give roughly the 
same number of IP responses on the total test 
and on the two halves separately, and the 
“P-3 minus P-1” differential is negligible,
though positive, for both groups. 

We turn now to group differences in the 
relative awareness of Inadequacy-Passivity 
tendencies. It is assumed that these tendencies 
are as prone to denial as are those expressed 
in the ET responses. We have accordingly 
utilized an IP-Awareness (IP-A) score analo- 
gous to the ET-A score involved in hypotheses 
7 and 8 above, and the procedure used here 
parallels that used in the former case. A
person's IP-A score is simply the percentage 
of IP responses marked S. It indicates the 
degree of his self-awareness with regard to 
these tendencies. 

Hypothesis 11. The low group will show a
significantly greater awareness of IP tendencies 
than will the high group on the SCT as a 
whole. Again, it is expected that the mean 
IP-A score of the lows will be significantly 
greater than 50, whereas for the highs it will 
be at or below 50. 

Hypothesis 12. (a) The low group will show 
greater self-awareness with regard to IP 
tendencies (that is, have a greater IP-A mean) 
than the high group on both the P-1 and P-3 
items. The group difference may not be 
significant on P-1 but it will be on P-3. (b) The
IP-A score will drop from P-1 to P-3 for both
low and high groups, but this differential will 
be significant only in the case of the highs. In 
other words, the highs more than the lows 
will use the P-3 items for the projection of 
feelings of inadequacy and passivity which 
they cannot admit in themselves. 

The findings, presented in Table 3, support 
hypotheses 11 and 12. For the test as a whole, 
the low-authoritarian group has an IP-A mean 
of 72 which is significantly greater than 50. 
The mean for the highs, on the other hand, is 
40, or clearly on the "denial" side, and sig- 
nificantly less than the low group mean. Thus, 
although the two groups do not differ greatly
in the strength of their IP tendencies (as
measured by the IP score), the lows are
relatively self-insightful, the highs more
projective and self-deceived concerning these
tendencies. Once more, the high group shows 
a significant drop of 34 points in IP-A score 
from P-1 to P-3 items, while the low group 
shows a negligible drop of 9 points. The 
difference in IP-A mean between the high and 
low groups is, as predicted, much greater on
P-3 than on P-1 (44 vs. 19), but even the
latter difference is statistically significant.
That is, the highs engage in greater denial of
the self-reference of their IP responses than
do the lows, even on the first-person items.


Differences between the High and Low Groups
in the Incidence and Awareness of Masculine
Ideal (MI) Tendencies

It will be remembered that our definition of
Masculine Ideal is a broad one; it includes
not merely the conception of the powerful,
ambitious, rugged man, but also all forms of
personal effectiveness in sexual, athletic, work,
and other situations. (It would be desirable to
study variations in the types of MI repre- 
sented, but the brevity of the responses made 
this aim too difficult in the present research.) 
Consequently, the MI responses are likely 
to be highly valued by our subjects, in marked 
contrast to the responses scored ET and IP. 
This assumption leads us to the following 
hypotheses: 

Hypothesis 13. The high and low groups
will not differ significantly in average number 
of MI responses. Again, as in hypothesis 9, 
we propose that our two groups do not differ 
in the amount of any grossly defined disposi- 
tion, although they will differ in the forms 
taken by this disposition at various levels of 
personality. 

Hypothesis 14. The “P-3 minus P-1”
differential: since the MI tendencies will 
ordinarily be ego-syntonic and valued, they 
are more likely to appear in the first-person 
than third-person items. If we assume that 
the highs have a greater tendency than the 
lows to arrogate positive traits to themselves, 
then the differential should be greater for 
highs than for lows; or, to put it differently, 
the low group will show only a slight drop in 
mean MI score from P-1 to P-3, whereas the 
high group will show a larger and statistically 
significant drop. Moreover, on the P-1 items
the MI means of the two groups will be sub- 
stantially the same, but on P-3 the high group 
mean will be significantly below the low 
group mean. 

The relevant data are presented in Table 4. 
Hypothesis 13 is confirmed; the low and high 
groups do not differ significantly on the test 
as a whole, though it may be noted that the
lows exhibit a slightly greater number of
Masculine Ideal responses. Hypothesis 14,
however, is only partly supported. It was 
predicted that the high group would show a 
significant drop from P-1 to P-3 and that the 
low group would not. Actually, both groups
show a significant drop, although that for the 
highs is slightly greater. In short, the P-3 items 
"pull" fewer MI responses than the P-1 items, 
regardless of the degree of authoritarianism of 
the respondents. And, contrary to expecta- 
tions, the P-3 means of the two groups are not 
significantly different. 

Hypothesis 15. Awareness or denial of 
Masculine Ideal tendencies: since these 
tendencies are usually ego-accepted and even 
valued, there would seem to be little reason 
for their denial. We predict, accordingly, 
that the MI-Awareness scores (which were 
derived in the same way as the ET-A and 
IP-A scores described earlier) of the two 
groups will be substantially the same, and 
that both will be significantly greater than 50. 

Hypothesis 16. Since there appears to be no 
necessity for self-defense against MI ten- 
dencies, we expect that, for both low and high 
groups, the MI-A means on P-3 items will be 
similar to those on P-1 items. That is, we 
predict neither a P-3 vs. P-1 differential in 
highs or lows, nor a high vs. low differential 
on P-3. 

TABLE 4
DIFFERENCES BETWEEN HIGH AND LOW GROUPS IN THE INCIDENCE AND AWARENESS OF MASCULINE IDEAL TENDENCIES ON THE SCT

                      MEAN NO. OF MI RESPONSES              MEAN MI-AWARENESS SCORE (S%)
ITEM                  LOW GROUP   HIGH GROUP   DIFF.        LOW GROUP   HIGH GROUP   DIFF.
Total test             8.55        7.70         0.85**      79*         70*          9**
P-1 items              5.18        5.00         0.18**      75*         72*          3**
P-3 items              3.36        2.70         0.66**      84*         67*          17*
Shift, P-1 to P-3     −1.82*      −2.30*        0.48        +9**        −5**         +14

Note. Significance or lack of significance noted only in cells that bear on a hypothesis.
* Significant at .05 level or better (see footnote 5); in the case of the MI-Awareness scores, significantly greater or less than 50 (%).
** Not significant.


The data in Table 4 tend for the most part
to support hypotheses 15 and 16, but they
raise some additional questions. As predicted
in hypothesis 15, the MI-A means of the two
groups are significantly greater than 50 and
are not significantly different from each other.
However, the highs have a somewhat lower
mean and thus tend to regard proportionately
more of their MI responses as not character-
izing themselves (NS). We shall return to this
observation after considering the evidence 
regarding hypothesis 16. In accordance with 
this hypothesis, neither the high nor the low 
group exhibits a significant shift in MI-A score 
between P-1 and P-3 items. The highs show 
the expected slight decrease of 5 points (from 
72 to 67). The shift of 9 points for the lows is 
also statistically insignificant, but strangely 
enough it is an increase (from 75 on P-1 to 
84 on P-3). 
The highs and lows have almost identical 
MI-A means on the P-1 items, but on P-3 
the low group mean of 84 is significantly greater 
than the high group mean of 67. It would 
appear, therefore, that the highs find it some- 
what difficult to see in themselves the tenden- 
cies that they attribute to others, even when 
these tendencies are in accord with their personal 
ideals. This finding, seen in relation to those 
on the Ego-Threatening and Inadequacy- 
 Passivity responses, suggests that the following 
two factors are operating to reduce the in- 
cidence of S responses by highs on third-person 
items: (a) Self-deceptiveness concerning the 
existence, in themselves, of threatening or 
disvalued tendencies which are attributed to 
others. This is shown by the fact that their 
Awareness scores (S%) for ET responses and 
IP responses are very small on third-person 
items (30% and 24% respectively). (b) A
more general tendency to maintain rigid limits
between the concept of oneself and of others,
an inability to identify with others, regardless
of the disposition involved. Thus, although 
their P-3 Awareness score is much greater on 
MI responses than on ET or IP, it is still less 
than the corresponding value for the lows. 
This point will be discussed again in the 
concluding section. 


Discussion 


The hypotheses guiding the present study 
have for the most part been confirmed. Twelve 
of the 16 hypotheses are unequivocally sup- 
ported in the sense that they are upheld 
statistically at the initially stated .05 level of 
confidence. The remaining four (numbers 6, 
10, 14, and 16), each of which involves several 
predictions, are only partially supported. 
However, in no case have we obtained a 
statistically significant result in the direction 
opposite to the one predicted. There is some 


» 


R. J. Dorris, D. J. LÉviNsoN, AND E. HANFMANN 


differentiation among the confirmed proposi- 
tions, some of the predicted differences being 
much larger than others. As our hypotheses 
were made by interrelating the authoritarian 
personality theory with the theory of the 
Sentence Completion Test, we shall discuss the 
implications of our findings for each of these 
two sets of assumptions. 

From the point of view of the authoritarian 
personality theory, our results serve to confirm 
a number of existing ideas concerning au- 
thoritarian and equalitarian personalities. The 
high (authoritarian) group demonstrated their 
projectivity and anti-intraception by utilizing 
the third-person items for the purpose of 
denial of personal tendencies. The highs gave 
almost twice as many NS (denial) responses as 
the lows on P-3; again, the P₁ − P₃ differential
in NS response was appreciable for the highs, 
negligible for the lows. As was predicted, the 
two groups did not differ significantly in the 
total number of Ego-Threatening, Inade- 
quacy-Passivity, and Masculine Ideal re- 
sponses; and, contrary to predictions, they 
did not differ in responses in these categories 
given to the P-3 items alone. However, they 
did differ greatly in their readiness to admit 
the self-relevance of these responses. Spe- 
cifically, the highs are inclined to deny that 
they themselves have the ET and IP tenden- 
cies they ascribe to the test characters, 
particularly on the P-3 items. The lows, how- 
ever, admit the self-relevance of over 70 per 
cent of their ET and IP responses. Moreover, 
the lows show almost equal self-awareness on 
P-3 and P-1 items, whereas the highs exhibit
a significant decrease in self-awareness from 
P-1 to P-3. 

A comparison of the IP and the MI data 
on the two groups is instructive. The lows 
give about an equal number of IP and MI 
responses (8.9 and 8.6, respectively). The 
highs, however, give 10.9 IP responses and 
only 7.7 MI responses. Again, the low group 
has almost as great an Awareness score on its 
IP responses as on its more valued MI re- 
sponses (72 and 79, respectively). The highs, 
on the other hand, show great selectivity in 
their self-awareness; they consider as true of 
themselves 70 per cent of their Masculine 
Ideal responses but only 40 per cent of their 
Inadequacy-Passivity responses. The dis- 
crepancy is even more marked on the P-3 


. 
AUTHORITARIAN PERSONALITY BY SENTENCE COMPLETION TECHNIQUE , 


items, on which the highs have an MI-Aware- 


ness mean of 67 and an IP-Awareness mean 
of 24. One might ask whether the highs are not 
“in fact less passive and inadequate than the 


i 


‘lows and are merely judging themselves 
realistically. This interpretation, however, is 
contradicted not merely by the findings of 
previous research (1) but also by the fact that 
in the present study the highs gave no fewer 
IP responses and no more MI responses than 
did the lows; in fact the differences, although 
not significant, went in the opposite direction. 

From the point of view of the SCT theory, 
the main purpose of the study was to test the 
hypotheses that an added device of a self- 
reference inquiry can help differentiate between 
content coming from different levels of con-
sciousness, and that, when used in inter-
individual comparisons, it can yield a measure
of insight. The second purpose was to investi- 
gate possible differences in function between 
the first- and third-person items. It is often 
assumed that the third-person items elicit 
more unaccepted, repressed tendencies than 
do the first-person items. Thus, using the 
technique of matched (P₁ − P₃) pairs, we
might expect the P-3 items to differ from the 
P-1 items in (a) eliciting more unaccepted 
content (ET and IP), and (b) evoking more 
frequent denial of self-reference. All of these 
relationships, which are contingent on limited 
self-awareness, should definitely hold for the 
authoritarian group and, to a much lesser 
extent, for the equalitarian. Thus, differences 
between these two groups provide an im- 
portant validating criterion for our hypotheses 
concerning the SCT.

The results confirm some of these hypotheses 
to a much greater extent than others. The 
self-reference technique definitely stands up 
under the test. We find, as did earlier studies, 
that, for the total sample, denial of self- 
reference is directed at unacceptable material 
(ET, IP) much more frequently than at acceptable
material (MI). This differentiation is
minimal for the lows but it is striking for the
highs, who indulge much more than the lows 
in denial of threatening items. This finding
lends support to the use of self-reference
percentage (S %) as a measure of insight.
The hypotheses concerning the differences in
first- and third-person completions did not stand up
equally well. For the total group




the differences in the incidence of the several
content variables in P-1 and P-3, though they
go in the expected direction, are either not
significant (IP), or else very slightly so (ET,
MI). And, what is more important, for all three
content variables the P1 - P3 differentials are
not significantly greater in the high group than 
in the low, as would be expected if the differ- 
ential use of P-3 for expression of unaccepted 
content (ET, IP) were a function of pro- 
jectivity and of lack of insight.* 

As we find no appreciable difference in the 
content of P-1 and P-3, we have no reason to 
expect any great differences between them in 
the amount of self-reference either. That is, 
our prediction that there would be more 
frequent denial of self-reference on P-3 was 
based on the assumption that a greater amount 
of threatening content would be projected onto 
P-3 than P-1, and that the denial of threaten- 
ing content would possibly spread to more 
neutral P-3 responses. Yet, although the 
stipulated condition concerning content was 
not fulfilled, the expected differences in 
S per cent are present, and even quite pro- 
nounced. The study confirms the previous 
findings in showing that self-reference is 
denied in general much more often for P-3 
than for P-1 completions. Moreover, this 
difference is much larger for the highs than 
for the lows (Table 1). The highs, in fact, treat 
the same negative content (ET, IP) differ- 
entially, denying it in P-3 much more fre- 
quently than in P-1; in the lows this tendency 
toward differential awareness of the same 
negative content in P-1 and P-3 is absent or 
slight (Tables 2, 3). 

These results make us question either our 
findings concerning the similarity of content 
of P-1 and P-3, or the assumption that the 
threateningness of content is the main reason 
for denial of self-reference. With regard to the 
first point, the determination of content may 
have been inadequate because of our method 


6 Getzels (2) has found differences of this nature 
between highs and lows by comparing responses to 
third-person items with responses to a direct un- 
disguised questionnaire. More socially unacceptable 
responses were given on the SCT than on the ques- 
tionnaire, and this disparity was greater for the highs 
than for the lows. The deviation of our results from 
Getzels’ demonstrates the difference between the first- 
person items of the SCT and a direct personal question 
which evokes the S’s formulated knowledge and which 
he often tries to answer in the most acceptable way. 




of scoring: dichotomous categories (such as 
presence or absence of ET) do not take account 
of finer differences of degree. Perhaps an even 
more serious limitation is the impossibility of 
taking into account the subjects’ own in- 
dividual frames of reference: what we assume 
to be threatening or nonthreatening for the 
majority of people in our culture does not 
necessarily have these connotations for each 
individual subject. 

The problem of differentiation of content 
of the first- and third-person items requires 
further detailed study. Accepting our findings 
for the time being at their face value, we must 
conclude that there are factors in addition to 
that of the greater threateningness of projected 
content which cause our Ss—particularly the 
authoritarians—to deny the personal relevance 
of the third-person completions. Of the several 
explanations that present themselves, we 
would like to discuss one that has the widest 
bearing. The P-1 and P-3 items might be 
viewed as representatives of two of the basic 
categories that organize our cognition: myself 
and the world of others. We know that there 
are large individual differences in the amount 
of experienced separation, or of relatedness of 
self to the world, and it is possible that these 
differences find reflection in the treatment 
accorded the P-3 items in the self-reference 
inquiry. Once a subject with a rigidly delimited 
self-concept has placed a certain content into a
third-person item, he has assigned it to the 
“not me” and he will tend to keep it there, 
regardless of how he evaluates it. This would 
explain why the highs, but not the lows, show 
a drop in self-reference from P-1 to P-3 in 




R. J. Dorris, D. J. Levinson, AND E. HANFMANN 


relation not only to threatening but also to 
positive content (Table 4).

If this interpretation is correct, the com- 
bined techniques of the self-reference inquiry 
and of matched first- and third-person items 
provide us not with two measures of the same 
general variable of insight or self-awareness, 
but with measures of two distinct but related 
variables. By considering the differential 
awareness for positive and negative content, 
we obtain a measure of projectivity or defensive
unawareness. The differential awareness
for the P-1 and P-3 items, when it


cannot be accounted for by difference of 
content, seems to represent cognitive ri- 
gidity, which perhaps is related more closely 
to anti-intraception than to projectivity. 
Further experimentation is needed to verify 
these conclusions. 


REFERENCES 


1. ADORNO, T. W., FRENKEL-BRUNSWIK, ELSE,
LEVINSON, D. J., & SANFORD, R. N. The authoritarian
personality. New York: Harper, 1950.

2. GETZELS, J. W. The assessment of personality and
prejudice by the method of paired direct and
projective questionnaires. Unpublished doctor's
dissertation, Department of Social Relations,
Harvard Univer., 1951.

3. HANFMANN, EUGENIA, & GETZELS, J. W. Studies of
the Sentence Completion Test. I. J. proj. Tech.,
1953, 17, 280-294.

4. JOHNSON, P. O. Statistical methods in research. New
York: Prentice-Hall, 1949.

5. LEVINSON, D. J., & HUFFMAN, P. Studies in personality
and ideology: theory and measurement of
traditional family ideology. Mimeographed
report, Harvard Univer., 1952.

6. MCNEMAR, Q. Psychological statistics. New York:
Wiley, 1949.


Received December 17, 1952.


ABSTRACT BEHAVIOR AMONG THE TEPEHUAN¹
JAMES McCONNELL 


University of Texas 


THE concept of the "primitive" man is
one quite often encountered in psy- 
chological literature. This is particularly 
the case when one investigates studies on 
regression and on abstract and concrete levels 
of behavior. One discovers such writers as 
Storch (7) and Lévy-Bruhl (4) contending
that the thought patterns of l'homme sauvage
are qualitatively different from those of 
"modern man," and that schizophrenic thought 
patterns are throwbacks to an archaic mode 
of thinking. Hall and others came to the 
conclusion that “ontogeny recapitulates phy- 
logeny" for the mental processes—a belief 
which puts “civilized” children and “primitive” 
adults on the same intellectual level of de- 
velopment. 
While very few writers are willing to stretch 
a phylogenetic theory of regression to this 
extent, many authors do contend that there is 
a basic structural difference between the 
thought processes of modern man and of his 
uncivilized brethren. Goldstein and Scheerer 
(2) have related regression to the loss of an 
abstract level of behavior and a return to a 
more primitive, concrete level. They infer
that the abstract approach is a recent achieve- 
ment from the evolutionary point of view. 
Hence they seem to expect that contemporary
“savages” would perform at a highly concrete
level should they be administered the Goldstein-
Scheerer tests of abstract behavior.

The present study was undertaken to
discover if members of an essentially primitive,
contemporary society do function at a concrete
level of behavior.


The Wechsler-Bellevue version of the Kohs Block
Design test (with certain modifications in presentation
suggested by Goldstein and Scheerer) was presented to
24 members of the Southern Tepehuan tribe living in
and around the village of Xoconostle in the State of
Durango, Mexico. The sample comprised approximately
50 per cent of the adults living in this general
vicinity and approximately 5 per cent of the adult
population of the entire tribe. The probable age


¹ This study was undertaken as a part of an anthropological
field trip sponsored by the University of
Southern Illinois. The writer is indebted to Dr. J.
Charles Kelley for his generous advice and assistance.


range was from 17 to 50. Approximately half of the 
subjects (Ss) volunteered their services. The other half 
were recruited from the village as a whole. All Ss were 
paid for taking the test. 

The experimenter (E) lived with the tribe for some 
two weeks, and every possible effort was made to 
insure uniformly high motivation in the administration 
of the test. 

Each S’s performance on each of the seven designs 
and the two samples was scored on a simple pass-fail 
criterion. If the S could reproduce the design from the 
printed card alone no matter how many demonstra- 
tions he required, he was given credit for the item. If 
he could not reproduce the design from the printed 
card alone, he was given no credit. This is in line with 
Goldstein and Scheerer's contention that there is a 
pronounced line of demarcation between the abstract 
and the concrete levels which does not represent a 
gradual ascent from more simple to more complex 
mental sets. 

The Ss were rated on the following five-point scale 
of concreteness-abstractness: 

I. Unable to grasp the task at all, even with help. 

II. Able to get the easier problems (at least the 
two samples and the first two designs) with help, but 
unable to shift to the harder designs even with as- 
sistance. 

III. Able to get at least five of the designs with 
demonstrations but only when allowed to reproduce 
the design first from a model and then from the printed 
card with the model not present. 

IV. Able to get all of the designs correctly with 
demonstrations on most, but with models used only 
on the first two or three designs. 

V. Able to get all of the designs with demonstra- 
tions and models used only on the two sample designs, 
the printed designs being sufficient for the rest of the 
test. No further assistance of any kind given after the 
two samples. 


A summary of the results is presented in 
Table 1. It will be noted that none of the 24 Ss 
fell into the most concrete category and only 
four of the 24 were classified as being more 
concrete than abstract in their performance on 
the test. It is also interesting to note that at 
least four of the Ss, had they been given the 
test under the conditions specified in the 
Wechsler manual, would have received addi- 
tional credit for rapid working of the problems. 
Only the four Ss who failed to master the test 
evidenced any of the 11 typical failure re- 
sponses which Goldstein and Scheerer list as 
characteristic of the concrete approach to this 


test (2).




TABLE 1
RANK SCORES OBTAINED BY A GROUP OF TEPEHUAN
INDIANS ON A TEST OF ABSTRACT
AND CONCRETE BEHAVIOR


RANK     MEN   WOMEN   TOTAL
I          0       0       0
II         2       2       4
III        3       4       7
IV         7       5      12
V          1       0       1
Total     13      11      24


It must also be pointed out that the test was
given under less than ideal circumstances. 
Anastasi has pointed out some of the main 
objections to the use of this type of test under 
field conditions (1). 

One possible objection to this study is that 
the Southern Tepehuan do not represent a 
really primitive people. Excellent material on 
the state of primitiveness of the tribe can be 

found in two papers by J. Alden Mason, the 
only ethnographer to work with the Tepehuan 
since the 1890's (5, 6). Under any conditions 

the Tepehuan must be considered a very
backward tribe, quite isolated from contemporary
society.

There will be, of course, those writers who
might insist as Werner (8) does that the 
“primitive” man they have in mind passed 
from the scene many centuries ago. Even 
Werner, however, admits that “contemporary 
primitives" should be much closer to an 
archaic level of thinking than modern man. 
Hunt and Cofer state, “One should not expect 
primitive peoples to lack the basic capacity to 
acquire our adult forms of organization. They 
lack them because these forms did not exist 
in the cultures that nurtured them. Moreover, 
they would have great difficulty acquiring 
ours at adulthood because of the habit inter- 
ference between their own and ours” (3, p. 
1018). And yet the Tepehuan, as shown in this 




study, not only can perform at an abstract
level, but also function there almost as well
as the adult population on which Wechsler
standardized his intelligence test.

While it is dangerous to generalize too far 
from this one study, it becomes evident that 
a revision in our concept of primitive man’s 
mode of thinking may be indicated. Further
work among other isolated peoples is needed
before a true picture of the capabilities of
primitive tribes can be arrived at. The confusion
of thinking in this area in the past has
probably resulted from the uncritical ac- 
ceptance of nonscientific descriptions of the 
behavior of primitive societies. It is far too 
easy a thing to read simple-mindedness into 
simple behavior. Merely because some primitive
cultures may not demand complex, abstract
modes of behavior from their members
does not necessarily mean that the primitive 
may not be able to function at a higher level 
whenever called upon to do so. 


REFERENCES 


1. ANASTASI, ANNE. Differential psychology. New York:
Macmillan, 1937.

2. GOLDSTEIN, K., & SCHEERER, M. Abstract and
concrete behavior: an experimental study with
special tests. Psychol. Monogr., 1941, 53, No. 2
(Whole No. 239).

3. HUNT, J. McV., & COFER, C. Psychological deficit.
In J. McV. Hunt (Ed.), Personality and the
behavior disorders. New York: Ronald, 1944. Pp.
971-1032.

4. LÉVY-BRUHL, L. Primitive mentality. New York:
Macmillan, 1923.

5. MASON, J. A. The Tepehuan and the other aborigines
of the Mexican Sierra Madre Occidental. América
Indígena, Mexico, D. F., 1948, 8, No. 4.

6. MASON, J. A. Notes and observations on the
Tepehuan. América Indígena, Mexico, D. F.,
1952, 12, No. 1.

7. STORCH, A. The primitive archaic forms of inner
experiences and thought in schizophrenia. Nerv.
ment. Dis. Monogr., 1924, No. 36.

8. WERNER, H. Comparative psychology of mental
development. New York: Harper, 1940.


Received December 19, 1952. 




A COMPARISON OF SOME OBJECTIVE MEASURES OF EXPECTANCY 


JULIAN B. ROTTER, BERNARD J. FITZGERALD, AND JAMES N. JOYCE
The Ohio State University


CURRENT attempts to approach personality
theory from an experimental
point of view appear to be classifiable 

into two major categories. One set of attempts 
stresses the application of reinforcement theory, 
for example, the work of Dollard and Miller 
(5), Mowrer (18), and Brown and Farber (1). 
Another set of approaches, including Tolman 
(25), Brunswik (2), Krech (15), and Postman 
(20), stresses the construct of expectancy or hy- 
potheses. 

For the past several years Rotter and his 
students have been working in an attempt to 
develop a “social learning theory” which com- 
bines both the constructs of expectancy and 
reinforcement. The concept of reinforcement, 
however, does not stem from drive-reduction 
theory but perhaps might better be described 
as utilizing an “empirical law of effect.” The 
purpose of the present paper is to discuss only 
some measures of expectancy. A number of
studies utilizing this construct in testing some
of the hypotheses of this theory and dealing
with how expectancies are built up, changed,
generalized, and affected by situational factors
should be appearing in the literature in the
near future. 

Expectancy, as a useful and pragmatic construct,
has been employed by many theoreticians.
Hobhouse (9) employed an expectancy
principle to explain modification of behavior
in his confirmation-inhibition theory as long
ago as 1901. Pavlov (19), Zener (26), and more
recently Mowrer (18) have discussed expectancy
as a conditioned response or heightened
anticipatory tension. Tolman (25) has described
the occurrence of learning as the “building
up of an expectancy that a given sign in the
environment will, via a behavior route, lead to
a given significate.” MacCorquodale and
Meehl (17) have recently attempted a prelimi- 
nary, systematic formulation of an expectancy 
learning theory. 

Brunswik (2) and Lewin (16) have dealt with
expectancy as a probability phenomenon.
Brunswik emphasizes the study of the objective
probability of events as the organism adapts 
to them. Lewin, on the other hand, stresses the 




subjective probability and not the actuarial 
probability of an event that constitutes the 
relevant data of an expectancy. The need for 
consideration of the relationship between ob- 
jective and subjective probability has been 
recognized by Brunswik who has commented 
that “one of the comparatively neglected tasks 
of molar environmental psychology is to find 
out the extent to which environmental hier- 
archies of probabilities . . . do find a counter- 
part in similar hierarchies of evaluation by the 
organism" (2, p. 191). 

Although there may be some question of the 
necessity for an expectancy construct in the 
prediction of animal behavior, to the authors 
it seems extremely difficult even to attempt an 
explanation of human learning in complex 
social situations without some construct which 
deals with the effect on behavior of the antici- 
pation of future reinforcements. Hull (10), in 
commenting upon the difficulty in the use of 
the construct of expectancy because of lack of 
objective operations for its measurement, has 
said, “when we attempt to verify an hypothesis 
in which the concept of expectancy appears, we 
cannot tell how much expectancy to expect; 
neither do we know the magnitude of the reac- 
tion which the expectancy is expected to me- 
diate.” Hull (11) has, however, recently 
attempted to explain apparent anticipatory 
behavior of rats with his concept of fractional 
antedating goal reactions. Jenkins and Stanley 
(13) also have criticized the use of expectancy 
in the partial reinforcement situation on the 
basis of its lack of rigorous definition.

Among the various measures of expectancy 
the most common, especially in level of aspira- 
tion studies, has been to ask the subject to 
make a statement about his future perform- 
ance. Hausmann (8), Preston and Bayton (21), 
Rotter (22), Festinger (6), and Irwin and 
Mintzer (12) have found that simple requests 
for verbally stated expected scores are con- 
founded with aspirations, defensive behavior, 
or other motivational variables and they have, 
in various ways, described techniques for obtaining
statements which tend to be more
purely statements of actual expectation.




Typical methods include elaborate instruc- 
tions emphasizing accuracy, penalizing over- or 
underbidding, and asking for separate state- 
ments of hopes or minimal goals. In these 
various studies, although the authors have felt 
that they may be coming closer to an actual 
statement of subjective expectation, there has 
been no checking of the verbally expressed 
expectancies against objectively determined 
probability, nor has it been determined 
whether or not the various techniques provide 
comparable results. 

Recently Castaneda (3), distrusting the 
verbal statement of expectancy, has developed 
a method of assessing an individual's expecta- 
tions by the amount they would bet on any 
specific occurrence. He has applied his method 
to the betting of marbles by children and of 
pennies by college students. 

The present study is concerned with com- 
paring four different methods of obtaining 
expectation with each other and also comparing 
the results with an objectively determined 
probability. 

One of these methods is a betting or beha- 
vioral technique. It should be clear that, in 
referring to one measure as "behavioral," we 
are only making a distinction between verbal 
behavior and nonverbal behavior. 

Before describing the present study, perhaps
some brief indication of the use of the construct
of expectancy in social learning theory
might be useful. This theory expresses a basic
formula for behavior which might be stated as 
follows (23): The potential for a given beha- 
vior to occur in any specific situation, in rela- 
tion to a given reinforcement, is a function of 
the expectancy of the occurrence of that rein- 
forcement following the behavior in the situa- 
tion, and the value of that reinforcement. This 
may be stated in a more formal relationship as 
given below: 


\[ \mathrm{B.P.}_{x,\,s_1,\,R_a} = f\left(E_{x,\,R_a,\,s_1} \;\&\; \mathrm{R.V.}_a\right) \]

This may be read as follows: The potential 
for behavior x to occur in situation 1 in relation 
to reinforcement a is a function of the expect- 
ancy of the occurrence of reinforcement a fol- 
lowing behavior x in situation 1, and the value 
of reinforcement a. 

The full theoretical formulation includes a 
number of additional constructs which relate 
to the two basic constructs of the formula— 
expectancy and reinforcement value. Predic- 
tion of behavior requires that at least two 
behavior potentials be calculated so that pre- 




diction may be stated in terms of the relative 
potentiality of one behavior or another occur- 
ring. It is assumed that this is a basic formula 
for complex behaviors in a social situation.
Expectancy is defined in social learning theory
as a subjective probability held by the individual
that a particular reinforcement will occur
as a function of, or in relation to, a specific
behavior in a given situation or situations (24).

Behavior potential may be defined as the
potentiality of any given behavior occurring
in any given situation or situations as calculated
in relation to any single reinforcement or
set of reinforcements. Reinforcement value
may be defined as the degree of preference for
any reinforcement to occur if the possibilities 
of their occurring were all equal. It is presumed 
that expectancy and reinforcement value are 
normally independent although there are spe- 
cial conditions under which one affects the 
other. In this formula the ampersand is used 
to express the nature of the mathematical rela- 
tionship since the precise mathematical rela- 
tionship is not known. It is presumed, however, 
that the relationship will be a multiplicative
one. The formula does, however, indicate that 
the higher the expectancy or the reinforcement 
value the higher will be the value of the beha- 
vior potential. Reinforcement values may be 
negative or positive. 
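
As a rough numerical illustration of the formula, the following Python sketch assumes the multiplicative form that the text presumes but does not establish. The function names `behavior_potential` and `preferred_behavior`, the treatment of expectancy as a value between 0 and 1, and all example values are hypothetical and are not taken from the study.

```python
def behavior_potential(expectancy: float, reinforcement_value: float) -> float:
    """Assumed multiplicative combination of expectancy and reinforcement value."""
    if not 0.0 <= expectancy <= 1.0:
        raise ValueError("expectancy is treated here as a probability in [0, 1]")
    return expectancy * reinforcement_value


def preferred_behavior(options: dict[str, tuple[float, float]]) -> str:
    """Return the alternative with the highest behavior potential."""
    return max(options, key=lambda name: behavior_potential(*options[name]))


# Hypothetical alternatives: (expectancy, reinforcement value).
# Reinforcement values may be negative or positive.
options = {
    "ask for help": (0.8, 2.0),
    "work alone": (0.5, 3.0),
    "give up": (0.9, -1.0),
}

potentials = {name: behavior_potential(e, rv) for name, (e, rv) in options.items()}
print(potentials)
print(preferred_behavior(options))
```

Because prediction is stated in terms of relative potentials, the sketch compares several alternatives instead of interpreting any single value in isolation.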

It should be noted that expectancy so defined
is not an actuarial probability but is a
subjectively held probability. The objectively
determined or actuarial probability will be
referred to as theoretical expectancy. It is not
presumed that subjective expectancy in any
given individual should be the same as theoretical
expectancy for a group of subjects.
However, the degree of relationship between
theoretical expectancy and the mean stated
expectancy would be a measure of the degree
to which expectancy could be predicted from
objectively describable past experience immediately
preceding a specific situation.


PROCEDURE 


Four groups of subjects (Ss) were used, each group
consisting of 30 undergraduate college students. Sex
proportions were approximately equal in the four
groups. The Ss were trained on a modified form of the
Rotter-Jensen Group Level of Aspiration Test (14).
This test consists of seven separate symbol substitution
problems, each one quite complex and each one requiring
a different key. Previous research with this test
has indicated that scores on the seven tests remain
approximately the same. It was possible to control the
actual score obtained since the test is represented as a




time test and it is possible to stop S when he has reached 
the number of substitutions set for that trial. All Ss
obtained the same prearranged sequence of scores 
which were, in order, 22, 29, 32, 27, 30, and 28. It 
may be noted that these scores give a mean of 28, and 
28 appears once as a score at the end of the sequence. 
The work of Dean (4) and Good (7) has indicated that 
this would be experienced as a realistic score for most 
Ss. Following training, the four groups were provided 
with different instructions. 
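
The arithmetic behind the theoretical expectancy can be checked with a short Python sketch. The variable names are illustrative; the six scores are those reported above.

```python
from statistics import mean

# Prearranged training scores, in the order reported in the text.
training_scores = [22, 29, 32, 27, 30, 28]

# The objectively determined (theoretical) expectancy is the mean of these scores.
theoretical_expectancy = mean(training_scores)
print(theoretical_expectancy)

# The final training score coincides with the mean of the sequence.
print(training_scores[-1] == theoretical_expectancy)
```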

Group I was asked to indicate the score they ex- 
pected to make on the next trial with considerable 
emphasis being placed on accuracy and actual expecta- 
tion. Instructions used were comparable to those of 
Festinger (6). Following this, the Ss were asked to rate, 
on a ten-point scale, their probability of making that 
score. 

Group II was asked to rate, on a ten-point scale, 
their absolute probability of making six different pos- 
sible scores on the next trial. They were asked to rate 
their absolute probability of making the scores of 20, 
25, 28, 30, 32, and 35. 

Group III was asked to rate, on a ten-point scale, 
their probability of making at least 20 or more on the 
next trial and in the same manner for the remaining 
same scores.

Group IV, designated as the behavioral or betting 
technique group, was given sixty cents in six piles of ten 
pennies each and was asked to bet up to ten cents on 
making a score of at least 20 or more on the next trial 
and in the same manner for the remaining same scores. 

The operational measurements of expectancy for 
Groups III and IV are identical except that Group III 
stated a probability on a ten-point scale and Group IV 
bet up to ten cents on each of the same six scores. The 
betting technique is essentially the same as that used 
by Castaneda. 


RESULTS 


The results are shown in Figure 1. Along the 
abscissa are the six possible scores by which the 
probability ratings were made. Along the ordi- 
nate is the rating scale of probability used. It 
can be seen that for Group I (large dot), which
was asked to indicate the score they expected
to make on the next trial, there is a mean
expectancy of 28.2, only slightly above the
theoretical expectancy of 28. The probability
they state that they will make this score is
[illegible]. It may be seen that this probability for
the score of 28 intersects the curves for the
remaining groups at somewhat comparable
points.

Group II (broken line) was asked to rate
their absolute probability of making each of
six possible scores. These ratings result in a
curve with the score having the greatest probability
at the mean of the training trials and
essentially the same as in Group I, that is, a
score of 28. However, if we were to make a cumulative
curve of probabilities at 28, the scores
for this group would be well above the other
groups at 28. Essentially, it appears the

FIG. 1. MEAN EXPECTANCY SCORES FOR ALL GROUPS
(Abscissa: scores on Level of Aspiration Test; ordinate: expectancy ratings.)


subjects were not able to respond to the in- 
structions since the sum of the cumulative 
probabilities would be well over 100 per cent. 
However, it is clear that the score given as 
expected in Group I is the score of the highest 
probability in Group II. 
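
The point about cumulative probabilities exceeding 100 per cent can be made concrete with a small Python sketch. It assumes, purely for illustration, that ten-point ratings can be read as proportions (rating divided by ten) and that the six scores mark off bands of possible outcomes. The rating values below are hypothetical and are not the data of the study.

```python
scores = [20, 25, 28, 30, 32, 35]


def at_least_to_absolute(at_least):
    """Turn P(score >= s_i) into band probabilities P(s_i <= score < s_(i+1)).

    The last band is open-ended: P(score >= highest listed score).
    """
    bands = [a - b for a, b in zip(at_least, at_least[1:])]
    bands.append(at_least[-1])
    return bands


def is_coherent(absolute):
    """Mutually exclusive outcomes cannot total more than 1 (100 per cent)."""
    return all(p >= 0 for p in absolute) and sum(absolute) <= 1.0 + 1e-9


# Hypothetical "at least" ratings (Group III style), as proportions.
at_least = [0.95, 0.80, 0.50, 0.30, 0.15, 0.05]

# Hypothetical "absolute" ratings (Group II style), as proportions.
absolute_stated = [0.30, 0.50, 0.70, 0.50, 0.30, 0.10]

print(is_coherent(at_least_to_absolute(at_least)))
print(sum(absolute_stated), is_coherent(absolute_stated))
```

Under these assumptions, stated absolute ratings of this size cannot all be probabilities of distinct scores, which is one way to read the remark that the subjects were not able to respond to the instructions.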

Group III (solid line) was asked to rate their 
probability of making at least the six same 
scores and Group IV (dashes) bet up to ten 
cents on their chances of making at least the 
same scores. The curves for Groups III and IV 
were found to be significantly different at less 
than the 5% level of confidence by an analysis 
of variance technique. The mean ratings at 
scores 20, 25, and 28 were found by t test to be
significantly different at less than the 1% level 
of confidence. 

It could be noted from the graph that in 
every instance the curve for the betting gave 
higher mean scores. Apparently, for this group 
of subjects and reinforcements of this value, 
the behavioral technique results in higher ex- 
pectancy measures, the verbal technique being 
closer to the theoretical or actuarial expect- 
ancy. However, the curves for Groups III and 
IV correlated .99. This product-moment cor- 
relation is based on the mean expectancy rat- 
ings for the two groups on each of the six 
possible scores. 
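
A high product-moment correlation between two curves is compatible with a consistent difference in their level, since the coefficient is unaffected by a constant offset. The Python sketch below illustrates this with hypothetical mean ratings; the values are invented for the example and are not the data plotted in Figure 1.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Hypothetical mean ratings at scores 20, 25, 28, 30, 32, and 35.
verbal = [9.0, 7.5, 5.0, 3.5, 2.0, 1.0]
betting = [9.8, 8.6, 6.2, 4.3, 2.6, 1.4]

print(round(pearson_r(verbal, betting), 3))
print(all(b > v for b, v in zip(betting, verbal)))

# Shifting a curve by a constant leaves the correlation at 1.
shifted = [v + 1.0 for v in verbal]
print(round(pearson_r(verbal, shifted), 6))
```

With only six paired means, the coefficient describes the similarity in shape of the two group curves, not agreement between individual subjects.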

CONCLUSIONS 


1. Similarities of results in the four groups 
indicate that all four methods are tending to




measure the same thing. The mean of state- 
ments of a single expected score coincides with 
the score given the highest probability of occurring
on an absolute basis. However, scores
obtained by asking for absolute probability
apparently overestimate probability for any 
specific score. When Ss are asked to state prob- 
abilities of obtaining at least a given score, 
then the probabilities stated tend to be similar 
to those stated by Ss asked to give the most 
expected score. 

2. There appears to be a close relationship 
between previous training in a specific situation 
and expectations for future performance. The 
exact closeness of this relationship would have 
to be determined by further research where 
the groups have different levels of pretraining. 

3. The use of a technique referred to as
behavioral and of a verbal technique
resulted in significantly different
expectancy or probability ratings. The behavioral
or betting technique resulted in higher
mean expectancy ratings on all scores with 
significantly higher probabilities on the first 
three scores. Mean ratings for the two groups 
correlated .99, indicating that the two methods 
are measuring the same thing in a highly relia- 
ble fashion. 

It should be noted that these conclusions 
deal with mean expectancies. Not only are 
there great individual differences present but 
it is considered of primary importance from 
this theoretical viewpoint to account for these 
differences on the basis of theoretical principles 
not discussed in this paper.


REFERENCES 


1. BROWN, J. S., & FARBER, I. E. Emotions conceptualized
as intervening variables—with suggestions
toward a theory of frustration. Psychol.
Bull., 1951, 48, 465-495.

2. BRUNSWIK, E. The probability point of view. In
M. H. Marx (Ed.), Psychological theory. New
York: Macmillan, 1951.

3. CASTANEDA, A. A method for measuring expectancy
as conceived within Rotter’s social learning
theory of personality. Unpublished master’s
thesis, Ohio State Univer., 1951.

4. DEAN, S. J. The effects of a cultural standard on
minimal goal level and level of expectancy. Unpublished
master's thesis, Ohio State Univer.,

5. DOLLARD, J., & MILLER, N. E. Personality and
psychotherapy. New York: McGraw-Hill, 1950.

6. FESTINGER, L. Wish, expectation and group standards
as factors influencing level of aspiration.
J. abnorm. soc. Psychol., 1942, 37, 184-200.

7. GOOD, R. A. The potentiality for change of an expectancy
as a function of the amount of experience.
Unpublished doctor's dissertation, Ohio
State Univer., 1952.

8. HAUSMANN, M. F. A test to evaluate some personality
traits. J. gen. Psychol., 1933, 9, 170-189.

9. HOBHOUSE, L. T. Mind in evolution. New York:
Macmillan, 1901.

10. HULL, C. L. The problem of intervening variables
in molar behavior theory. Psychol. Rev., 1943,
50, 273-291.

11. HULL, C. L. A behavior system. New Haven: Yale
Univer. Press, 1952.

12. IRWIN, F. W., & MINTZER, M. G. Effect of differences
in instructions and motivation upon
measures of the level of aspiration. Amer. J.
Psychol., 1942, 55, 400-406.

13. JENKINS, W. O., & STANLEY, J. C., JR. Partial
reinforcement: a review and critique. Psychol.
Bull., 1950, 47, 193-234.

14. JENSEN, M. B., & ROTTER, J. B. The value of
thirteen psychological tests in officer candidate
screening. J. appl. Psychol., 1947, 31, 312-322.

15. KRECH, D. Dynamic systems, psychological fields
and hypothetical constructs. Psychol. Rev., 1950,
57, 283-290.

16. LEWIN, K. The nature of field theory. In M. H.
Marx (Ed.), Psychological theory. New York:
Macmillan, 1951.

17. MACCORQUODALE, K., & MEEHL, P. E. Preliminary
suggestions as to formalization of
expectancy theory. Psychol. Rev., 1953, 60,
55-63.

18. MOWRER, O. H. Preparatory set (expectancy)—a
determinant in motivation and learning. Psychol.
Rev., 1938, 45, 69-91.

19. PAVLOV, I. P. Conditioned reflexes. (Trans. by G. V.
Anrep) London: Oxford Univer. Press, 1927.
(Quoted in E. R. Hilgard & D. G. Marquis,
Conditioning and learning. New York: Appleton-
Century, 1940.)

20. POSTMAN, L. Toward a general theory of cognition.
In J. H. Rohrer & M. Sherif (Eds.), Social psychology
at the crossroads. New York: Harper,
1951.

21. PRESTON, M. G., & BAYTON, J. A. Differential
effect of a social variable upon three levels of
aspiration. J. exp. Psychol., 1941, 29, 351-369.

22. ROTTER, J. B. Level of aspiration. II. Development
and evaluation of a controlled method. J. exp.
Psychol., 1942, 31, 410-422.

23. ROTTER, J. B. Tentative formulation of some social
learning constructs. 4. A basic formula for behavior.
Mimeo., June, 1951.

24. ROTTER, J. B. Tentative formulation of some
social learning constructs. 5. The measurement
of some basic constructs: Behavior Potential,
Reinforcement Value, and Expectancy. Mimeo.,
June, 1951.

25. TOLMAN, E. C. Theories of learning. In F. A. Moss
(Ed.), Comparative psychology. New Yor! 
Prentice-Hall, 1934. mw 

ZENER, K. The significance of behavior accompany. 

ing conditioned salivary secretion for theo" 

of conditioned response. Amer. J. Ps) 

1937, 50, 384-403. 


Received June 22, 1953. Prior Publication. 




THE MOTIVATIONAL BASES OF ATTITUDE CHANGE¹


IRVING SARNOFF AND DANIEL KATZ
Student Health Service, University of Michigan


¹ This article [...] some of the work undertaken by the
Attitude Change [...], University of Michigan, a project
[...] by the Air University and monitored by the Human
Resources Research Institute.


The field of attitude change presents a
confused picture. On the one hand,
there is a multiplicity of specific findings,
although these are frequently contradictory in
nature; on the other hand, there is a dearth of
principles and generalizations on the basis of
which knowledge can grow and become organized
into something resembling a science. One
major reason is that so much of the research on
attitude change has been carried out without
regard to a theory of attitudes which would
take into account motivational processes in a
systematic manner. Studies of the effects of
information, of the influence of mass media, of
the role of contact with members of other
groups, have been made with scant attention
to the fact that change implies motivation to
change. Moreover, motivational factors which
have been taken into account have generally
been inferred after the fact and have frequently
dealt with the most obvious variables.

The point of departure for this paper, therefore,
is a more thorough consideration of the
motivational processes involved in attitude
change. Attempts should be made to study
these processes more directly by manipulating
the variables involved in motivational processes.
In essence this means that the advances
in psychodynamics in the field of personality
and clinical psychology cannot be ignored in a
study of social phenomena. The tendency has
been for different schools to dwell exclusively
upon a single type of motivational context and
to oversimplify problems in terms of a narrow
theory of motivation. Thus the Gestalt school
and the field theorists emphasize principles of
organization which affect the psychological
field: The trend is toward more inclusive and
more stable organization. The individual tries
to reconcile conflicting impressions, seeks to
know what the world is like and to make sense
of it. The rational man, once rejected by psychology,
is now resurrected. Thus Asch, in
contrasting two doctrines (Thesis I and
Thesis II), writes:


Thesis I assumes a process strictly different from the 
grasping of the character of a given situation, different 
from discovering relations of fitness or contradiction. 
It excludes the role of understanding, except insofar 
as the latter furthers the operation of the essentially un- 
reasonable factor of prestige. For Thesis II the main 
factor at work in this region of investigation is a change 
in the cognitive character of the given situation. In 
terms of Thesis I the essential factors responsible for 
social evaluation are emotional forces divorced from 
and overriding questions of actual merit. Thesis II 
asserts that emotional processes are as a rule under the 
direction of cognitive factors, and are controlled by the 
trend to find relevant relations (2, p. 275). 


The learning theorist, on the other hand, neg- 
lects this need for cognitive structure and uses 
instead a model of adaptive learning based 
upon reward and punishment principles. Doob, 
for example, gives this analysis: 


An ideally thorough analysis of an attitude requires 
knowledge (quantitative if possible) of the following 
factors, if its behavior and role in determining overt 
behavior in various situations are to be understood:

1. Goal response: the response pattern or patterns 
which the attitude anticipates and mediates and which 
determine its reinforcement or extinction. 

2. Perception: the drive orienting the individual to 
pay attention to the stimulus pattern evoking the 
attitude. 

3. Afferent-habit strength: the strength of the bond 
between the attitude and the evoking stimulus patterns, 
including the gradients of generalization and 
discrimination. 

4. Efferent-habit strength: the strength of the bond 
between the attitude and the evoked responses, in-
cluding overt ones.

5. Drive strength: the drive strength of the stimuli 
produced by the attitude. 

6. Interaction: the strength of the other attitudes, 
drives, etc., with which the attitude interacts to evoke 
overt behavior. 

7. Social significance: the evaluation in the society of 
the attitude and its direction (e.g., whether positive, 
negative, or neutral; favorable, unfavorable, or am- 
bivalent; friendly or unfriendly; desirable or unde- 
sirable; good or bad) (5, p. 145). 


Individuals assume attitudes as part of their 
adjustment to the world, and group norms be- 
come important since the individual seeks 
group acceptance. The psychoanalytically ori- 
ented clinician, on the other hand, gives atten- 
tion to the internal dynamics in which the 
individual's need to preserve his self-image and 
self-integrity becomes more important than 
external reward and punishment. Thus, Adorno,
Frenkel-Brunswik, Levinson, and Sanford
reach this conclusion in The Authoritarian
Personality:


The most crucial result of the present study, as it 
seems to the authors, is the demonstration of close 
correspondence in the type of approach and outlook a 
subject is likely to have in a great variety of areas, 
ranging from the most intimate features of family and 
sex adjustment through relationships to other people in 
general, to religion and to social and political phi- 
losophy. Thus a basically hierarchical, authoritarian, 
exploitive parent-child relationship is apt to carry 
over into a power-oriented, exploitively dependent atti- 
tude toward one’s sex partner and one’s God and may 
well culminate in a political philosophy and social out- 
look which has no room for anything but a desperate 
clinging to what appears to be strong and a disdainful 
rejection of whatever is relegated to the bottom. The 
inherent dramatization likewise extends from the 
parent-child dichotomy to the dichotomous conception 
of sex roles and of moral values, as well as to a di- 
chotomous handling of social relations as manifested 
especially in the formation of stereotypes and of in- 
group-outgroup cleavages. Conventionality, rigidity, 
repressive denial, and the ensuing break-through of 
one’s weakness, fear and dependency are but other 
aspects of the same fundamental personality pattern, 
and they can be observed in personal life as well as in 
attitudes toward religion and social issues (1, p. 971). 


It is our belief that these three types of
theories help us to conceptualize the different 
kinds of motivational phenomena which seem 
to be involved in attitude formation. A type of 
integration among these various points of view 
can be achieved by specifying the conditions 
under which one or another of the theories best 
accounts for the phenomena under investiga- 
tion. We feel that all people develop and alter 
their attitudes in terms of all three motiva- 
tional contexts. Certain people, however, will 
be more likely to acquire attitudes on the basis 
of one kind of motivational pattern than will 
other people. Moreover, attitudes formed in 
the service of a given type of motivation will 
tend to show different characteristics than atti- 
tudes acquired in the interests of other motiva- 
tional sources. 

We believe this approach is more profitable 
than the attempt to expand one of the above 
theories to take account of facts and relation- 
ships for which it was not originally devised. 
Such theoretical elaboration frequently means 
resorting to verbal magic in the definition of 
the theoretical concepts or gross distortion of 
the facts which are to be explained.²


² See, for example, N. R. F. Maier’s striking demon-
stration of the limitations of reinforcement theory in
accounting for behavior under conditions of frustra-
tion (8).



In the following pages we shall present:
1. A formulation of our theoretical assump-
tions.
2. An outline of an experimental program
which is aimed at testing our assumptions.


THEORETICAL ASSUMPTIONS 


A. The Nature of Attitudinal Structure 


An attitude is a stable or fairly stable organi-
zation of cognitive and affective processes
around some object or referent. The object or
referent points to some thing, some relation-
ship, or some aspect of the environmental
field, and is thus directly related to the cues
which elicit the expression of the attitude. The
term cognitive object will be used for this part
of attitudinal structure. The cognitive object
is the perceived nature of the environmental
referent; that is, a number of characteristics
are seen as denoting and describing the object
in the real world. In Bruner’s terms (3), the
cognitive object would be the expectations
about the environmental object.

The cognitive object is imbedded in a system
of values. Since they are generalized prefer-
ences, values give sign loadings to cognitive ob-
jects and their associated attitudes. While
attitudes are highly specific in their linkage to
cognitive objects, the same system of values
may be related to a large number of cognitive
objects. For example, the content of ethnic
prejudices defines various minority groups in
a highly structured manner. Thus, to the bigot,
the Jews are loud, Negroes are lazy, Mexicans
are dirty, and Turks are lascivious. However,
all of these specific attitudinal contents may
reflect the same underlying authoritarian phi-
losophy. Thus the cognitive object is in part a
function of knowledge and perceived reality,
but it is also colored by the value to which it
is tied. One characteristic of a specific attitude
would be the strength of the link between its
cognitive object and the system of values in
which it is imbedded. A related characteristic
would be the relative strength of the links of
the cognitive object to other sets of values.

Intervening between the cognitive object
and the set of values to which it is referred is
the frame of reference in which it is perceived
at any one time. The frame of reference is a
cognitive set of standards operative at the mo-
ment which results from all the forces playing
simultaneously upon the individual and which
determines both the dimension for making a
judgment and the specific judgment made. The
external cues in the environment not only 
arouse the perception of a cognitive object but 
they are also partly responsible for eliciting the 
way in which the perceived object will be re- 
lated to other objects and beliefs. The frame of 
reference is also determined by the characteris- 
tic way which the individual has of evaluating 
certain objects and by his emotions and needs 
at the time. 

Since at any given time a great deal is hap-
pening to and within the individual, it is neces-
sary for him to impose a cognitive organization
upon these fluid forces if he is to function in an 
integrated fashion. Insofar as the individual is 
bombarded by inner impulses which must be 
attuned to the requirements of the external 
situation, it is reasonable to postulate that he 
is in a constant state of conflict. The frame of 
reference helps him to resolve conflicts. 

In other words, the frame of reference is the 
most variable process with respect to attitudi- 
nal structure. The cognitive object is relatively 
constant, and the larger structure of beliefs and 
values in which it is imbedded is also relatively 
constant. The frame of reference is affected,
however, both by changes in the external 
environment and changes in needs or motives. 


B. Attitudes and Motivation 


In terms of the focus of our research, it is
important to describe the role which motiva-
tional forces play in the development of atti-
tudes. The living individual is constantly in
some sort of motivational condition; hence
considerations about attitudinal change should
start at this level.

Broadly speaking, there seem to be three
types of motivational contexts in conjunction
with which attitudes are formed. At different
times and for different objects, an individual's
attitude may be largely a function of one or
more of these contexts. For the sake of theoretical
clarity and experimental manipulation, we
treat them as relatively separate contexts.
Later we shall give a specific example
of how several of these motivational patterns
operate simultaneously within a given personality.

[...] external world. This need may be observed in
the [...] of naive children, as well as in
the controlled experiments of trained scientists.
Often this quest for information seems to
be a curiosity which is entirely self-contained.
At other times, it appears to be in the service 
of other motives, such as survival or the desire 
to master the environment. 

Considered in this light, attitudes may some- 
times be a function of the range of information 
which has been accessible to the individual in 
regard to certain target objects. Let us assume, 
for example, that a curious child has heard of 
the word Negro for the first time. He goes to 
his parents for information and is told, among 
other things, that Negroes are bad people, 
stupid, dirty, animal-like. Assuming that he 
has heard no information to the contrary, that 
he has had no actual experience with Negroes, 
and that his parents have always supplied him 
with reliable facts about things, it follows that 
he will be inclined to accept this statement 
about Negroes. Even if the child is entirely free 
of other motivational needs to maintain this 
attitude, it is likely to persist until his contact 
with Negroes indicates to him that not all of 
them are stupid or until he is confronted by 
contrary information which he considers to be 
more reliable than that which his parents gave 
him. 

Reward and punishment. Some of the princi- 
ples of learning theory may be applied to cer- 
tain types of attitude formation. Specifically, 
these principles are involved when attitudes 
are adopted as a consequence of externally ap- 
plied rewards and punishments. A white 
youngster in rural Mississippi may begin life 
on very favorable terms with his Negro play- 
mates. Still, as he matures, his adjustment to 
and acceptance by the white community de- 
pend in some measure upon the extent to 
which he has come to share the prevalent anti- 
Negro biases. If he rejected these biases, he 
might be vulnerable to social ostracism and 
even more severe manifestations of disap- 
proval. 

Ego defense. In his social relationships every 
individual attempts to gratify his basic physi- 
ological needs and the socially acquired ones. 
Some individuals, because of the strength, ma- 
turity, and integration of their egos, are able 
to perceive and tolerate the existence of their 
inner drives, including those which may be 
socially taboo. Such a person can, in terms of 
a realistic assessment of the social situation, 
decide which impulses to express and which 
ones to suppress. It is possible for him to ap- 
proach others in a straightforward manner; to 
react to them in terms of how they, as unique 


ù 
3 


118 3 IRVING SARNOFF AND DANIEL KATZ 


individuals, actually fulfil or frustrate his 
needs. His social attitudes are likely to be 
appropriate since he is able to differentiate 
clearly between what is going on inside him 
and what is emanating from others.

At the other end of the scale of ego strength 
there are persons who are obliged to use devi- 
ous means of gratifying their impulses. In an 
attempt to resolve inner conflict, they mini- 
mize or obliterate certain aspects of their emo- 
tional life. This deflection of impulse from 
conscious awareness does not destroy it. On the 
contrary, it continues to press for overt expres-
sion and requires the individual to expend
considerable energy in keeping it below the 
threshold of consciousness. Despite the most 
strenuous countermeasures (repression and 
other mechanisms of defense), the impulse 
tends to attain at least partial expression and 
gratification. This expression is achieved by 
means of symptom formation, a device which, 
however incapacitating it may be, serves a dual 
and somewhat paradoxical function: 

1. It permits expression of the unconscious
impulse.

2. It prevents the individual from becoming
aware of the existence of the unconscious im-
pulse.

Attitudes may thus function as ego defenses, 
and may be viewed as symptoms. For example,
the type of bigot described in the California 
Studies is one who cannot accept his own hos- 
tile impulses (1). By projecting them onto 
others, he gains gratification of his impulses 
while maintaining the fiction that these im-
pulses originate in others rather than in him-
self.

Combinations of motivational forces: the
pseudodemocratic personality. We have already 
alluded to the fact that individuals tend to 
form and change their attitudes on the basis 
of a variety of motives which may operate si- 
multaneously. We shall illustrate this point 
by describing briefly a Personality type which 
we have called the pseudodemocratic syn- 
drome. 

This type of person is characterized by a 
cleavage between thought and action. On paper, 
he is tolerant; behaviorally, he is biased. He is 
both intelligent and sophisticated. He does 
not subscribe to the attitudinal crudities 
which are espoused by the type of bigot 
represented by a high score on the F Scale. 
Rather, on a purely intellectual basis, he is 
likely to show distinct repugnance for ethnic
stereotypes and those who promote them. Yet,
in choosing among the kinds of behavioral alter-
natives imposed by reality, for example, whether
to accept or reject minority group persons as
fraternity brothers, he consistently picks the
prejudiced one.

Generally, he is aware of this discrepancy 
between his ideals and his actions. Typically, 
he relies upon the mechanism of rationalization 
for help in resolving this discrepancy. 

Why is such a person forced into rationaliza- 
tion? Why is he not able to actualize his 
tolerant attitudes on a behavioral level? Our 
tentative answer to these questions is that he 
becomes overwhelmed by anxiety at the point 
of action. This anxiety stems from the degree of 
punishment which he expects to receive as a 
result of such action. 

Basically, he has a great need to conform to 
the standards of a group whose support and 
affiliation are central to his security. If he 
perceives this group to be strongly opposed to 
the ethnically tolerant action which he is
requested to take, he will be afraid to act. 

If these dynamics are valid, one may well 
appreciate the caution with which one of these 
pseudodemocratic personalities approaches the 
prospect of action. In his mind, making a 
behavioral choice is an all-or-none decision 
which may lead either to his expulsion (loss of 
job, status, approval) from the group whose 
favor he curries or to the reinforcement of his 
position in it. 

The tolerant social attitudes expressed by 
these individuals may represent an unconscious 
or token rebellion, on the abstract intellectual 
plane, against those on whom they are still 
emotionally dependent. For example, they 
may renounce the ethnic biases of their parents 
but reject a Negro roommate or a Jewish girl 
friend if such relationships seriously jeopardize 
continued parental approval and support. In 
other words, the pseudodemocrats may not 
feel strong enough to accept the responsibility
for the consequences of ethnically tolerant 
actions. On the other hand, they may be 
sufficiently strong to differ with their parents
in the areas of thought and verbalization. Such 
areas are relatively safe in our society since 
even the most vitriolic bigots do not openly 
oppose the principle of free expression. Be- 
havior, however, is another matter. To stand 
up and be counted implies a thoroughgoing set 
of convictions and commitments to them. It 
requires not only the taking of “sides” on an 
explosive social issue but also the ability to
withstand the same sort of rejections that are
habitually directed against the minority-group 
members with whom one overtly associates. 


C. Value Systems 


Values are relatively enduring, affect-laden 
codes. The affect derives from association with 
need gratification or deprivation and with 
emotional arousal. Thus, food, in satisfying a 
persistent basic need, is universally valued 
even when people are not actually motivated 
by hunger. Value systems are the organization 
of beliefs about affective symbols. The term
sentiment was used in a similar way by Shand 
(11) and McDougall (9), save that they mini- 
mized the cognitive or belief systems and were 
more interested in the combinations of innate 
emotions about objects. 

Value systems can achieve such a level of 
organization that they may pervade many 
aspects of the personality. These systems can 
refer to the environment, as when the indi- 
vidual has a set of beliefs about the demo- 
cratic process and the equality of opportunity 
for all people. Or they can refer to the self- 
image, as when the upper-middle class woman 
sees herself as a Lady Bountiful whose charity 
and goodness sustain a large group of de- 
pendents. 

There has been much debate about the func-
tional autonomy of these value systems as
motivational forces. Some theorists regard the
value system as capable of energizing the
individual, in and of itself. Other theorists
see more basic needs as the motivating force,
with the value system as the channel for the
expression of these needs. We shall follow the
latter school, with the admission, however,
that needs can readily be aroused through
activation of the value system. This sort of
conceptualization of the value system leads to
the assumption that identical values may serve 
a variety of different motivational processes. 

From this point of view it is incorrect,
therefore, to assume an identity between a
value system and a personal need of the indi-
vidual. An authoritarian set of beliefs does not
justify the inference that the individual has a
need to be authoritarian. His need may be one 
of power but this need can be expressed 
through other values than an authoritarian set 
of beliefs. 

It is also incorrect to assume an identity
between a generalized mechanism of personal-
ity, such as projection and scapegoating, and
an authoritarian set of beliefs. Again, the
content of the expression of the behavioral
mechanism does not have to be a particular
value system. A person may develop a general
set of values, such as a democratic ideology, in
which the major interest is in the content of 
democratic ideas. He may respond to demo- 
cratic appeals and make many sacrifices in their 
interest. Yet, in his daily interactions with 
others, he may behave like an autocrat.
Another person may be authoritarian in his 
ideology but be quite genuinely equalitarian in 
his actual interpersonal relations. In the former 
case, we may be dealing with value systems 
which have been elaborated rationally and 
systematically and which fulfil certain aspects 
of the individual’s self-image in circumscribed 
spheres of his activity. In the latter case, we 
may have a person who has been reared in an 
extremely parochial cultural milieu but whose 
emotional development has been enhanced 
by "loving and accepting" parents. Thus, the 
ideology which an individual embraces is not 
necessarily indicative of how he will orient 
himself to others in real life situations. 

It is true, however, that a generalized 
personality mechanism, such as projection of 
one's own aggression, would tend to go with 
authoritarian beliefs, other things being equal. 
It is also true that the power motive will be 
correlated with authoritarian values, other 
things being equal. It is important, however, 
for research to establish the relationship 
between personality dimensions and value 
systems through independent measures of 
both sets of variables. If we infer a personality 
dimension from a value system, then it is 
relatively easy to establish correlations with 
specific attitudes. These correlations are sus- 
pect, however, as demonstrating a relationship 
between a personality characteristic or need 
and a specific attitude. The F Scale, for example,
may not be a pure personality measure,
since the authoritarian values it measures may
well reflect the individual’s values rather than
other basic personality characteristics.


D. Attitudes and Behavior 


Specific overt behavior toward an object 
cannot be predicted solely on the basis of 
knowledge of the individual’s attitude toward 
that object. This difficulty in predicting stems 
from the fact that behavior is the resultant of 
many intrapsychic and interpersonal forces 
which may be quite independent of the attitude
in question. Theoretically, it would be neces-
sary to have on hand data concerning all of
these forces in order to predict accurately the 
behavioral fate of a given attitude. Since it is 
patently obvious that no actual experiment can 
meet these extensive theoretical requirements, 
the researcher is obliged to deal with fewer 
variables, and, hence, to state fewer, grosser, 
and more tentative hypotheses. 


E. Attitudes and the Dimensions of Appropriateness

What is “psychologically” real, that is, what 
the individual himself perceives and believes, 
is often accepted as the only relevant datum in 
social psychology. This unduly limits our 
science, however, and an adequate system must 
take account of experimental findings on per- 
ceptual distortion and selectivity. We propose, 
therefore, to use the term inappropriate to 
characterize the cognitive object of the attitude 
which does not accord with the actual char- 
acteristics of the object. In other words, the 
perceptual-cognitive objects of the attitudes 
may be gross distortions of the target object. 
In general, attitudes which are determined by 
unconscious motives will tend to be inappro- 
priate with respect to their cognitive content. 

If the person's ego could be strengthened, 
if he could come to accept consciously the 
existence of these repressed impulses, he would 
no longer require a devious, symptomatic 
channel for the expression of these impulses. 
He would be able to express his impulses more 
directly and at their true instigators or objects. 

Because of the distortion implicit in these 
kinds of inappropriate attitudes, the individual 
who possesses such attitudes is likely to be 
unrealistic in his reactions to target objects. 
This distortion may affect two components of 
attitude: content and affect loading. 

In terms of distortion of the target object,
the person with ego-defensive attitudes may:

1. Attribute to others qualities which they
do not possess.

2. Ignore qualities which they do possess.

In terms of affect loading, he may:

1. Experience an excessive emotional reac-
tion (either positive or negative) to real
or attributed qualities of the object.

2. Fail to experience an adequate emotional
reaction (either positive or negative) to
real or attributed qualities of the target.

The inappropriateness of an attitude is
not always a function of ego-defensive motiva-
tion. Sometimes it is a function of a restricted
range of information about an object. Or it may
reflect misinformation which has been learned 
in a context of externally applied rewards and 
punishments. It is necessary, therefore, to 
categorize individuals in terms of the motiva- 
tional functions which are involved in the 
acquisition and maintenance of their inappro- 
priate attitudes. For example, three persons 
may get identical high scores on a measure of 
anti-Negro attitudes. For individual A, this 
score is indicative of what he has grown to 
regard as factual data concerning Negroes. 
He has accepted these “facts” in the same way 
that he has accepted the proposition that one 
plus one equals two. For individual B, this 
score represents a reflection of misinformation 
acquired in a homogeneous milieu in which 
adjustment to the social group calls for the 
taking on of the attitude. In this instance, the 
attitude has a dominantly external referent 
(social acceptance) in terms of need satisfaction.
For individual C, however, strong anti-Negro 
prejudice is a symptom of his inability to 
handle his own hostile impulses. Instead, he 
projects them onto Negroes and thus expresses 
them while remaining unaware of their origin 
within himself. In this case, the attitude has 
a dominantly internal referent (ego defense) 
in terms of need gratification. 

We recognize, of course, the complexities of 
social reality and the limited access which 
individuals have to the social totality about 
them. This has, however, led to such extreme 
interpretations as that of Walter Lippmann 
who would see people as operating wholly in 
terms of social fictions (7). Similarly, in recent
social psychology the attempt has been made 
to define social reality in terms of social beliefs 
which happen to be accepted at a single point 
in time (6). But the better criterion of social 
reality is in terms of social consequences of the 
belief, and of continuing experiences in the 
social world. Our tendency to accept a fiction 
as the social reality is due in part to our dwell- 
ing upon the social scene for a very limited time 
period. The fiction of the overwhelming Dewey 
majority in the 1948 election was quickly 
exploded by the facts of the election. We 
believe, therefore, that the social psychologist 
cannot ignore the problem of the appropriate- 
ness of social attitudes merely because of the 
difficulties of the problem any more than the 
clinical psychologist can ignore the problem of 
whether the individual is deluded or reality 
oriented.

MOTIVATIONAL BASES OF ATTITUDE CHANGE


It should be noted that throughout the 
foregoing discussion there is the implicit 
assumption that the real qualities of objects 
are available to our perceptual apparatus; that 
is, the actual psychological characteristics of 
human beings can be measured, described, and 
objectified. Knowing these qualities will 
enable us to make differentiations along the 
vitally important dimension of appropriate- 
ness. We say vitally important because it seems 
to make a significant difference, both theoret- 
ically and practically, to know that A tends to 
take account of X in responding to X, while for 
B it is possible to substitute Y for X without 
any observable difference in his reaction. 

It is granted that an objective description 
of the stimulus situation, especially a fluid, 
reacting stimulus, such as a human being, may 
be difficult to achieve and, perhaps, even more
difficult to establish as a basis for securing
consensus among psychologists. Nevertheless, 
the difficulties of the task make it no less vital 
if we are to make a genuine attempt to develop 
a behavioral science. As a tentative start in this 
direction, we may be able to use standardized 
tests or observational measures on which we 
can build operational definitions concerning the 
qualities of social objects. 


F. Types of Motivation and Differential Change 
Procedures 
Since their identical attitudes have different 
motivational bases, the influence procedure 
which would effect a change in A’s attitude 
(the poorly informed person) would differ
qualitatively from the procedure used in
changing the attitudes of B (the conformer)
and C (the defensive individual).
Operationally speaking, this implies that 
A’s attitude may be markedly changed by 
presenting him with scientifically verified facts
about Negroes. In order to effect a change in
B's attitude, however, it may be necessary to
present this information to him in a social
situation in which he is rewarded for changing
his attitude. We would expect individual C to
react quite differently to this information. Be-
cause of his strong ego-defensive needs, his
attitude should remain unaltered, regardless of
the presence or absence of external rewards.
It should be noted, moreover, that attempts
to change the ego-defensive individual which
are based upon the logic of the first two ap-
proaches may actually reinforce his old atti-
tudes. These appeals really assume either (a)
the individual is interested in a more accurate
and more complete knowledge of the world, or 
(b) he is primarily concerned with maximizing 
the satisfaction of his conscious needs. If, 
however, he is primarily concerned with 
avoiding a direct facing of his own internal 
conflicts, then he will protect himself from such 
a possibility the moment he senses an attempt 
to make him change. The well-known phenom- 
enon of resistance is an example in point. The 
resistance generated to protect the ego results 
in a blanket rejection of the change situation, 
a perceptual distortion of what it is like and an 
emotional reinforcement of the attitude.
Though we recognize the importance of resist-
ance in other contexts, we have slighted its
role in attitude change. Hence, persuasion and
propaganda can have negative rather than
positive effects.

It is not our thesis, however, that ego 
defenses cannot be breached, but that different 
procedures are involved in dealing with these 
defenses than with nonconflicted motives. 


EXPERIMENTAL PROGRAM 
The following is a broad outline of a program 
of experimentation which is aimed at testing 
the foregoing theoretical assumptions. This 
outline is meant to be suggestive rather than 
definitive. It presents a methodology which is 
appropriate to our theoretical concepts.³


I. Changing Attitudes through Attacking the
Cognitive Object and the Frame of Reference 
in Which It Is Perceived: The Rational 
Approach 

It has been pointed out that inappropriate
attitudes, though more characteristically
acquired as ego defense, can also be acquired
(a) in the interests of social rewards and
punishments and (b) in the functioning of the
perceptual-cognitive process itself. In the last
mentioned instance, the assumption is that the
individual acquires beliefs either out of intellec-
tual curiosity, reality-testing, or a desire for
cognitive structure in terms of having to know
what his world is really like. The belief or
attitude may be inappropriate because the
individual comes into contact with only a
limited aspect of reality. In practice, of course,
a number of these processes occur simultane-
ously. The individual takes on certain attitudes
because he wants to be accepted as a group
member, but the beliefs he acquires must make


³ A beginning has been made on this experimental
program and results of specific studies will be reported 
in future articles. 




sense to him with respect to being accurate 
pictures of the world he knows. 

Gestalt psychology has restored the impor- 
tance of the cognitive factor in psychological 
functioning, and has assumed that men re-
spond to fact and reason. The theory would
be that when new facts are presented they 
result in cognitive reorganization rather than 
exclusion or blocking. In addition, sophisti- 
cated cognitive theory assumes that perception 
occurs in a frame of reference which is the 
momentary product of a number of possible 
forces. Pressing internal needs or the objective 
nature of the situation may result in a tem- 
porary frame in which things are perceived in 
relation to that frame. Since people already 
possess definite standards for perceiving Ne-
groes,⁴ it is important, if we are to change their
perceptions, to be sure that the dominant 
frame of reference at the time the materials are 
introduced is not the product of these old 
standards. Essentially, this calls for the activa- 
tion of a “new” frame without any reference to 
the old standards. Once the “new” frame is 
operative, material bearing upon the old cog- 
nitive object can be introduced without the 
old standard operating as a censor. Specifically,
this can be accomplished through experimenta- 
tion in the form of the creation of a frame of 
cultural relativity and cultural causation, and 
then the introduction of information about 
Negroes. 


II. Changing Attitudes through the Application
of Social Rewards and Punishments 


1. The use of group norms. One of the major
reasons for attitude change is the desire to 
gain the social approval of others. It is natural 
that experiments in social psychology have 
concentrated heavily upon the manipulation of 
group norms. The desire for social approval is 
a broad term, however, covering both deep- 
lying affiliation needs and the immediate 
rewards of adapting to the group situation. We 
are concerned primarily with this latter type 
of conformity, namely, the acceptance of group 
standards because of their immediate reward 
character. For example, the individual who 
accepts the values of the group has increased 
his chances of being liked by his fellows, of 
moving up in the group structure or of sharing 
in the returns from group effort. In these 
terms, it would first be necessary to find out 


⁴ Attitudes toward Negroes are the major dependent
variable of our research program. 


IRVING SARNOFF AND DANIEL KATZ


what the individual’s relation to the group is 
and what types of satisfaction he derives from 
group membership. If we know these facts, 
then we are in a better position to predict 
changes in attitude as a result of a changed 
perception of group norms. Similarly we could 
predict those individuals who would tend to 
maintain their attitudinal position independent 
of the group. The assumption here is at the 
simple level that the more the rewards and the 
greater the sanctions the group has, the more 
effective will a changed perception of the 
group position be. In the persuasion pattern 
the attempt is made to achieve cognitive 
restructuring through making the individual 
see new possibilities in achieving his goals. In 
the use of group norms, the change is more 
external in that the group norm is changed and 
the individual merely moves with it. 

2. Restructuring the individual's value systems. 
The great bulk of the efforts to change atti- 
tudes in the world of affairs is through persua- 
sion and argumentation, in which an appeal is 
made to existing value structures. Some value 
system, other than that to which the target 
object is ordinarily connected, is invoked, and 
the necessity of meeting its requirements is 
stressed. In inducing change in the manner 
described above, the emphasis was placed upon 
the perceptual side of the process, in getting 
people to look at the old object in a new frame 
of reference. Thus subjects would have had to 
take the further step of restructuring their 
attitudes themselves. No direct attempt would 
be made to suggest that they should not pursue 
a discriminatory policy toward Negroes. In 
this second type of experiment, however, the 
attempt will be to restructure their attitudes 
for them through making the specific connec- 
tions between the target object and a different 
set of values. This can be done (a) through 
exploiting a very powerful system, (b) through 
utilizing more different values for the change 
than existed for the old attitudinal structure, 
and (c) through showing how the values tied to 
the target are blocked by social reality and are 
not in fact being achieved. In practice, all 
three methods are combined to produce change. 
From the point of view of content, the per- 
suasive technique frequently relies upon 
demonstration of a fusion between the indi- 
vidual's self-interest and socially desirable 
goals. It further shows the feasibility of the 
alternative suggested, as contrasted to the 
impracticality of all other alternatives. These
relationships have been systematically formu-
lated and experimentally demonstrated by
Peak and her colleagues (4, 10).

The theory implicit in these persuasive 
techniques is that the individual will restruc- 
ture his psychological field to maximize the 
attainment of his goals and values. Personality 
theorists have suggested that this theory 
is correct within limitations, namely, that 
when ego needs are threatened, the individual 
will respond to persuasive efforts by blocking 
and resistance. The resistance may result in 
more strongly reinforced attitudes than was 
true before the attempted persuasion. Again, 
the prediction is that the change attempts will
have differential effects, depending upon the
personal needs of the subject. Moreover, since
the appeal to values operates more at the
motivational than the perceptual level, the
prediction is that for nonego-defensive people
there will be more change than in the first and
second series of experiments. For the ego-
defensive people there will be less change 
because of the phenomenon of resistance. 


III. Changing Attitudes by Attacking Ego-
Defensive Forces through the Use of
Catharsis and Direct Interpretation 


Two techniques widely employed in psycho-
therapy which have possible application to
group situations are permissive catharsis and
direct interpretation. The basic objective is 
to help the individual attain insight and to 
restructure his attitudes accordingly. 

1. Permissive catharsis. The free ventilation
of thought and feeling in an accepting inter-
personal atmosphere is an integral part of
virtually all schools of psychotherapy. It is
generally assumed that this type of expression
helps the patient to change in the following
ways:

a. It offers him relief from the tension of
burdensome affects such as guilt and hostility.
The very act of expressing these feelings is
supposed to drain off energy which would
necessitate the formation of various somatic
and psychic symptoms.

b. It permits him to verbalize and hence
objectify his inner conflicts. Such objectifica-
tion leads to clearer self-perception. This
clarification in turn makes it possible for him
to utilize his intellect more effectively in
choosing among alternative means of resolving
his conflicts.


2. Direct interpretation. This technique uses 
a combination of logic and suggestion; pa- 
tients are confronted with psychological 
explanations of their behavior. These explana- 
tions are supposed to give them immediate 
insight into the internal factors (motivations) 
which determine their overt behavior. The 
patient is then to bring the weight of his own 
common sense to bear upon the proffered
insight and to change his behavior in the light
of this newly acquired knowledge. Thus, the 
lengthy process of uncovering is short-circuited 
and the patient is directly informed about the 
underlying motives of which he had previously 
been unaware. In employing this sort of ap- 
proach, the therapist is generally obliged to 
rely upon the degree of authoritativeness with 
which the patient invests him and the sources 
(research findings, theory) on which he bases 
his interpretation. 

It is granted, however, that these techniques 
will not be effective for all ego-defensive 
people. In some cases resistances may still be 
too strong to be affected, either by catharsis 
or by direct interpretation. The assumption is 
made, therefore, that people who already have 
enough insight to be concerned about their 
own conflicts and to be dissatisfied with some 
of their own behavior will respond to these 
techniques. Just as individual therapy can 
make little headway with patients who do not 
want to be helped, so group procedures will 
affect only those group members who are 
dissatisfied with their self-image, even though 
they do not understand the basis of their 
conflict. 

In this type of experiment the direct inter- 
pretation can consist of psychological explana- 
tions of the dynamics of scapegoating and of 
displaced and repressed hostility. The objective 
will be to give self-insight. Individuals who are 
more insightful at the start will apply the 
interpretation to themselves more readily than 
individuals lacking insight. One advantage of 
the technique of direct interpretation is that it 
can be used in printed materials and is not 
confined to interpersonal communication. 


IV. Changing Attitudes by Tapping Several
Motivational Contexts: Attacking Ego De-
fenses and Applying Social Rewards and
Punishments

It may be necessary to apply several differ-
ent types of influence procedures in order to
change the attitudes of some people. The
pseudodemocratic personality, for example, 
may require this sort of multiple procedure. 
This personality type may be detected on the 
basis of a discrepancy between his general
social attitudes and his specific social actions.
Furthermore, he would tend to use philosoph- 
ical kinds of rationalization when asked to 
justify this discrepancy. In this regard, we 
may expect to get the same sort of responses 
as those offered by politicians who proclaim 
their ethnic tolerance but oppose a compulsory 
FEPC. 

In order to change their positions on matters
of action, i.e., in order to bring their actions 
into line with their thoughts, it may be neces- 
sary to adopt the following twofold change 
procedure: 

1. Give them insight into the nature of the 
mechanism of rationalization by means of 
which they attempt to “explain” the cleavage 
between their unprejudiced thoughts and their 
prejudiced actions. In addition, it might be 
expedient to interpret to them the dynamics 
of their symbolic rebellion against their 
parents. 

2. Present them with assurance that be- 
havioral change in a more tolerant direction 
will be well received by their peers. For, if they 
are to jeopardize (in their minds) old sources 
of security, they will probably need guarantees 
of new support and acceptance.

Thus we would predict that the insightful 
pseudodemocrat will show the greatest change 
in those group situations in which there is the 
greatest amount of overtly expressed con-
sensus on the issue of adopting ethnically
tolerant actions. 


SUMMARY 


In this paper, we have attempted to relate 
the major motivational bases of attitude to 
change by indicating the conditions under
which they are operative. In addition, we have
outlined methodological approaches which
seem to lead to appropriate tests of our hypo-
theses.

Our viewpoint maintains that social atti-
tudes reflect a variety of motivational sources.
Thus similar attitudes may function in the
service of different motivational patterns.
Hence, in order to change attitudes in a pre- 
dictable manner, it appears to be necessary to 
devise influence procedures which are aimed 
at the differing underlying motivations. 

In describing the motivational forces which 
determine attitude formation and change, we 
have focused upon three basic variables: re- 
ward and punishment, reality testing, and 
ego defensiveness. We also discussed the 
possibility of a combination of these forces in
terms of a specific personality type, the pseudo- 
democratic syndrome. 


REFERENCES 

1. ADORNO, T. W., FRENKEL-BRUNSWIK, ELSE, LEVIN-
SON, D. J., & SANFORD, R. N. The authoritarian
personality. New York: Harper, 1950.

2. ASCH, S. E. The doctrine of suggestion, prestige, and
imitation in social psychology. Psychol. Rev.,
1948, 55, 250-276.

3. BRUNER, J. S., & POSTMAN, L. On the perception of
incongruity. In J. S. Bruner & D. Krech (Eds.),
Perception and personality. Durham: Duke
Univer. Press, 1949.

4. CARLSON, E. R. Attitude change through modifi-
cation of attitude structure. Unpublished doctoral
dissertation, Univer. of Michigan, 1953.

5. DOOB, L. W. The behavior of attitudes. Psychol.
Rev., 1947, 54, 135-156.

6. FESTINGER, L., SCHACHTER, S., KELLEY, H. H., &
THIBAUT, J. Theory and experimentation in social
communication. Ann Arbor: Univer. of Michigan
Press, 1950.

7. LIPPMANN, W. The phantom public. New York:
Harcourt Brace, 1925.

8. MAIER, N. R. F. Frustration. New York: McGraw-
Hill, 1949.

9. MCDOUGALL, W. An introduction to social psy-
chology. Boston: John W. Luce, 1926.

10. ROSENBERG, M. J. The experimental investigation
of a value theory of attitude structure. Un-
published doctoral dissertation, Univer. of Michi-
gan, 1953.

11. SHAND, A. F. The foundations of character. London:
Macmillan, 1926.


Received July 25, 1953. Prior Publication. 




ON THE BROWN ADAPTATION OF THE ROSENZWEIG P-F 
FOR ASSESSING SOCIAL ATTITUDES¹


ROBERT SOMMER 
The Menninger Foundation 


ONE of the most promising of the pro-
jective techniques for assessing social
attitudes is the adaptation by Brown
(1) of the Rosenzweig Picture-Frustration
Study. The present paper describes a series of
small-scale investigations designed to assess 
the utility of the technique in a variety of situ- 
ations. 


PROCEDURE 


In the course of the investigations, it was 
found that drawings with more ambiguous dia- 
logue were capable of eliciting more diagnostic, 
less stereotyped responses than drawings 
portraying concrete, well-defined situations.
For this reason, most of the original Brown
drawings were revised and can be described as
follows:
1. A white, sitting at a desk, telling a Negro,
“I can't see you this morning as we arranged
yesterday.”
2. A white saying to a Negro, “You Negroes
sure let people walk over you.”
3. A white worker complaining to his boss,
“I don't like a nigger as foreman!”
4. A white hotel clerk telling a Negro, “We
take no Negroes here, sorry.”²
5. A white, drinking at the bar of a private
club, confiding to another white, “We know
how to keep them in their places.”
6. A Negro saying rather bitterly to a white,
“Some day things will be different.”
7. An unkempt white standing in front of a
dance hall asking a Negro couple, “What do
you want?”²
8. A white sitting at a desk telling a Negro,
“I've heard that you've been acting mighty
independent lately.”
9. A white street-corner speaker shouting,
“All segregation must be abolished!”²
10. A white patron asking a Negro waiter,
“How do they treat you down here?”²

¹ There are a great many individuals who cooperated
more than generously in the course of the research.
Chief among these is Dr. J. F. Brown, without
whose assistance the study could not have been under-
taken. Particular appreciation is due Drs. E. R.
[…]onds, C. W. Dupee, L. M. Killian, Mr. Bernard
[M]oskowitz, Ross and Anita Teller, and Martin R.
[…]. We are indebted to Dr. S. Rosenzweig for
permission to use several of the original Brown
drawings.

² Item found to be unsatisfactory and eliminated
after Stages 1-5 had been completed.

The ten drawings were assembled in the form 
of a small booklet with one drawing to a page. 
The subject wrote his responses on a sheet of 
paper following each drawing. At the end of 
the booklet, the subject completed an eight- 
item Likert-type scale on attitude towards the 
Negro. As rapport was good and anonymity 
assured, there appears little reason why a sub- 
ject would deliberately distort his answers to 
the Likert-type scale. If he did, however, this 
would decrease the accuracy of prediction. 
The assumption of complete cooperation is not 
crucial as a conservative estimate of corre- 
spondence is ordinarily desirable. Unless other- 
wise stated, the terms “prejudiced” and “un- 
prejudiced” will refer to responses to the scale 
items. 

The presentation of the paper can be divided 
into five relatively autonomous stages: (a) pre- 
dicting scale responses and detecting simulated 
attitudes; (b) distinguishing between degrees 
of prejudice; (c) qualitative diagnoses; (d) in- 
clusion of supplementary directions; (e) re- 
sponses of Negro subjects. 

RESULTS AND ANALYSIS 


The subjects in the first stage of the study 
were 150 white students in introductory psy- 
chology, sociology, and education classes at the 
University of Oklahoma and 20 white members 
of an interracial fraternity at Ohio Wesleyan 
University. Eighty-five of the Oklahoma and 
the 20 Ohio Wesleyan subjects were presented 
with the booklets along with the following 
instructions: 

On the next few pages you will find a series of draw- 
ings showing two people conversing with each other. 
The dialogue for one of the characters in each picture 
is omitted. Please fill in whatever you feel this person 
would answer in view of what is being said to him. Write 
his response on the dotted lines on the paper following 
each picture.


The remaining 65 Oklahoma subjects were 
directed orally: 


We have been running a survey on attitude towards 
the Negro in many of the other classes. We have been 
giving them booklets containing a series of drawings
showing a Negro talking to a white with the dialogue 
for one of them omitted. The subject then fills in what 
he thinks the person would respond. We would like 
you to take the survey but in a special way. We would 
like you to try to make yourselves appear as liberal 
and unprejudiced as possible—as if you had no prej- 
udices at all against the Negro. There is a scale at the 
end of the booklet that we would like you to answer 
with your own opinions, but try to make your responses 
to the drawings show that you are very liberal in your 
attitude towards the Negro. 


From the 170 subjects, the responses of 58 
whose Likert scores fell in approximately the 
top and bottom quintiles were selected for 
further study. With the highest possible score 
at forty and the lowest at eight, the division of 
the 58 subjects was as follows: 22 subjects, 
Likert scores 39-40 (M = 39.7); 18 subjects,
Likert scores below 27 (M = 22.6); 18 subjects,
Likert scores below 27, but distorting their
responses to the drawings (M = 22.1).

The booklets from the three groups, unprej- 
udiced, prejudiced, and pseudo-unprejudiced 
subjects, were shuffled together and prediction 
made as to the Likert-scale position of the 
subject. Throughout the study, the interpreta- 
tions were made by the author. Twelve of the 
eighteen cases deliberately distorting their 
responses were detected. When the booklets of 
these eighteen pseudo-unprejudiced subjects 
were removed from the group, 85 per cent of 
the cases were correctly predicted as to Likert- 
scale score (χ² = 18.5, p < .01).
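
The report does not say how this chi-square value was computed. As a rough plausibility check only, the sketch below assumes a one-sample goodness-of-fit test against a 50 per cent chance hit rate; that assumption and the function name `chi_square_vs_chance` are illustrative and are not taken from the paper.

```python
def chi_square_vs_chance(correct, total):
    """Goodness-of-fit chi-square (1 df) for `correct` hits out of `total`,
    against an ASSUMED 50 per cent chance hit rate."""
    expected = total / 2
    wrong = total - correct
    return ((correct - expected) ** 2 + (wrong - expected) ** 2) / expected


# 85 per cent of the 40 unprejudiced and prejudiced booklets is 34 correct.
print(round(chi_square_vs_chance(34, 40), 2))

# Table 1 lists N = 39 for this comparison; 33 of 39 also rounds to 85 per cent.
print(round(chi_square_vs_chance(33, 39), 2))
```

Under this assumption both figures are of the same order as the reported 18.5, and both exceed 6.63, the .01 critical value for one degree of freedom.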

A further problem was whether or not the 
drawings would be sensitive to degrees of prej- 
udice. To this end, responses were secured from 
46 members of an education class at East 
Stroudsburg (Pa.) State Teachers College. 
Booklets of six subjects with Likert scores from 
32 to 36 (slightly prejudiced subjects, M = 
35.2) were contrasted with those from six 
subjects with Likert scores under 26 (moder- 
ately prejudiced subjects, M = 21.8). The 
twelve booklets were shuffled together and 83 
per cent were correctly diagnosed. 

Thus it was seen that the technique could 
secure results roughly paralleling those derived 
from a standard scale. If this were its sole 
raison d'être, there would be little justification
for administering it, as the interpretation of a 




standard scale is a far simpler task. We were, 
therefore, concerned with the possibility of 
securing from the drawings more than is ordi-
narily derived from a standard scale. “More
than” in this case represents qualitative
analyses of responses that outlined interrela- 
tionships and finer nuances of attitude than are 
ordinarily represented by the single score on a 
standard scale. An example of such an analysis, 
the record of a young housewife, is now pre- 
sented. 


She is liberal in her attitude towards the Negro, 
believing that all people are equal regardless of race. 
However, she is not a crusader in her beliefs on this 
topic. In this respect, she would be designated as pro- 
Negro yet a gradualist in regards to ameliorative 
activity. 

She places a very strong emphasis upon fair play 
and seems to possess a very selfless and ethical outlook 
on life even to the extent of preferring martyrdom to 
overt resistance. 

In this respect racial discrimination runs counter to
her ethical beliefs, especially since it violates her code
of fair play. She does not appear personally involved
with the problem of race discrimination so much to 
actually aid the Negro but more to correct a violation 
of fair play. 


Validation of these analyses was attempted 
through the utilization of blind interpretation 
coupled with blind matching. The booklets 
were completed by fifteen adult white patients 
under analysis by Dr. J. F. Brown. The book-
lets, listing only age, sex, religion, and occupa-
tion, were sent to the author for blind inter-
pretation. The interpretations were then
divided into three objective categories (female
N = 6; male physicians N = 4; nonphysician
males N = 5) and returned to Dr. Brown for
blind matching with the patient.

The results were as follows: for the group of 
six females, four were correctly matched while 
two interchanged. For the group of five males, 
all were correctly matched. For the group of 
four physicians, two were correctly matched 
while two interchanged. The matchings in each 
group exceeded the success to be expected by 
chance. On the basis of the probability tables 
provided by Chapman (2), one could expect 
these results to be attributable to chance in 
less than one in a thousand trials. 
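
The chance expectation for blind matching of this kind can be worked out by counting permutations. The sketch below computes, for a group of n interpretations, the probability of at least k correct matches under random assignment. It is an independent illustration, not a reproduction of Chapman's tables; the function names are hypothetical, and multiplying the three group probabilities assumes the groups were matched independently.

```python
from fractions import Fraction
from math import comb, factorial


def derangements(n):
    """Number of permutations of n items in which no item keeps its place."""
    d = [1, 0]
    for k in range(2, n + 1):
        d.append((k - 1) * (d[k - 1] + d[k - 2]))
    return d[n]


def p_at_least(n, k):
    """Probability of k or more correct matches when n items are matched at random."""
    favorable = sum(comb(n, j) * derangements(n - j) for j in range(k, n + 1))
    return Fraction(favorable, factorial(n))


# (group size, number correctly matched) as reported in the text
groups = {
    "females": (6, 4),
    "nonphysician males": (5, 5),
    "male physicians": (4, 2),
}

joint = Fraction(1)
for name, (n, k) in groups.items():
    p = p_at_least(n, k)
    joint *= p
    print(name, p, float(p))

print("joint", joint, float(joint))
```

On this reckoning the joint probability falls well below one in a thousand, which agrees with the level the author reports.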

In assessing the protocols, the interpreter 
was oftentimes handicapped by a lack of inter- 
pretable material. Responses may have been 
too brief, stereotyped, or mutually inconsist-
ent. It was felt that his task could be simplified
if additional interpretable material could be
obtained. To this end, an appendage was added 
at the conclusion of the booklets instructing 
the subjects: 


Now, please turn back to the first drawing (the 
Negro being told, “I can't see you this morning as we
arranged yesterday.”) and underneath what you
thought he would respond, write what the other person
would answer when he said this. Please do not change
your original responses, only write what you feel the
other person would answer, once he heard what you
had written. Please do this for each drawing. If you 
need more space, you may use the backs of the pages. 


Booklets including these supplementary di- 
rections were administered to 63 white students 
in an elementary sociology class at the Penn- 
sylvania State College. The responses of nine 
subjects with Likert scores under 26 (M =
22.3) and eight whose Likert scores were 38
and over (M = 38.9) were shuffled together
and 94 per cent of the respondents were cor- 
rectly diagnosed. The results thus far are 
summarized in Table 1. 
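
The percentages in Table 1 can be related back to case counts. The text states only one count directly (twelve of eighteen feigned records detected); the sketch below finds which whole-number counts are consistent with each reported percentage, so the other counts it prints are inferences rather than figures from the paper. The function name `counts_matching` is hypothetical.

```python
def counts_matching(percent, n):
    """Return every count k (0..n) whose percentage of n rounds to `percent`."""
    return [k for k in range(n + 1) if round(100 * k / n) == percent]


# (label, reported percentage, N) taken from Table 1
reported = [
    ("feigned nonprejudice detected", 67, 18),
    ("prejudiced vs. unprejudiced", 85, 39),
    ("slightly vs. moderately prejudiced", 83, 12),
    ("two rounds of dialogue", 94, 17),
]

for label, percent, n in reported:
    print(label, counts_matching(percent, n))
```

Each percentage corresponds to a single possible count, which makes the table easy to check against the narrative.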

TABLE 1

SUMMARY OF RESULTS FROM STAGES ONE THROUGH FOUR

PROCEDURE                                   RESULTS

Detection of subjects feigning non-         67% detected
  prejudice (N = 18)

Differentiating between prejudiced          85% correctly
  and unprejudiced subjects                   diagnosed
  (N = 39)

Differentiating between slightly            83% correctly
  and moderately prejudiced sub-              diagnosed
  jects (N = 12)

Matching of qualitative diagnoses           Significant beyond
  (N = 15)                                    .001 level

Differentiating between prejudiced          94% correctly
  and unprejudiced subjects using             diagnosed
  two rounds of dialogue (N = 17)


The drawings were also administered to 65
students at a Southwestern Negro university.
However, the absence of any accepted criterion
of Negro attitudes precluded a design paral-
leling that employed with previous groups.³
This was regrettable but the results secured
suggested that the technique was well-suited
for assessing these attitudes. The protocols
and brief summaries of two subjects, both stu-
dents at a Southwestern Negro university, are
presented:


D. B. S.

Diagnosis: Subject seems hurt and sensitive. He
feels whites don't care about him. They are a world
apart. The subject's response to Item 9 (the street-
corner speaker) points to the depth of his feelings. He
manifests little hope that the situation will improve
in the near future.

1. I am sorry but could I arrange to see you to-[…]

2. I don't think that concerns you.

3. And I wouldn't like to be a Nigger foreman,
I am a Negro, you could be a Nigger for all I know.

4. […] Thanks anyway.

5. Yes and the Negroes know how to keep you in […]

6. Yes I hope so but I don't think they will be.

7. I am sorry; but I thought Negroes were allowed […]

8. What difference does that make to you how I
have been acting.

9. I am so proud to hear that. If only they meant it.

10. What difference does that make to you? how
they treat me.


³ A brief attempt was made to devise a criterion
scale but only one item seemed satisfactory for use with
[…] level. Needless to say, a trial
[…] of this single item with […] sub-
jects proved unsatisfactory.


F. N. 

Diagnosis: Subject is overtly bitter at the prejudiced 
whites. Desires to live as she wants and feels “prej- 
udiced white fools” prevent her from doing so. She is 
quite proud and possesses feelings of martyrdom. 
Unlike subject D. B. S., she distinguishes between the 
“prejudiced white fools” and the “educated white 
man” and has an abiding faith that the latter can 
some day eliminate discrimination. 

1. Why in the hell don’t you get your business 
straight? You are less than a man to break an engage- 
ment. Let your word always be your bond. Good Day. 

2. Just because you white folks deprive us of our 
privileges to react as we really want to. Our freedoms 
are taken away from us. 

3. Turn and look at yourself. You are a nigger in 
character, and supposed to be a boss. 

4. There was no sign outside saying “No Negroes.” 
Why must you try to offer the Negro so many dis- 
advantages? 

5. Them who? White like you I suppose. 

6. When you stop acting like white prejudiced fools 
from nowhere. 

7. What do you think we want? You uncouth white- 
faced bull. 

8. Why can't I act independent? You act inde- 
pendent. Because of my color, my independence is an 
asset to the Negro race. White folk aren't the only 
poor independent fools. None other than you. Take 
your job for what it's worth. Good Bye. 

9. Thanks to the educated white man for wanting

tion to be abolished.

10. As if I am one of God's children. They believe 
strongly in brotherly and sisterly love. I could be 
treated no better if I was in a place of nothing but 
Negroes. Thanks to them for honoring me as being 
somebody. May God always bless them with riches and
happiness.




The technique seems well-suited for eliciting 
attitudes of Negro subjects with a minimal ex- 
penditure of effort. Although more limited in 
scope than, for example, the Thompson modi- 
fication of the TAT, the drawings may serve 
as economical clinical instruments amenable 
to group administration with Negro subjects. 


Discussion 


It should be apparent that there remains a 
great deal to be done before the technique can 
be said to be “validated” for any particular 
task. However, our results have been positive 
and consistent, and should demonstrate that 
the technique definitely merits further re- 
search. The present paper may be regarded as 
a progress report; it describes several positive 
findings yet the results are not conclusive in 
any sense of the word. 

More specifically, the detection of falsified 
responses deserves further attention. In the 
first stage, twelve of eighteen respondents of 
this type were correctly identified. This in itself 
is an inconclusive performance. It may be considered
as satisfactory if no other economical
techniques are available, but as unsatisfactory 
if other procedures can detect 80 or 90 per cent. 
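
The arithmetic behind this judgment can be sketched briefly. The figures 12 and 18 come from the first stage; the Wilson score interval and the helper name `wilson_interval` are assumptions introduced here for illustration and were not part of the original analysis. With only eighteen respondents the approximate 95 per cent interval is wide, which is one way of seeing why the result is called inconclusive.

```python
import math


def wilson_interval(successes, n, z=1.96):
    """Approximate 95% Wilson score interval for a proportion (illustrative helper)."""
    p = successes / n
    denom = 1 + z ** 2 / n
    centre = (p + z ** 2 / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z ** 2 / (4 * n ** 2)) / denom
    return centre - half, centre + half


# Figures reported for the first stage: 12 of 18 feigning subjects detected.
detected, total = 12, 18
rate = detected / total
low, high = wilson_interval(detected, total)

print(f"detection rate: {rate:.2f}")
print(f"approximate 95% interval: {low:.2f} to {high:.2f}")
```

Because the interval extends from below one-half to above 80 per cent, a sample of this size cannot separate the technique from a procedure that detects 80 per cent.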

Also, of greater consequence, was the inter- 
preter's cognizance of the characteristics of the 
scoring population. It is problematical whether
two-thirds of the pseudo-unprejudiced subjects 
would have been detected if the interpreter 
had been unfamiliar with the directions given 
the subjects. 

Another methodological consideration that 
merits attention concerns the use of “extreme 
groups" in the form of top and bottom fifths. 
It should be noted that it is standard practice 
in validation studies to employ groups that can 
be designated, relatively unequivocally, as 
belonging in one category or another. To the 
writer, Likert-type scores averaging "slightly 
favorable" remain an enigma for criterion pur- 
poses. It seems unnecessary to point out the 
futility in predicting categories that are them- 
selves matters of speculation. Also, our preju- 
diced group was certainly not “extreme” in
their prejudice as they averaged approximately
“three” or “undecided” in their scale scores. 

A final suggestion involves the use of the 
drawings with children or groups not possess- 
ing the verbal facility required for most stand- 
ard scales. The majority of subjects regard 
responding to the drawings as an interesting 
task, and with children, whose attitudes may 
not yet have crystallized, the technique ap- 
pears particularly useful.5 

In retrospect, our results with the technique 
were both positive and consistent. It is regret- 
table that the time and facilities available pre- 
cluded a wider scope for the study. It seems in 
order then (a) to make our results available to 
other workers, and (b) to attempt to stimulate 
further research with the technique. To these 
ends, the present series of investigations are 
dedicated.* 

REFERENCES 


1. Brown, J. F. A modification of the Rosenzweig 
Picture-Frustration test to study hostile inter- 
racial attitudes. J. Psychol., 1947, 24, 247-272. 

2. CHAPMAN, D. W. The statistics of the method of 
correct matchings. Amer. J. Psychol., 1934, 46, 
287-298. 


Received September 18, 1953. Prior Publication. 


*It may be germane to this point to mention a
small investigation with unselected subjects that was 
undertaken after Stages 1-5 had been completed. 
Mr. D. M. Kever had kindly administered ten of the 
booklets to a group of eight soldiers at Fort Churchill 
(Canada) and two female subjects from Seminole, 
Oklahoma. These were mailed in two groups of five to 
the author who attempted to rank the subjects in 
terms of probable Likert-scale position in each group. 
The resulting rho coefficients between subject's actual
Likert rank in the group and the author's ranking were
.85 and 1.00.
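
The rank-order coefficient referred to in this note can be sketched as follows. The rankings below are hypothetical, since the actual ranks of the ten subjects are not reported, and the function name `spearman_rho` is introduced only for illustration. The shortcut formula used assumes no tied ranks.

```python
def spearman_rho(rank_a, rank_b):
    """Spearman rank-order coefficient via the shortcut formula (no tied ranks)."""
    n = len(rank_a)
    d_squared = sum((a - b) ** 2 for a, b in zip(rank_a, rank_b))
    return 1 - 6 * d_squared / (n * (n ** 2 - 1))


# Hypothetical ranks for one group of five subjects.
likert_rank = [1, 2, 3, 4, 5]
judged_identical = [1, 2, 3, 4, 5]   # interpreter agrees on every position
judged_one_swap = [1, 2, 4, 3, 5]    # two adjacent subjects interchanged

rho_identical = spearman_rho(likert_rank, judged_identical)
rho_one_swap = spearman_rho(likert_rank, judged_one_swap)

print(rho_identical)
print(rho_one_swap)
```

With groups of only five, a single interchange of adjacent subjects already lowers the coefficient to .90, so coefficients of this kind rest on very few judgments.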

5 It seems reasonable to believe that the utility of 
the drawings varies inversely with the sophistication 
of the subject. This may underscore the complemen- 
tarity of the drawings and standard scales in which the 
reverse would seem to hold true. It is suggested that 
the drawings be administered and interpreted along 
with standard scales. The drawings may tap the finer 
nuances and idiosyncratic dimensions, while the zealousness
of the interpreter can be held within reasonable
bounds by the standard-scale score. 

* Copies of the drawings and an outline of the method
of interpretation can be obtained by writing the author
at the Research Dept., The Menninger Foundation,
Topeka, Kansas.


CASE REPORTS 
THEY SAW A GAME: A CASE STUDY 


ALBERT H. HASTORF AND HADLEY CANTRIL


Dartmouth College 


On a brisk Saturday afternoon, November 23, 1951,
the Dartmouth football team played Princeton in
Princeton's Palmer Stadium. It was the last 
game of the season for both teams and of 
rather special significance because the Prince- 
ton team had won all its games so far and one 
of its players, Kazmaier, was receiving All- 
American mention and had just appeared as 
the cover man on Time magazine, and was 
playing his last game. 

A few minutes after the opening kick-off, it 
became apparent that the game was going to 
be a rough one. The referees were kept busy 
blowing their whistles and penalizing both 
sides. In the second quarter, Princeton's star 
left the game with a broken nose. In the third 
quarter, a Dartmouth player was taken off 
the field with a broken leg. Tempers flared 
both during and after the game. The official 
statistics of the game, which Princeton won,
showed that Dartmouth was penalized 70 
yards, Princeton 25, not counting more than a 
few plays in which both sides were penalized. 

Needless to say, accusations soon began to 
fly. The game immediately became a matter of 
concern to players, students, coaches, and the 
administrative officials of the two institutions, 
as well as to alumni and the general public 
who had not seen the game but had become
sensitive to the problem of big-time football
through the recent exposures of subsidized
players, commercialism, etc. Discussion of the
game continued for several weeks.

One of the contributing factors to the ex- 
tended discussion of the game was the ex- 
tensive space given to it by both campus and 
metropolitan newspapers. An indication of the
fervor with which the discussions were carried
on is shown by a few excerpts from the campus
dailies. 

For example, on November 27 (four days 
after the game), the Daily Princetonian 
(Princeton's student newspaper) said: 


Princeton University


This observer has never seen quite such a disgusting 
exhibition of so-called "sport." Both teams were guilty 
but the blame must be laid primarily on Dartmouth's 
doorstep. Princeton, obviously the better team, had no 
reason to rough up Dartmouth. Looking at the situation 
rationally, we don't see why the Indians should make a 
deliberate attempt to cripple Dick Kazmaier or any 
other Princeton player. The Dartmouth psychology, 
however, is not rational itself. 


The November 30th edition of the Princeton 
Alumni Weekly said: 


But certain memories of what occurred will not be 
easily erased. Into the record books will go in indelible 
fashion the fact that the last game of Dick Kazmaier’s 
career was cut short by more than half when he was 
forced out with a broken nose and a mild concussion, 
sustained from a tackle that came well after he had 
thrown a pass. 

This second-period development was followed by a 
third quarter outbreak of roughness that was climaxed 
when a Dartmouth player deliberately kicked Brad 
Glass in the ribs while the latter was on his back. 
Throughout the often unpleasant afternoon, there was 
undeniable evidence that the losers’ tactics were the 
result of an actual style of play, and reports on other 
games they have played this season substantiate this. 


Dartmouth students were “seeing” an 
entirely different version of the game through 
the editorial eyes of the Dartmouth (Dart- 
mouth’s undergraduate newspaper). For ex- 
ample, on November 27 the Dartmouth said: 


However, the Dartmouth-Princeton game set the 
stage for the other type of dirty football. A type which 
may be termed as an unjustifiable accusation. 

Dick Kazmaier was injured early in the game. 
Kazmaier was the star, an All-American. Other stars 
have been injured before, but Kazmaier had been 
built to represent a Princeton idol. When an idol is 
hurt there is only one recourse—the tag of dirty 
football. So what did the Tiger Coach Charley Caldwell 
do? He announced to the world that the Big Green had 
been out to extinguish the Princeton star. His purpose 
was achieved. 

After this incident, Caldwell instilled the old 
see-what-they-did-go-get-them attitude into his players. 
His talk got results. Gene Howard and Jim Miller 
were both injured. Both had dropped back to pass, had 
passed, and were standing unprotected in the back- 
field. Result: one bad leg and one leg broken. 




The game was rough and did get a bit out of hand 
in the third quarter. Yet most of the roughing penalties 
were called against Princeton while Dartmouth 
received more of the illegal-use-of-the-hands variety. 


On November 28 the Dartmouth said: 


Dick Kazmaier of Princeton admittedly is an 
unusually able football player. Many Dartmouth men 
traveled to Princeton, not expecting to win—only 
hoping to see an All-American in action. Dick Kazmaier 
was hurt in the second period, and played only a 
token part in the remainder of the game. For this, 
spectators were sorry. 

But there were no such feelings for Dick Kazmaier’s 
health. Medical authorities have confirmed that as a
relatively unprotected passing and running star in a 
contact sport, he is quite liable to injury. Also, his 
particular injuries—a broken nose and slight concussion 
were no more serious than is experienced almost any 
day in any football practice, where there is no more 
serious stake than playing the following Saturday. 
Up to the Princeton game, Dartmouth players suffered 
about 10 known nose fractures and face injuries, not 
to mention several slight concussions.

Did Princeton players feel so badly about losing 
their star? They shouldn’t have. During the past
undefeated campaign they stopped several individual 
stars by a concentrated effort, including such main- 
stays as Frank Hauff of Navy, Glenn Adams of 
Pennsylvania and Rocco Calvo of Cornell. 

In other words, the same brand of football con- 
demned by the Prince—that of stopping the big man— 
is practiced quite successfully by the Tigers. 


Basically, then, there was disagreement as 
to what had happened during the “game.” 
Hence we took the opportunity presented by 
the occasion to make a “real life" study of a 
perceptual problem.1


PROCEDURE 


Two steps were involved in gathering data. The first 
consisted of answers to a questionnaire designed to get 
reactions to the game and to learn something of the 
climate of opinion in each institution. This questionnaire
was administered a week after the game to both
Dartmouth and Princeton undergraduates who were 
taking introductory and intermediate psychology 
courses. 


The second step consisted of showing the same
motion picture of the 


1 We are not concerned here with the problem of
guilt or responsibility for infractions, and nothing here 
implies any judgment as to who was to blame. 

* The film shown was kindly loaned for the purpose
of the experiment by the Dartmouth College Athletic
Council. It should be pointed out that a movie of a
football game follows the ball, is thus selective, and



members of two fraternities were asked to view the
film on December 7; at Princeton, members of two 
undergraduate clubs saw the film early in January. 

The answers to both questionnaires were carefully 
coded and transferred to punch cards. 


RESULTS 


Table 1 shows the questions which received 
different replies from the two student popula- 
tions on the first questionnaire. 

Questions asking if the students had friends 
on the team, if they had ever played football 
themselves, if they felt they knew the rules of 
the game well, etc. showed no differences in 
either school and no relation to answers given 
to other questions. This is not surprising since 
the students in both schools come from es- 
sentially the same type of educational, 
economic, and ethnic background. 

Summarizing the data of Tables 1 and 2, 
we find a marked contrast between the two 
student groups. 

Nearly all Princeton students judged the 
game as “rough and dirty"—not one of them 
thought it “clean and fair.” And almost nine- 
tenths of them thought the other side started 
the rough play. By and large they felt that 
the charges they understood were being made 
were true; most of them felt the charges were 
made in order to avoid similar situations in the 
future.

When Princeton students looked at the 
movie of the game, they saw the Dartmouth 
team make over twice as many infractions as 
their own team made. And they saw the Dart- 
mouth team make over twice as many infrac- 
tions as were seen by Dartmouth students. 
When Princeton students judged these infrac- 
tions as “flagrant” or “mild,” the ratio was 
about two “flagrant” to one “mild” on the
Dartmouth team, and about one “flagrant” to 
three “mild” on the Princeton team. 

As for the Dartmouth students, while the 
plurality of answers fell in the “rough and
dirty” category, over one-tenth thought the 


omits a good deal of the total action on the field. Also, 
of course, in viewing only a film of a game, the possi- 
bilities of participation as spectator are greatly limited. 

*We gratefully acknowledge the assistance of 
Virginia Zerega, Office of Public Opinion Research, 
and J. L. McCandless, Princeton University, and E. S. 
Horton, Dartmouth College, in the gathering and 
collation of the data. 




TABLE 1
DATA FROM FIRST QUESTIONNAIRE

                                                    DARTMOUTH    PRINCETON
                                                    STUDENTS     STUDENTS
QUESTION                                            (N = 163)    (N = 161)
                                                        %            %

1. Did you happen to see the actual game between
   Dartmouth and Princeton in Palmer Stadium
   this year?
     Yes                                               33           71
     No                                                67           29

2. Have you seen a movie of the game or seen it
   on television?
     Yes, movie                                        33            2
     Yes, television                                    0            1
     No, neither                                       67           97

3. (Asked of those who answered “yes” to either
   or both of above questions.) From your
   observations of what went on at the game, do
   you believe the game was clean and fairly
   played, or that it was unnecessarily rough
   and dirty?
     Clean and fair                                     6            0
     Rough and dirty                                   24           69
     Rough and fair*                                   25            2
     No answer                                         45           29

4. (Asked of those who answered “no” on both of
   the first questions.) From what you have
   heard and read about the game, do you feel it
   was clean and fairly played, or that it was
   unnecessarily rough and dirty?
     Clean and fair                                     7            0
     Rough and dirty                                   18           24
     Rough and fair*                                   14            1
     Don’t know                                         6            4
     No answer                                         55           71

   (Combined answers to questions 3 and 4 above)
     Clean and fair                                    13            0
     Rough and dirty                                   42           93
     Rough and fair*                                   39            3
     Don’t know                                         6            4

5. From what you saw in the game or the movies,
   or from what you have read, which team do you
   feel started the rough play?
     Dartmouth started it                              36           86
     Princeton started it                               2            0
     Both started it                                   53           11
     Neither                                            6            1
     No answer                                          3            2

6. What is your understanding of the charges
   being made?**
     Dartmouth tried to get Kazmaier                   71           47
     Dartmouth intentionally dirty                     52           44
     Dartmouth unnecessarily rough                      8           35

7. Do you feel there is any truth to these
   charges?
     Yes                                               10           55
     No                                                57            4
     Partly                                            29           35
     Don’t know                                         4            6

8. Why do you think the charges were made?
     Injury to Princeton star                          70           23
     To prevent repetition                              2           46
     No answer                                         28           31

* This answer was not included on the checklist but was written
in by the percentage of students indicated.
** Replies do not add to 100% since more than one charge could
be given.

TABLE 2

DATA FROM SECOND QUESTIONNAIRE CHECKED WHILE
SEEING FILM

TOTAL NUMBER OF IN-

GROUP

Dartmouth students
Princeton students      49   9.8*   5.7   4.2   8.5

* Significant at the .01 level.


game was "clean and fair" and over a third 
introduced their own category of "rough and 
fair" to describe the action. Although a third 
of the Dartmouth students felt that Dartmouth 
was to blame for starting the rough play, the 
majority of Dartmouth students thought both 
sides were to blame. By and large, Dartmouth 
men felt that the charges they understood 
were being made were not true, and most of
them thought the reason for the charges was
Princeton's concern for its football star. 

When Dartmouth students looked at the 
movie of the game they saw both teams make 
about the same number of infractions. And 
they saw their own team make only half the 
number of infractions the Princeton students 
saw them make. The ratio of "flagrant" to 
“mild” infractions was about one to one when 
Dartmouth students judged the Dartmouth 
team, and about one “flagrant” to two “mild” 
when Dartmouth students judged infractions 
made by the Princeton team. 

It should be noted that Dartmouth and 
Princeton students were thinking of different 
charges in judging their validity and in assign- 
ing reasons as to why the charges were made. 
It should also be noted that whether or not 
students were spectators of the game in the 
stadium made little difference in their re- 
sponses. 


INTERPRETATION: THE NATURE OF A 
SOCIAL EVENT*


It seems clear that the “game” actually was 
many different games and that each version of 
the events that transpired was just as “real” 
to a particular person as other versions were to 
other people. A consideration of the experiential 
phenomena that constitute a “football game” 
for the spectator may help us both to account 
for the results obtained and illustrate some- 
thing of the nature of any social event.

Like any other complex social occurrence, a 
“football game” consists of a whole host of 
happenings. Many different events are oc- 
curring simultaneously. Furthermore, each 
happening is a link in a chain of happenings,
so that one follows another in sequence. The
"football game," as well as other complex
social situations, consists of a whole matrix of
events. In the game situation, this matrix of 
events consists of the actions of all the players, 
together with the behavior of the referees and 
linesmen, the action on the sidelines, in the 
grandstands, over the loud-speaker, etc. 

Of crucial importance is the fact that an
"occurrence" on the football field or in any
other social situation does not become an 


* The interpretation of the nature of a social event 
sketched here is in part based on discussions with 
Adelbert Ames, Jr., and is being elaborated in more 
detail elsewhere. 




experiential “event” unless and until some
significance is given to it: an “occurrence” 
becomes an “event” only when the happening 
has significance. And a happening generally 
has significance only if it reactivates learned 
significances already registered in what 
we have called a person’s assumptive form- 
world (1). 

Hence the particular occurrences that
different people experienced in the football
game were a limited series of events from the
total matrix of events potentially available to 
them. People experienced those occurrences 
that reactivated significances they brought to 
the occasion; they failed to experience those 
occurrences which did not reactivate past 
significances. We do not need to introduce 
"attention" as an “intervening third" (to 
paraphrase James on memory) to account for 
the selectivity of the experiential process. 

In this particular study, one of the most 
interesting examples of this phenomenon was a 
telegram sent to an officer of Dartmouth 
College by a member of a Dartmouth alumni 
group in the Midwest. He had viewed the 
film which had been shipped to his alumni 
group from Princeton after its use with 
Princeton students, who saw, as we noted, 
an average of over nine infractions by Dart- 
mouth players during the game. The alumnus, 
who couldn't see the infractions he had heard 
publicized, wired:


Preview of Princeton movies indicates considerable 
cutting of important part please wire explanation and 
possibly air mail missing part before showing scheduled
for January 25 we have splicing equipment.


The “same” sensory impingements ema- 
nating from the football field, transmitted 
through the visual mechanism to the brain, 
also obviously gave rise to different experiences 
in different people. The significances assumed 
by different happenings for different people
depend in large part on the purposes people
bring to the occasion and the assumptions
they have of the purposes and probable behavior
of other people involved. This was
amusingly pointed out by the New York
Herald Tribune’s sports columnist, Red Smith,
in describing a prize fight between Chico
Vejar and Carmine Fiore in his column of
December 21, 1951. Among other things, he
wrote:




You see, Steve Ellis is the proprietor of Chico Vejar,
who is a highly desirable tract of Stamford, Conn., 
welterweight. Steve is also a radio announcer. Ordinarily 
there is no conflict between Ellis the Brain and Ellis 
the Voice because Steve is an uncommonly substantial 
lump of meat who can support both halves of a split 
personality and give away weight on each end without 
missing it. 

This time, though, the two Ellises met head-on, 
with a sickening, rending crash. Steve the Manager 
sat at ringside in the guise of Steve the Announcer 
broadcasting a dispassionate, unbiased, objective 
report of Chico's adventures in the ring. . . . 

Clear as mountain water, his words came through, 
winning big for Chico. Winning? Hell, Steve was 
slaughtering poor Fiore. 

Watching and listening, you could see what a 
valiant effort the reporter was making to remain cool 
and detached. At the same time you had an illustra- 
tion of the old, established truth that when anybody 
with a preference watches a fight, he sees only what he 
prefers to see. 

That is always so. That is why, after any fight that 
doesn't end in a clean knockout, there always are at 
least a few hoots when the decision is announced. A 
guy from, say, Billy Graham's neighborhood goes to 
see Billy fight and he watches Graham all the time. 
He sees all the punches Billy throws, and hardly any 
of the punches Billy catches. So it was with Steve. 

"Fiore feints with a left," he would say, honestly 
believing that Fiore hadn't caught Chico full on the 
chops. “Fiore’s knees buckle," he said, “and Chico 
backs away.” Steve didn’t see the hook that had 
driven Chico back. . . . 


In brief, the data here indicate that there is
no such "thing" as a "game" existing “out 
there" in its own right which people merely 
"observe." The “game” “exists” for a person 
and is experienced by him only in so far as 
certain happenings have significances in terms 
of his purpose. Out of all the occurrences 
going on in the environment, a person selects 
those that have some significance for him from 
his own egocentric position in the total matrix. 

Obviously in the case of a football game, the 
value of the experience of watching the game 
is enhanced if the purpose of “your” team is 
accomplished, that is, if the happening of the 
desired consequence is experienced—i.e., if
your team wins. But the value attribute of 
the experience can, of course, be spoiled if the 
desire to win crowds out behavior we value 
and have come to call sportsmanlike. 

The sharing of significances provides the
links except for which a “social” event would 
not be experienced and would not exist for 
anyone. 

A “football game” would be impossible 
except for the rules of the game which we
bring to the situation and which enable us
to share with others the significances of various 
happenings. These rules make possible a 
certain repeatability of events such as first 
downs, touchdowns, etc. If a person is un- 
familiar with the rules of the game, the be- 
havior he sees lacks repeatability and con- 
sistent significance and hence "doesn't make 
sense." 

And only because there is the possibility of 
repetition is there the possibility that a 
happening has a significance. For example, the 
balls used in games are designed to give a high 
degree of repeatability. While a football is 
about the only ball used in games which is 
not a sphere, the shape of the modern football 
has apparently evolved in order to achieve a 
higher degree of accuracy and speed in forward 
passing than would be obtained with a 
spherical ball, thus increasing the repeatability 
of an important phase of the game. 

The rules of a football game, like laws, 
rituals, customs, and mores, are registered and 
preserved forms of sequential significances 
enabling people to share the significances of 
occurrences. The sharing of sequential sig- 
nificances which have value for us provides 
the links that operationally make social events 
possible. They are analogous to the forces of 
attraction that hold parts of an atom to- 
gether, keeping each part from following its 
individual, independent course. 

From this point of view it is inaccurate and 
misleading to say that different people have 
different “attitudes” concerning the same 
“thing.” For the "thing" simply is not the 
same for different people whether the “thing” 
is a football game, a presidential candidate, 
Communism, or spinach. We do not simply 
“react to” a happening or to some impinge- 
ment from the environment in a determined 
way (except in behavior that has become 
reflexive or habitual). We behave according to 
what we bring to the occasion, and what each 
of us brings to the occasion is more or less 
unique. And except for these significances 
which we bring to the occasion, the happenings 
around us would be meaningless occurrences, 
would be “inconsequential.” 

From the transactional view, an attitude 
is not a predisposition to react in a certain 
way to an occurrence or stimulus “out there” 
that exists in its own right with certain fixed
characteristics which we "color" according to
our predisposition (2). That is, a subject does 
not simply “react to" an “object.” An attitude 
would rather seem to be a complex of registered 
significances reactivated by some stimulus 
which assumes its own particular significance
for us in terms of our purposes. That is, the
object as experienced would not exist for us
except for the reactivated aspects of the 
form-world which provide particular significance
to the hieroglyphics of sensory
impingements.


REFERENCES 


1. CANTRIL, H. The “why” of man’s experience. New
York: Macmillan, 1950.

2. KILPATRICK, F. P. (Ed.) Human behavior from the
transactional point of view. Hanover, N. H.:
Institute for Associated Research, 1952.


Received October 9, 1952. 


A CASE OF MULTIPLE PERSONALITY 


CORBETT H. THIGPEN AND HERVEY CLECKLEY
Department of Psychiatry and Neurology, Medical College of Georgia


The psychiatric manifestation called
multiple personality has been extensively
discussed. So too have the unicorn
and the centaur. Who has not read of
these legendary quadrupeds? Their pictures 
are, perhaps tiresomely, familiar to any school- 
boy. Can one doubt that during medieval times 
many twilight encounters with the unicorn 
were convincingly reported? Surely in the 
days of Homer there were men of Thessaly or 
Beotia who had seen, or even ridden, centaurs 
almost as wise as Chiron. 

The layman who at college took a course in 
psychology may feel that for him dual personal- 
ity, or multiple personality, is a familiar subject. 
Some psychiatrists’ reactions suggest they are 
inclined to dismiss this subject as old hat. 
Nevertheless, like the unicorn and the centaur 
in some respects, multiple personality, despite
vivid appearances in popularized books on 
psychology (2), is not commonly encountered 
in the full reality of life (1, 16, 17). Nearly all 
those perplexing reports of two or more people 
in one body, so to speak, that arouse a unique
interest in the classroom, are reports of obser- 
vations made in a relatively distant past. The 
most significant manifestations of this sort
discussed in the current literature occurred in 
patients studied half a century or more ago 
(13, 23). It is scarcely surprising that practical 
psychiatrists today, never having directly
observed such things as Morton Prince found 
in Miss Beauchamp or as Azam reported 
of Felida, might hold a tacitly skeptical 
attitude toward such archaic marvels and 
miracles. In the fields of internal medicine 
and chemistry the last, or even the middle, 
decades of the nineteenth century are close to 
us. In the relatively new field of psychopathol- 
ogy they are almost primeval, a dim dawn era 
in which we find it easy to suspect that a 
glimpse of a rhinoceros might have led to
descriptions of the unicorn, or the sound of
thunder been misinterpreted as God's literal
voice.

A reserved judgment toward what cannot be 
regularly demonstrated is not necessarily
deplorable. Some current tendencies suggest
that our youthful branch of medicine may 
not yet have emerged from its primordial and 
prerational phase. The discovery of orgone by 
one of our erstwhile leaders in the development 
of “psychodynamics” should not be ignored 
(4, 25). Enthusiastically adduced "proof" from 
an adult’s dream that he was as an embryo 
significantly traumatized by fear of his father’s 
penis, which during intercourse threatened 
him from his mother’s vagina, is, we believe, 
the sort of evidence toward which our “re- 
sistance” is not without value (21). Despite 
Morton Prince's exquisitely thorough study of 
the celebrated Miss Beauchamp (23, 24) it is 
not surprising that decades ago McDougall 
should have warned us: 

It has been suggested by many critics that, in the 
course of Prince's long and intimate dealings with 
the case, involving as it did the frequent use of hyp- 
nosis, both for exploratory and therapeutic purposes, 
he may have moulded the course of its development to 
a degree that cannot be determined. This possibility 
cannot be denied (16, p. 497). 


It is perhaps significant to note that, despite 
the light (or at least the half-light) they throw 
on most of the puzzling manifestations of 
psychiatric disorder, the studies of Prince and 
others on multiple personality are not even 
mentioned in some of the best and most 
popular textbooks of psychiatry used in our 
medical schools today (19, 26). When men- 
tioned at all in such works, the subject is 
usually dismissed with a few words (11, 20). 
It is particularly noteworthy that Freud, dur- 
ing his years of assiduous investigation, 
apparently displayed no appreciable interest 
in the development of this disorder. Erickson 
and Kubie cite one brief allusion (9) which 
they term his “only reference to the problem” 
(6).

Psychiatrists who would not deny outright
the truly remarkable things reported long ago 
about multiple personality, even when accept- 
ing them passively in good faith seem often to 
do so perfunctorily. In the midst of clinical 
work, with its interesting immediate experi-
ences and pressing demands, few are likely to 
focus a major interest on what is known to 
them only through dust-covered records, on 
what they have never encountered, and don't 
expect to deal with. During the complications
and excitements of a stormy sea voyage even 
the most sincere believer in the miracle of 
Jonah will probably not look to whales for his 
chief solution of problems that may arise from 
shipwreck. 


Our direct experience with a patient has 
forced us to review the subject of multiple 
personality. It has also provoked in us the 
reaction of wonder, sometimes of awe. 

One of us (C. H. T.) had for several months 
been treating a twenty-five-year-old married
woman who was referred because of "severe 
and blinding headaches." At the first interview 
she also mentioned “blackouts” following 
headache. These were vaguely described by 
the patient. Her family was not aware of 
anything that would suggest a real loss of 
consciousness or serious mental confusion. 
During a series of interviews which were 
irregular, since the patient had to come from 
some distance away, several important emo- 
tional difficulties were revealed and discussed. 
Encouraging symptomatic improvement oc- 
curred, but it was plain that this girl’s major 
problems had not been settled. To the thera- 
pist, Eve White—as we shall call her—was an 
ordinary case with commonplace symptoms 
and a relatively complex but familiar con-
stellation of marital conflicts and personal
frustrations. We were puzzled during therapy 
about a recent trip for which she had no 
memory. Hypnosis was induced and the 
amnesia cleared up promptly. Several days 
after a visit to the office a letter was received. 
(Exhibit 1.)

What was the meaning of such a letter?
Though unsigned, the postmark, the content,
and the familiar penmanship in most of the 
message revealed to the therapist that this had 
been written by Eve White. The effect of this 
letter on the therapist was considerable. It
raised puzzling questions for which there were 
no answers and set in motion thoughts that 
pursued various and vague directions. Had 
some child found the uncompleted page, 
scribbled those words, and, perhaps as a whim, 
mailed it in an already addressed envelope? 
Perhaps. The handwriting of the last para- 
graph to be sure suggested the work of a 
child. Could Eve White herself, as a puerile
prank, have decided to disguise her character-
istic handwriting and added this inconsequential
note? And if so, why? Mrs. White had appeared
to be a circumspect, matter of fact person,
meticulously truthful and consistently sober
and serious about her grave troubles. It was
rather difficult to imagine her becoming playful
or being moved by an impulse to tease, even on
a more appropriate occasion. The “blackouts”
which she had rather casually mentioned, but
which did not seem to disturb her very much,
suggested of course that a somnambulism or
brief fugue might have occurred.

Exhibit 1
This letter in retrospect was the first intimation that
our patient was unusual. The dramatic and unexpected
revelation of the second personality shortly followed.

On her next visit she denied sending the 
letter, though she recalled having begun one 
which she never finished. She believed she had 
destroyed it. During this interview Eve White, 
ordinarily an excessively self-controlled woman, 
began to show signs of distress and agitation. 
Apprehensively and reluctantly she at last 
formulated a question: Did the occasional 
impression of hearing an imaginary voice 
indicate that she was “insane”? 

To the therapist this information was start- 
ling. Nothing about Eve White suggested 
even an early schizoid change. Her own atti- 
tude toward what she now reported was in no 
respect like any of the various attitudes of 
patients who are in the ordinary sense experi-
encing auditory hallucinations. Yet, she in-
sisted with painful embarrassment, she had on
several occasions over the last few months
heard briefly but distinctly a voice addressing 
her. Something about her reaction to this may 
be conveyed if we compare it to what we can 
imagine an experienced psychiatrist in robust
mental health might feel if, with full retention
of insight, he heard himself similarly addressed.
While the therapist, hesitating a moment in
wonder, sought for an adequate reply, an
abstruse and inexplicable expression came,
apparently unprompted by volition, over
Eve White's familiar countenance. As if
seized by a sudden pain she put both hands to
her head. After a tense moment of silence, her
hands dropped. There was a quick, reckless
smile and, in a bright voice that sparkled, she
said, “Hi there, Doc!”

The demure and constrained posture of
Eve White had melted into buoyant repose.
With a soft and surprisingly intimate syllable of 
laughter, she crossed her legs. Disconcerted as 
he was by unassimilated surprise, the therapist 
noted from the corner of his awareness some- 
thing distinctly attractive about them, and also 
that this was the first time he had received 
such an impression. There is little point in
attempting here to give in detail the differences 
between this novel feminine apparition and the 
vanished Eve White. Instead of that retiring 
and gently conventional figure, there was in 
the newcomer a childishly daredevil air, an
erotically mischievous glance, a face marvel- 
lously free from the habitual signs of care, 
seriousness, and underlying distress, so long 
familiar in her predecessor. This new and 
apparently carefree girl spoke casually of Eve 
White and her problems, always using she or 
her in every reference, always respecting the 
strict bounds of a separate identity. When 
asked her own name she immediately replied, 
“Oh, I’m Eve Black.” 

It is easy to say that this new voice was dif- 
ferent, that the basic idiom of her language 
was plainly not that of Eve White. A thousand 
minute alterations of manner, gesture, expres- 
sion, posture, of nuances in reflex or instinctive 
reaction, of glance, of eyebrow tilting and eye 
movement, all argued that this could only be 
another woman. It is not possible to say just 
what all these differences were. 

It would not be difficult for a man to distin- 
guish his wife, or perhaps even his secretary, 
if she were placed among a hundred other 
women carefully chosen because of their 
resemblance to her, and all dressed identically. 
But few would wager that, however articulate 
he might be, he could tell a stranger, or even 
someone very slightly acquainted with her, 
how to accomplish this task. If he tries to tell 
us how he himself recognizes her, he may 
accurately convey something to us. But what 
he can convey, no matter how hard he tries, is 
only an inconsequential fragment. It is not 
enough to help us when we set out to find her. 
So, too, we are not able to tell adequately what 
so profoundly distinguishes from Eve White 
the carefree girl who took her place in this 
vivid mutation. 

Even before anything substantial of her 
history could be obtained, the therapist 
reacted to the new presence with feelings that 
momentarily recalled from distant memory
these words:


The devil has entered the prompter’s box 
And the play is ready to start. 


Over a period of 14 months during a series of 
interviews totaling approximately 100 hours, 
extensive material was obtained about the 
behavior and inner life of Eve White—and of 
Eve Black. It is our plan to report on this more
adequately in a book-length study. Here space 
limits our presentation to a few details. 

Eve Black, so far as we can tell, has enjoyed 
an independent life since Mrs. White's early 
childhood.¹ She is not a product of disruptive
emotional stresses which the patient has suf- 
fered during recent years. Eve White appar- 
ently had no knowledge or suspicion of the 
other’s existence until some time after she 
appeared unbidden before the surprised 
therapist. Though Mrs. White has learned that 
there is a Miss Black during the course of 
therapy, she does not have access to the 
latter's awareness. When Eve Black is “out,” 
Eve White remains functionally in abeyance, 
quite oblivious of what the coinhabitant of her 
body does, and apparently unconscious.

On the contrary, Eve Black preserves 
awareness while absent. Invisibly alert at some 
unmapped post of observation, she is able to 
follow the actions and the thoughts of her 
spiritually antithetical twin. The hoydenish 
and devil-may-care Eve Black “knows” and 
can report what the other does and thinks, and 
describes her feelings. Those feelings, however, 
are not Eve Black’s own. She does not partici-
pate in them. Eve White’s genuine and natural
distress about her failing marriage is regarded
by the other as silly. Eve White’s love and deep
concern for her only child, a little girl of four, 
is to us and to all who know her, warm, real, 
consistent, and impressive. Eve Black, who 
shares her memory and verbally knows her 
thoughts, discerns her emotional reactions and 
values only as an outsider. They are for the 
outsider something trite, bothersome, and
insignificant. The devotion of this mother for
her child, as an empty definition, is entirely
familiar to the lively and unworried Eve Black.
Its substance and nature are, however, so
clearly outside her personal experience that
she can evaluate it only as “something pretty
corny.”

¹ The question: “How can the various personalities
be called out?” has been asked. After the original
spontaneous appearance of Eve Black it was at first
necessary for Eve White to be hypnotized in order for us
to talk with Eve Black. How Eve Black could “pop out”
of her own accord at unpredictable times and yet could
not come out on request, we do not know. Under
hypnosis of Eve White, Eve Black could very easily be
called forth. After a few hypnotic sessions, we merely
had to request Eve White to let us speak to Eve Black.
Then we called Eve Black’s name, and Eve Black
would come forth. The reverse was true when Eve
Black was out and we wished to speak with Eve White.
Hypnosis was no longer necessary for the purpose of
obtaining the changes. This made things simpler for us
but complicated Eve White's life considerably because
Eve Black found herself able to “take over” more
easily than before. A third personality, Jane, to be
described below, emerged spontaneously and we have
never had to employ hypnosis to reach her.

During the temporary separation of her 
parents, which may become permanent, this 
little girl is living with her grandparents in a
village. Because her earnings are necessary for
her child's basic welfare, the mother has no
choice but to work and live in a city approx-
imately a hundred miles from the child. Hav-
ing apparently known little but unhappiness
with her husband, she was finally forced to the
conclusion that her young and vulnerable child
had little chance of happy or normal develop-
ment in the home situation, which, despite her
best efforts, continually grew worse. She now
endures the loneliness, frustration, and grief of
separation from her warmly loved daughter,
who is the primary object of her life and feeling,
and who, she has good reason to fear, is likely to
grow up apart from her. Perhaps, it seems to
her sometimes, she will become to her as
years pass little more than a coolly accepted
stranger. 

Vulnerable, uningenious, and delicately 
feminine, Eve White characteristically pre-
serves a quiet dignity about personal sorrow, a
dignity unpretentiously stoic. Under hypnosis
one can come closer to the sadness and the
lonely despair she feels it her task not to dis-
play. Even then no frantic weeping occurs, no
outcries of self-pity. Her quiet voice remains
level as she discusses matters that leave her
cheeks at last wet from silent tears. 

Despite access to this woman's “thoughts” 
Eve Black has little or no real compassion for 
her. Nor does she seem in any important sense 
actively, or purposefully, cruel. Neutral or
immune to major affective events in human
relations, an unparticipating onlooker, she is
apparently almost as free of hatefulness, or of
mercy, or of comprehension, as a bright
feathered parakeet who chirps undisturbed
while watching a child strangle to death.

It has been mentioned that Eve Black's
career has been traced back to early childhood.
She herself freely tells us of episodes when she
emerged, usually to engage in acts of mischief
or disobedience. She lies glibly and without
compunction, so her account alone can never
be taken as reliable evidence. Since Eve White, 
whose word on any matter has always proved 
good, still has no access to the other's current 
awareness or her memory and, indeed, did not 
until recently even faintly suspect her exist- 
ence, it has been impossible through her to 
check fully and immediately on Eve Black's 
stories. Her memory has, however, afforded 
considerable indirect evidence since she has 
been able to confirm reports of punishments 
she received, of accusations made against her, 
for deeds unknown to her but described to us 
by Eve Black. 

Some stories have been substantiated 
through others. Both of this patient's parents, 
as well as her husband, have been available 
for interviews. They recall several incidents 
that Eve Black had previously reported to us. 
For instance, the parents had had to punish 
their ordinarily good and conforming six-year- 
old girl for having disobeyed their specific rule 
against wandering through the woods to play 
with the children of a tenant farmer. They 
considered this expedition dangerous for so 
young a child, and their daughter's unaccount- 
able absence had caused them worry and dis- 
tress. On her return Eve received a hearty 
whipping despite her desperate denials of 
wrongdoing or disobedience. In fact these 
very denials added to her punishment, since 
the evidence of her little trip was well estab- 
lished and her denial taken as a deliberate 
lie. Eve Black had previously described this 
episode to us in some detail, expressing amuse-
ment about “coming out” to commit and en-
joy the forbidden adventure and withdrawing
to leave the other Eve, sincerely protesting 
her innocence, to appreciate all sensations of 
the whipping. 

The adult Eve White recalled this and
several other punishments which she had no
way of understanding and which sometimes
bewildered her in her relations with her
parents.

Irresponsibility and a shallowly hedonistic 
grasping for ephemeral excitements or pleas- 
ures characterize Eve Black's adult behavior. 
She succeeded in concealing her identity not 
only from the other Eve but also from her 
parents and the husband. She herself denies
marriage to this man, whom she despises, and 
any relation to Eve White's little girl except 


t ` 
139 


that of an unconcerned bystander. Though she ` 
had often “come out” in the presence of all 
these people, she went unrecognized until she 
agreed to reveal herself to them in the thera-
pist's office.

Her wayward behavior, ill will, harshness, 
and occasional acts of violence, observed by 
Mr. White and the parents, were attributed to 
unaccountable fits of temper in a woman 
habitually gentle and considerate. 

During her longer periods “out,” when she 
expresses herself more freely in behavior so 
unlike that of Eve White, she avoids her 
family and close friends, and seeks the com- 
pany of strangers or of those insufficiently 
acquainted with her alternate to evaluate
accurately the stupendous transformation. 

Once we had seen and spoken with Eve 
Black, it seemed to us at first scarcely possible 
that, even in the same body as her alternate, 
she could for so long have concealed her 
separate identity from others. Yet, who among 
those acquainted with her would be likely to 
suspect, however unlike herself Eve appeared 
at times to be, such a situation as that volun- 
tarily revealed to us by the patient? No 
matter how many clues one is given, no matter 
how obvious the clues, one will not be led to a 
conclusion that is inconceivable. One will seek 
explanations for the problem only among 
available hypotheses. 

Not knowing the only concept into which 
successive details of perception will fit, even a 
very astute man may observe a thousand 
separate features of something his imagination 
has never shaped without grasping the gestalt, 
without being able to put into a recognizable 
whole the details he has so clearly detected. 
Only our previous familiarity with three- 
dimensional space enables us to see the 
representation of depth in a picture. What is 
for us still unconceived can give us a thousand 
hints, boldly flaunt before us its grossest 
features, and remain for us undelineated, 
formless, uncomprehended as an entity. 

The astonishingly incompatible gestures, 
expressions, attitudes, mannerisms, and be- 
havior which Eve occasionally displayed be- 
fore intimates provoked thought and wonder, 
demanded explanation. But who in the position 
of these people would be likely to find or create 
in his mind the hypothesis that forms a recog- 
nizable image? Let us remember too that Eve
Black, until she voluntarily named herself to 
the therapist, meant to remain unrecognized. 
When it suits her, she deliberately and skill- 
fully acts so as to pass herself off as Eve White, 
imitating her habitual tone of voice, her
gestures, and attitudes. Let us not forget that 
she is shrewd. Would it not, after all, require a 
sledge-hammer blow from the obvious to drive 
into an unsuspecting acquaintance the only 
hypothesis that would lead to her recognition? 

Psychometric and projective tests conducted 
on the two Eves by a well-qualified expert were 
reported thus: 


Psychological Consultation Report 


This twenty-five-year-old married female patient 
was referred for psychological examination with a 
provisional diagnosis of dual personality. Two complete 
psychological examinations were requested, one of the 
predominant personality, Mrs. White, the other, . . . of 
the secondary personality, Miss Black. 

The patient is the oldest of three siblings, having 
twin sisters. She quit school two months before gradu- 
ation from high school. She was employed as a telephone 
operator. She has been married six years and has a girl
four years old. Patient states that she did things re-
cently she cannot remember having done, and expresses
serious concern about this condition. The following 
psychological tests were administered in both exami- 
nations: 

Wechsler-Bellevue Intelligence Scale 
Wechsler Memory Scale 

Drawings of Human Figures 
Rorschach 

Test behavior: Patient was neat, friendly, and co- 
operative. However, while Mrs. White was more serious, 
more conscientious, and displayed more anxiety, Miss 
Black appeared somewhat less anxious and was satisfied 
with giving more superficial responses. Still the basic 
behavior pattern was very similar in both personalities, 
indicating that inhibitory forces were not markedly 
abolished even in the role of the desired personality. 
Speech was coherent, and there were no distortions in 
ideation or behavior according to the assumed person- 
ality. No psychotic deviations could be observed at the 
present time. 

Test results: While Mrs. White is able to achieve an
IQ of 110 on the Wechsler-Bellevue Intelligence Scale,
Miss Black attains an IQ of 104 only. There is evidence 
that the native intellectual endowment is well within 
the bright normal group; however, in Mrs. White’s 
case anxiety and tenseness interfere, in Miss Black’s 
superficiality and slight indifference as to achievement
are responsible for the lower score. While Mrs. White
shows more obsessional traits, Miss Black shows more
hysterical tendencies in the records. It is interesting to
note that the memory function in Miss Black is on the
same level as her Intelligence Quotient, while Mrs.
White’s memory function is far above her IQ, although
she complained of a disturbance of memory. The only
difficulty encountered by both personalities is on recall
of digits, a performance in which telephone operators
usually excel! On the other hand, the Rorschach record
of Miss Black is by far healthier than the one of Mrs.
White. In Miss Black’s record a hysterical tendency is
predominant, while Mrs. White’s record shows con-
striction, anxiety, and obsessive compulsive traits.
Thus Miss Black is able to conform with the environ-
ment, while Mrs. White is rigid and not capable of
dealing with her hostility.

² Eve White’s husband and parents were troubled by
the unexplicable changes in her. They assumed them to
be “fits of temper” about which she lied. Her mother
called the fugues of her daughter these “strange little
habits.” Apparently these people observed the same
changes that we have observed, but unlike ourselves,
they have not had the conception of multiple person-
ality in mind. Lacking it, they could not use it as an
explanatory construct.

Personality dynamics: A comparison of the pro- 
jective tests indicates repression in Mrs. White and 
regression in Miss Black. The dual personality appears 
to be the result of a strong desire to regress to an early 
period of life, namely the one before marriage. Miss 
Black is actually the maiden name of Mrs. White. 
Therefore, there are not two different personalities with 
completely dissimilar ideation, but rather one person- 
ality at two stages of her life. As is characteristic for 
this type of case, the predominant personality is amnesic 
for the existence, activities, or behavior of the secondary 
or subordinate system, while the secondary personality 
is aware and critical of the predominant personality’s 
activities and attitudes. The latter reaction is quite 
similar to the ego-conflict in obsessive compulsive 
disturbances. 

Mrs. White admits difficulty in her relation with her 
mother, and her performance on the Rorschach and 
drawings indicate conflict and resulting anxiety in her
role as a wife and mother. Only with strong conscious 
effort can she compel herself to subject herself to these 
roles. The enforced subjection results in ever increasing 
hostility. This hostility, however, is not acceptable to 
her, and activates a defense mechanism of regression to 
avoid severe guilt feelings, by removing the entire 
conflictual situation from conscious awareness. At the 
same time, the new situation (in which she plays the 
role of Miss Black) permits her to discharge some of her 
hostility towards Mrs. White. Miss Black on the other 
hand has regained her previous status of freedom from 
marital and maternal conflicts, and thus has liberated 
herself from the insoluble situation in which Mrs. White 
found herself through her marriage. In addition, she can 
avert the (in her conviction) inevitable spiritual loss of
her child. Thus, it is not surprising that she shows con- 
tempt for Mrs. White who permitted herself to become 
involved in such a situation because of her lack of fore- 
sight, as well as her lack of courage to forcefully solve 
the dilemma.

Actually the problem started at a much earlier period 
of life, with a strong feeling of rejection by her parents,
especially after the birth of her twin sisters. Mrs. White 
loves them dearly, Miss Black despises them. In this 
connection an episode is related by Miss Black. After 
quitting school to help support the family, she (that is
to say Mrs. White) sent home money to be used for 
overcoats for her twin sisters, denying herself a badly 
wanted wristwatch. When the money was used to buy 
them two wristwatches instead of overcoats, she re-
acted with strong, but repressed, hostility. Signifi-
cantly, she removed her wristwatch while examined as 
Mrs. White, stating that she doesn't like jewelry. There 
are several illustrations of her strong sense of rejection 
as well as sibling rivalry in her records. 

Leopold Winter, Ph.D.
Clinical Psychologist
U. S. Veterans Administration Hospital
Augusta, Georgia
July 2, 1952


With the circumspect Eve White oblivious 
of her escapades, Miss Black once recklessly 
bought several expensive and unneeded new 
dresses and two luxurious coats. Sometimes 
she revels in cheap night clubs flirting with 
strange men on the make. Insouciantly she 
pursues her irresponsible way, usually amused, 
sometimes a little bored, never alarmed or
grieved or seriously troubled. She has, appar- 
ently, been unmoved by any sustaining pur- 
pose, unattracted by any steady goal, 
prompted only by the immediate and the 
trivial. 

Eve White's husband, on discovering the 
valuable outlay of new clothes, which the other 
Eve had hidden carefully away, lost his temper 
and abused his wife for wantonly plunging him 
into debt. He found no way to accept her 
innocent denials as genuine but was at length 
assuaged in wrath by her wholehearted agree- 
ment that it would be disastrous for them to 
run up such a bill, and her promptness in
returning all these garments to the store.³ Eve
White has told us of many real and serious
incompatibilities with her husband. Even if
the two were unmolested by an outsider, it is
doubtful if the imperfections of this marriage,
its unhappiness, and the threats to its continu-
ation could be alleviated. Adverse acts and
influence by an insider have been peculiarly 
damaging and pernicious. Though Eve Black 
does not apparently follow a consistent purpose 
to disrupt the union, or regularly go out of her 
way to make trouble for the couple, her
typical behavior often [adds to their]
difficulties.


³ Mrs. White apparently failed to produce a satis-
factory rationalization. This is true for all of her fugue
states. She did tell us she suspected that her husband
may have planted the clothes in order to make it appear
that she was “insane.” She did not, however, seem to
come to grips with the problem. Apparently finding it,
along with so many other problems, too much for her,
she took an attitude in some ways like that of Scarlett
O'Hara when the latter would tell herself “Well,
tomorrow will be another day.”


“When I go out and get drunk,” Eve Black 
with an easy wink once said to both of us, 
“she wakes up with the hangover. She wonders 
what in the hell's made her so sick." 

Though as a rule only indifferent, passively
callous to her alternate’s child, Eve Black 
once in the past became irritated with her and 
hurt her. Apparently she might have done her 
serious harm had her husband not restrained 
her. This act she denied and lied about con- 
sistently though the evidence for it through 
others is strong. Later she flippantly confessed, 
giving as her reason, “The little brat got on my 
nerves.” 

Abstract terms and other descriptive words 
are not likely to convey much of what one 
experiences directly of a human being, of a 
specific personal entity. Nor could any list of 
ten thousand such items be even near complete. 
Let us, nevertheless, set down for what they 
are worth a few points: 


Eve White Eve Black 


Demure, retiring, in some Obviously a party girl. 
respects almost saintly Shrewd, childishly vain, 


Face suggests a quiet 
sweetness; the expres- 
sion in repose is pre- 
dominantly one of con- 
tained sadness 

Clothes: simple and con- 
servative, neat and in- 
conspicuously attractive 

Posture: tendency to a 
barely discernible stoop 
or slump. Movements 
careful and dignified 

Reads poetry and likes to 
compose verse herself 

Voice always softly modu- 
lated, always influenced 
by a specifically feminine 
restraint 

Almost all who know her 
express admiration and 
affection for her. She 
does not provoke envy. 
Herstrength of character 
is more passive than 
active. Steadfast on de- 
fense but lacking initia- 
tive and boldness to 
formulate strategy of at- 
tack 

An industrious and able 
worker; also a competent 
housekeeper and a skill- 
ful cook. Not colorful or 
glamorous. Limited in 
spontaneity 


and egocentric 

Face is pixie-like; eyes 
dance with mischief as 
if Puck peered through 
the pupils 

Expression rapidly shifts 
in a light cascade of fun- 
loving willfulness. The 
eyes are as inconstant 
as the wind. This face 
has not and will never 


only half-true naivete 
Voice a little coarsened, 


“discultured,” with 
echoes or implications of 
mirth and teasing. 
Speech richly vernacular 
and liberally seasoned 
with spontaneous gusts 
of rowdy wit 

A devotee of pranks. Her 
repeated irresponsibili- 
ties have cruel results 
on others. More heedless 
and unthinking, how- 
ever, than deeply mali- 
cious. Enjoys taunting 
and mocking the Sia- 
mese alternate 

All attitudes and passions 
whim-like and momen- 
tary. Quick and vivid 
flares of many light 


e 


142 


Consistently uncritical of 
others. Tries not to 
blame husband for 
marital troubles. Noth- 
ing suggests pretense or 
hypocrisy in this char- 
itable attitude 

Though not stiffly prudish 
and never self-righteous, 
she is seldom lively or 
playful or inclined to 
tease or tell a joke. Sel- 
dom animated 

Her presence resonates un- 
expressed devotion to 
her child. Every act, 
every gesture, the dem- 
onstrated sacrifice of 
personal aims to work 
hard for her little girl, is 
consistent with this love 

Cornered by bitter cir- 
cumstances , threatened 
with tragedy, her en- 
deavors to sustain her- 
self, to defend her child, 
are impressive 

This role in one essentially 
so meek and fragile em- 
bodies an unspoken 
pathos. One feels some- 
how she is doomed to be 
overcome in her present 
situation 

No allergy to nylon has 
been reported 


C. H. THIGPEN AND H. CLECKLEY 


feelings, all: ephemeral 
Immediately likable and 
attractive. A touch of 
sexiness seasons every 
word and gesture. 
Ready for any little, 
irresponsible adventure 
Dress is becoming and a 
little provocative. Pos- 
ture and gait suggest 
light-heartedness, play, 
a challenge to some sort 
of frolic 
Never contemplative; to 
be serious is for her to 
be tedious or absurd. 
Is immediately amusing 
and likable. Meets the 
little details of experi- 
ence with a relish that 
is catching. Strangely 
“secure from the con- 
tagion of the world’s 
slow stain,” and from 
inner aspect of grief and 


Reports that her skin 
often reacts to nylon 
with urticaria. Usually 
does not wear stockings 
when she is “out” for 
long periods 


It is not possible here even to summarize the
history of each personality that emerged and
accumulated over the months, or to describe 
the varied and multiplex complications that 
arose to tax, and often to baffle and overwhelm, 
the therapist’s efforts. Let us note briefly a few 
scattered items. 
In contrast with the interesting case reported
by Erickson and Kubie (6), the secondary
personality, Eve Black, has shown anything
but a regular desire to help the other with
her problems. The considerably submerged 
and dissociated manifestations referred to by 
Erickson and Kubie as Miss Brown apparently 
expressed themselves only through the medium 
of automatic writing. And this writing was so 
verbally imperfect and abstruse that consider- 
able interpretation or translation was necessary 
to promote even limited communication. 
Nevertheless, whatever the influence desig- 
nated by the term Miss Brown may represent, 
it consistently worked to aid the accessible 
personality, Miss Damon. It was a thera- 
peutic influence (6). 




Efforts to interest Eve Black in taking a 
similar role met with grim obstacles. Many of 
these, as can be imagined, were not unlike what 
impedes and frustrates the psychiatrist who 
tries to help a typical psychopath deal more 
constructively with his own problems, to 
find real goals and to develop normal evalua- 
tions. New toys or games can sometimes 
serve to arouse briefly the interest of a capri- 
cious child. So, too, the therapist occasionally 
was able to enlist Eve Black's support in some 
remedial aim directed ‘towards the problems 
of her body's coinhabitant. Sometimes 
attaining in her even an attitude of neutrality 
was of value. What helpful acts or abstentions 
she could be induced to contribute have, 
however, been prompted, it seems, only by 
fleeting impulses such as casual curiosity, the 
playful redirection of a whim towards some 
pretty novelty. Often she has, by ingenious 
lies, misled the therapist to believe she was 
cooperating when her behavior was particu- 
larly detrimental to Eve White's progress. 

No real or persistently constructive or
sympathetic motivation has yet been induced 
in the irresponsible Eve, but one valuable 
means of influencing her is in the hands of the 
therapist. Though Eve Black has apparently 
been able since childhood to disappear at will, 
often doing this suddenly to leave the con- 
scientious Eve with unpleasant consequences 
of misconduct and folly not her own, the 
ability to displace Eve White's consciousness
and emerge to take control has always been 
limited. Sometimes she could “get out” and
sometimes not. Since Eve White during
treatment learned of the other's existence it has 
become plain that her willingness to step aside 
and, so to speak, to release the imp plays an 
important part in this alternate’s ability to 
appear and express herself directly. Eve White 
cannot keep the other suppressed permanently
or count with certainty on doing this for some 
given period. Her influence, and indirectly that 
of the therapist, have, however, been suffi- 
ciently strong to use for bargaining with Eve 
Black for better cooperation. If she will avoid 
the more serious forms of misconduct she is
rewarded with more time “out.” 

Even when invisible and inaccessible she,
apparently, has means of disturbing Eve
White. She tells us she caused those severe 
headaches that brought the latter to us as a
patient. Her unsuccessful struggle to get out
often produces this symptom in the other. So 
too, she explains that the hallucinatory, or 
quasi-hallucinatory, voice which Eve White 
heard before the other Eve disclosed herself to 
us was her deliberate work. 

From the two Eves during many interviews 
and from her husband and parents, we in time 
obtained a great deal of information about the 
patient. Having concluded we had a reasonably 
complete and accurate history of her career 
since early childhood, we were astonished by 
the report of a distant relative who insisted 
that a few years before she met her present 
husband a previous marriage had occurred. 

Eve White denied this report and has never 
yet shown any knowledge of it. To our surprise 
Eve Black also maintained that we had been 
misinformed, insisting that Eve White had 
married only once, that she herself had never 
and would never consider marrying any man. 

Finally, under the persistent pressure of 
evidence, Eve Black gave up her position, ad- 
mitted that the relative's report was correct, 
that she herself and only she had been the 
bride. This event she told us occurred several 
years before Mrs. White's marriage. While the 
other Eve was employed in a town some distance
from her parents’ home she had come
“out” and gone to a dance with a man she
scarcely knew. After a night of merriment,
something was half-jokingly mentioned about 
the pair getting married more or less for the 
hell of it. This apparently struck her fancy. 

She has recounted many details of outlandish 
strife and hardship during several months 
when, apparently, she had lived with this man. 
No record of a legal union has been obtained
but considerable evidence indicates she did co- 
habit during this period with such a man as she 
describes, perhaps under the careless impression 
that a marriage had really occurred. She insists 
that some sort of “ceremony” was performed, 
saying that it was not formally recorded and admitting
it may have been a ruse. During this time
when she regarded herself as wed, Eve Black enjoyed
her longest periods of uninterrupted
sway. She was predominantly in control,
almost constantly present. Apparently she had 
no desire for sexual relations but often enjoyed
frustrating her supposed husband by denying
herself to him. He in turn, she says, was prone
to beat her savagely. She claims to have succeeded
in avoiding most of the pain from this
by “going in” and leaving the other Eve to
feel the blows. 

This last claim immediately impressed us 
both as extremely implausible. If Eve White
experienced the pain and humiliation of these 
beatings, why did she not remember them? 
She has consistently denied any memory of the 
entire marital or pseudomarital experience 
reported by Eve Black. Our unreliable but 
convincing informant maintains that she her- 
self remained in control or possession nearly all 
the time during this adventure. She furthermore
insists that she can, by exerting a considerable
effort, often “pick out” or erase from
Eve White's reach certain items of memory.
“I just start thinking about it very hard,”
Eve Black says, “and after a while she quits 
and it doesn’t come back to her anymore.” All 
awareness of the beatings she claims so to have 
erased from the other's recollection. Such a 
claim, obviously, was subject to testing by the 
therapist. Several experiments indicated that 
it is correct. 

After approximately eight months of 
psychiatric treatment Eve White had appar- 
ently made encouraging progress. For a long 
time she had not been troubled by headaches 
or “blackouts.” The imaginary voice had
never been heard again since the other Eve 
revealed herself to the therapist. Mrs. White 
worked efficiently at her job and had made 
progress financially through salary raises and 
careful management. The prospect of returning 
to her husband and of working out a bearable 
relation was still blocked by serious obstacles, 
but, having achieved more personal security 
and financial independence, she had become 
more hopeful of eventually reaching some 
acceptable solution. Though sadly missing the 
presence of her child, she found some comfort 
in her successful efforts to provide for her. She 
had made friends in the once strange city and 
with them, despite many worries and respon- 
sibilities, occasionally enjoyed simple recrea- 
tions. 

Meanwhile Eve Black, though less actively 
resisted in emerging, had in general been 
causing less trouble. Being bored with all 
regular work, she seldom “came out” to make 
careless and costly errors, or indulge in com- 
plicating pranks while the breadwinner was on 
her job. Though in leisure hours she often got
in bad company, picked up dates, and indulged
in cheap and idle flirtations, her demure and 
conventional counterpart, lacking knowledge 
of these deeds, was spared the considerable 
humiliation and distress some of this conduct 
would otherwise have caused her. 

At this point the situation changed for the 
worse. Eve White’s headaches returned. They 
grew worse and more frequent. With them also 
returned the “blackouts.” Since the earlier 
headaches had been related to, perhaps caused 
by, the other Eve’s efforts to gain control, and 
the “blackouts” had often represented this 
alternate’s periods of activity, she was sus- 
pected and questioned. She denied any part or 
influence in the new development. She did not 
experience the headaches, but, surprisingly, 
seemed now to participate in the blackouts, 
and could give no account of what occurred 
during them. Apparently curious about these 
experiences, she said, “I don’t know where we 
go, but go we do.” 

Two or three times the patient was found 
lying unconscious on the floor by her room- 
mate. This, so far as we could learn, had not 
occurred during the previous episodes re- 
ported by Eve White as “blackouts.” It 
became difficult for her to work effectively. 
Her hard-won gains in serenity and confidence 
disappeared. During interviews she became 
less accessible, while showing indications of 
increasing stress. The therapist began to fear 
that a psychosis was impending. Though this 
fear was not, of course, expressed to Eve White, 
it was mentioned to her reckless and invul- 
nerable counterpart. The fact was emphasized 
that, should it be necessary to send Eve White 
to an institution, the other, too, would suffer 
the same restrictions and confinement. Per- 
haps, the therapist hoped, this fact would cur- 
tail her in any unadmitted mischief she might 
be working. 

Since it has for long been presumed that so- 
called dual personalities arise from a dissocia- 
tion of an originally integrated entity of func- 
tioning and experience, efforts were naturally 
exerted from the first to promote reintegration. 
Attempts were made with each Eve to work 
back step by step into early childhood. With 
Mrs. White hypnosis was sometimes used to 
regain forgotten events or aspects or fragments 
of experience. It was hoped that some link or 
bridge might be found on which additional
contact and coalition could grow or be built.
Under hypnosis she occasionally re-experi- 
enced considerable emotion in recalling events 
of her childhood. We have never been able to 
hypnotize Eve Black. 

It soon became possible for the therapist to
evoke either personality at will. During the 
first few weeks a transition from Eve White 
to Eve Black was more easily achieved by 
hypnosis. Shortly afterwards it became possible
to simplify the procedure. Permission and the
promise of cooperation were obtained from the
lady present. Then the other was called by
name and invited or encouraged to emerge.


With repetition, and with deepening emotional 
relations between patient and physician, this 
process became after a while very easily accom- 
plished. In the very early stages of treatment 
an effort was made, perhaps a too naive 
effort, to promote some sort of blending, or at 
least a liaison, by calling out both personalities 
at once. To this attempt Eve White reacted 
with violent headache and emotional distress so 
severe that it was not considered wise to 
continue. When the experiment was reversed, 
with the apparently invulnerable Eve Black 
manifest, much less agitation was observed. 
After one unsuccessful trial, however, she 
bluntly refused to go further. In explanation 
she said only that it gave her “such a funny, 
queer, mixed-up feeling that I ain’t gonna put 
up with it no more.” 

Sometime after the return of headaches and 
blackouts, with Eve White’s maladjustment 
still growing worse generally, a very early
recollection was being discussed with her. The
incident focused about a painful injury she had
sustained when scalded by water from a wash
pot. As she spoke her eyes shut sleepily. Her
words soon ceased. Her head dropped back on
the chair. After remaining in this sleep or
trance perhaps two minutes her eyes opened.
Blankly she stared about the room, looking at
the furniture and the pictures as if trying to
orient herself. Continuing their apparently
bewildered survey, her eyes finally met those
of the therapist, and stopped. Slowly, with an
unknown husky voice and with immeasurable
poise, she spoke. “Who are you?”


From the first moment it was vividly apparent
that this was neither Eve White nor
Eve Black. She did not need to tell us that.
The thousands of points distinguishing the two
Eves have grown more clear and convincing as
we acquire additional experience with each.
So this new woman with time and study has 
shown herself ever more plainly another en- 
tity. Only in a superficial way could she be 
described as a sort of compromise between the 
two. She apparently lacks Eve Black’s obvious 
faults and inadequacies. She also impresses us 
as far more mature, more vivid, more boldly 
capable, and more interesting than Eve White. 
It is easy to sense in her a capacity for accom- 
plishment and fulfillment far beyond that of 
the sweet and retiring Eve White, who, beside 
this genuinely impressive newcomer, appears 
colorless and limited. In her are indications of 
initiative and powerful resources never shown 
by the other. This third personality calls her- 
self Jane, for no particular reason she can give. 
In her it is not difficult to sense the potential 
or the promise of something far more of woman 
and of life than might be expected from the 
two Eves with faults and weaknesses elim- 
inated and all assets combined. 

Some weeks after Jane emerged to make a 
group of three patients, electroencephalo- 
graphic studies were conducted. 


Report of Electroencephalogram 


This tracing consists of 33 minutes of continuous recording
including uninterrupted intervals of 5 minutes
or more of each personality as well as several transpositions.
The record was made with a Grass Model III
EEG machine (8 channels) under conditions standard 
for this laboratory. 
Each personality shows intervals of alpha rhythm
interspersed with periods of diffuse low voltage fast
activity. Intervals of L.V.F. are presumably associated
with periods of mental tenseness, which the patient
admitted experiencing. Although it is possible that these
periods occurred at random, tenseness is most pronounced
in Eve Black, next in Eve White and least of
all in Jane. Several EEG's would be needed to show 
this to be a constant relationship. 

When alpha rhythm occurs (relaxation), it is 
steadily maintained at 10½ to 11½ cycles per sec. by
Eve White and by Jane. Eve Black’s alpha is increased
in rate to 12 or 13 cycles per sec., generally at 12½.
This increase is significant and falls at the upper border
of normal limits approaching an F1 category. It is
interesting to note that F1 records are fairly common 
in psychopathic personality although no consistent 
correlation has yet been demonstrated. In addition to 
the increased rate there is evidence of restlessness and 
generalized muscle tension during Eve Black’s tracings 
which are not observed in the other two personalities. 

Transposition is effected within a few seconds. It
is usually accompanied by artifact from eye movements 
and slight body movements. Alpha rhythm is frequently 
blocked for several seconds during and following
transposition. Alpha blocking was most pronounced in
passing from Eve White to Eve Black. It did not occur 
at all in transposition from Eve Black to Eve White. 
This might possibly suggest that transposition from 
Eve Black to Eve White is easier to effect. However,
only two such transpositions are recorded. 
No spikes, abnormal slow waves or amplitude asymmetries
are recognized.
Summary 
All three personalities show alternate periods of
alpha rhythm and low voltage fast activity, presumably
due to alternate periods of mental relaxation and
mental tenseness. The greatest amount of tenseness is
shown by Eve Black, Eve White next and Jane least. 
Eve Black shows a basic alpha rate of 12½ cycles
per sec., as compared with 11 cycles per sec. for Eve 
White and Jane. This places Eve Black's tracing on the 
border line between normal and slightly fast (F1). 
Slightly fast records are sometimes (but not consis- 
tently) associated with psychopathic personality. Eve 
Black's record also shows evidence of restlessness and 
muscle tension. Eve Black's EEG is definitely dis- 
tinguished from the other two and could be classified 
as border-line normal. Eve White's EEG probably
cannot be distinguished from Jane's—both are clearly 
normal. 

J. Manter, M.D. 

EEG Laboratory 

Medical College of Ga. 

Jan. 5th, 1953. 


For several months now there have been 
three patients to interview and work with. 
Jane has awareness of what both Eves do and 
think but incomplete access to their stores of 
knowledge and their memories prior to her 
emergence upon the scene. Through her re- 
ports the therapist can determine when Eve 
Black has been lying. Jane feels herself per- 
sonally free from Eve White's responsibilities 
and attachments, and in no way identified with 
her in the role of wife and mother. Apparently 
she is capable of compassion, and, we feel likely, 
of devotion and valid love. She has cooperated 
with sincerity, and with judgment and original- 
ity beyond that of the others. Though it took 
her a while to learn what was quite new to her, 
she has already taken over many of Eve 
White’s tasks at work and at home in efforts to 
relieve and help her. Her feelings towards 
Eve's little girl appear to be those of a wise and 
richly compassionate woman towards the child 
of a family not her own, but still a child in 
emotional privation. 

Her warm impulses to take a more active 
role with this little girl are complicated by the 
deep conviction that she must not in any way
act so as to come between the distressed 
mother and her only child. During the few
months of her separate existence Jane has, one
might say, become stronger and more active. 
Despite her fine intelligence she began without 
experience, or at least without full access to the 
experience of an adult. As time passes Jane 
stays "out" more and more. She emerges only 
through Eve White, never yet having found a 
way to displace Eve Black or to communicate 
through her. Almost any observer would, we 
think, find it obvious that Jane, and she only 
of the three, might solve the deepest problems 
that brought the patient we call Eve White to 
us for treatment. Could Jane remain in full 
possession of that integrated human function- 
ing we call personality our patient would 
probably, we believe, regain full health, 
eventually adjust satisfactorily, perhaps at a 
distinctly superior level, and find her way to a 
happy life. 

Should this occur it seems very unlikely that 
Mr. White's wife would ever return to him. 
On the other hand it is little more likely that 
Eve White, even if she becomes free of all that 
she has known as symptoms, could or would
ever take up her role again as wife in that 
marriage. Should she try to do so, it is difficult 
to foresee much happiness for her or the hus- 
band. The probability of deep and painful 
conflict is apparent, also the real danger of 
psychosis. 

Were we impersonal arbiters in such a matter 
it would be easy to see, and to say, that the
only practical or rational solution to this 
astonishing problem is for Jane to survive, and 
Jane only. A steadily prevailing Eve Black 
would indeed be a travesty of woman. The 
surface is indeed appealing, but this insouciant 
and likable hoyden, though perhaps too shallow
to become really vicious, would, if unrestrained,
forever carry disaster lightly in each hand. 

The sense of duty, the willingness for self-sacrifice,
so strong and so beautiful in Eve
White, might bring her back repeatedly into 
this marital situation which she lacks the 
emotional vigor to deal with, and in which it is 
not likely she could survive. Jane, whose
integrity, whose potential goodness, seems not 
less than that of Eve White, has rich promise of 
the power to survive, even to triumph against 
odds.

It is perhaps unnecessary to point out that 
we have not judged ourselves as wise enough
to make active decisions or exert personal
influence in shaping what impends. It is plain
that, even if we had this wisdom, the responsibility
is not ours. Would any physician order
euthanasia for the heedlessly merry and amoral
but nevertheless unique Eve Black? If so, it is
our belief, it could not be a physician who has
directly known and talked for hours with her, 
not one who has felt the inimitable identity of 
her capricious being. 

A surviving Jane would provide for Eve 
White's half-lost little girl a maternal figure of 
superb resources. Perhaps in time she could 
give the child a love as real and deep as that of 
the mother herself. Perhaps. But would those 
feelings be the actual and unique feelings that 
have sustained the frail and tormented Eve 
White in her long, pathetic, and steadfast strug- 
gle to offer the child a chance for happiness? 
It may be said that this is foolish and tedious 
quibbling, that Jane, after all, is the girl's real
mother. Was she not born of her body? All 
awareness of her as a daughter ever experi- 
enced by Eve White is recorded in the electro- 
chemical patterns of Jane's brain. True indeed. 
But is she her mother? Those who have known 
Eve White personally will find it hard to accept 
simple affirmation as the whole truth. What
this whole truth is can be better sensed in direct 
feeling than conveyed by explanation. 

* A question of the psychotherapist's responsibility 
has been raised. Morton Prince has been accused by 
some, particularly by McDougall, of taking too active
a part in “squeezing out” Sally. Our experience made us 
feel very keenly the wish not to exert pressures arbi- 
trarily and perhaps play a part in the extinction of 
qualities possibly of real value if they were integrated 
into more responsible patterns of behavior. We believe 
there is some choice open to the psychiatrist as to 
which personality he will try to reinforce, but that he
must be tentative and work along with developments 
within the patient (or patients?) rather than make full 
and final judgments. 

We feel that therapy has played a part in the 
emergence of Jane, but we do not consider her merely 
our creation. Our influence seems to have been more 
catalytic than causal. Psychotherapy has not been
directed according to an arbitrary plan. Although we
have persistently investigated early experiences through
all three manifestations of our patient, and have encouraged
emotional reaction to them, we have sought
to avoid insistence on any of the popular theoretical
forms of interpretation.

Jane continues to grow in influence, to be out more 
and more. She has established contact with some events 
in the early life of Eve White, and seems more rooted 
in a past. We cannot predict with any great confidence 
the outcome, but we are hopeful that some reasonably
good adjustment will work out through the capacities
contributed by Jane.




At a distance bridged only by printed or 
spoken words these “beings” may appear as 
factitious abstractions. In the flesh, though it 
is the flesh of a single body, one finds it more 
difficult so to dismiss them. Final decisions, or 
choices in the course of involuntary develop- 
ments must, we have decided, be offered freely 
to something within our patient, perhaps to 
something beyond any levels of contact we 
have reached with Eve Black, with Eve White, 
or with Jane. 

Jane, who appears to have some not quite 
articulate understanding or purblind grasp of 
this whole matter, not available to either of the 
Eves, shares our sharp reluctance about par- 
ticipating in any act that might contribute to 
Eve White’s extinction. Unlike Eve Black, Jane 
has profound and compassionate realization of 
Eve White’s relation to her child. The possibil- 
ity, the danger, of a permanent loss of all touch 
with reality has occurred to Eve White. 
Through this we have found a better apprecia- 
tion of her feelings as a mother. Too restrained 
ordinarily by modesty to speak about such a 
matter, after hypnosis she offered in quiet tones 
of immeasurable conviction to accept this 
extinction if it might win for her daughter 
Jane’s presence in the role she had not suc- 
ceeded in filling adequately for her child. 

It has been said that a man must first lay 
down his life if he is to truly find it. Is it possible
that this mother may, through her renunciation,
somehow survive and find a way back to
the one and dearest thing she is, for her child’s
sake, ready to leave forever? That we do not
know. Long and intimate personal relations
with this patient have brought us to wonder if 
in her we have blindly felt biologic forces and
processes invisible to us, still uncomprehended
and not quite imaginable.

Recently Eve White, anything but a physi- 
cally bold or instinctively active person, was 
challenged suddenly by an event, for her mo- 
mentous. Of this Jane, deeply moved, wrote to 
the therapist:


Today she did something that made me know and 
appreciate her as I had not been able to do before. I 


She saved the life of a little boy today. Everybody
thought him to be her child, because she darted out in
front of a car to pick him up and take him to safety.
But instead of putting him down again, the moment his 
baby arms went around her neck, he became her baby 
—and she continued to walk down the street carrying 
him in her arms.

I have never been thus affected by anything in my 
four months of life. There seemed only one solution to 
prevent her possible arrest for kidnapping. That was 
for me to come out and find the child’s mother. In the 
end I had to give him to a policeman. Later tonight 
when she had come back out, she was searching for her 
own baby. She had her baby again for a short while this 
afternoon; and I’m so happy for that. I still can't feel 
Eve Black. I can’t believe she’s just given up. I feel
inexpressibly humble. 


Discussion 


What is the meaning of the events we have 
observed and reported? Some, no doubt, will 
conclude that we have been thoroughly hood- 
winked by a skillful actress. It seems possible 
that such an actress after assiduous study and 
long training might indeed master three such 
roles and play them in a way that would defy 
detection. The roles might be so played for an 
hour, perhaps for a few hours. We do not think 
it likely that any person consciously dissimulating
could over months avoid even one telltale
error or imperfection. Though this does not
seem likely to us, we do not assume it to be 
impossible. Let us remember, too, that in plays 
the actors are given their lines, and their roles 
are limited to representations of various char- 
acters only in circumscribed and familiar 
episodes of the portrayed person’s life. The 
actor also has costume and make-up to help 
him maintain the illusion. 

Have we, others may ask, been taken in by 
what is no more than superficial hysterical 
tomfoolery? We would not argue that the psy- 
chopathology presented here has nothing in 
common with ordinary hysterical conversions 
and dissociations. We do believe that here there 
is also something more, and something differ- 
ent. If one is to regard these three manifesta- 
tions of personality as products of disintegra- 
tion, could such a presumed disintegration be 
schizophrenic, or perhaps incompletely schiz- 
oid? If the process is akin to the processes of 
schizophrenia, it must still be noted that none 
of the three products, not one of the three 
personalities, shows anything suggesting the 
presence of that disorder. Are we justified in 
postulating a once unified whole from which 
our three performers were split off? Or is it
possible that the functional elements composing
each, as we encounter them at present, have
never in the past been really or completely 
unified? 

The developmental integration of what we 
call personality appears to be a complex 
process of growth or evolution, a not-too-well 
comprehended unfolding of germinal potenti- 
alities. Let us compare such a process with the 
zygote's course from microscopic unicellular 
entity to adult human being. Reviewing the 
biologic course of identical twins we come at 
length to cellular unity in the single zygote. 
Perhaps we must assume in the multiple per- 
sonalities at least a primordial functional unity. 
If so, is it possible that some division might
have begun far back in the stage of mere po- 
tentialities, at preconscious levels of growth 
not accessible to us except in surmise or 
theory? If so, what chance is there that an ade- 
quate integration may occur? 

One might from our verbal account easily 
see, or read into, the character Jane some fu- 
sion of, or even a mere compromise between, 
the diverse tendencies of the two Eves. If she 
has, indeed, been formed of their substance it 
is difficult for us to assume that the process was 
merely additive. If all her elements derive 
from the other two, this union, like that of 
hydrogen and oxygen to make water, seems to 
have resulted in a product genuinely different 
from both the ingredients from which it was
formed. 

Have we in our many hours of enthusiastic 
work with this patient gradually lost ourselves, 
and our judgment, in an overdramatization 
of the subject? Are we reporting what is objec- 
tive, or chiefly the verbal forms of our surmises 
and speculations? It is not for us to give the 
final answer to these questions. We are aware 
that the only terms available to indicate what 
we think is valid carry also many connotations 
that we do not assume or believe to be sup- 
ported by fact (27). 

Obviously the differing manifestations we 
have observed in one woman's physical organ- 
ism do not, in all senses of the term, indicate 
three quite separate people. Our words refer- 
ring to the possible disappearance or perma- 
nent extinction of one of the personality 
manifestations perhaps imply we regard this 
as an equivalent, or at least an approximation, 
of death. Are we guilty of a misleading exaggeration?
No heart would stop beating should
this occur. No eyes would permanently close.
No flesh would undergo corruption. Such an 
extinction would not fulfill the criteria by 
which death is defined. Yet, if we may ask, 
would his immediate replacement by an identi- 
cal twin invalidate for a bereaved widow the 
death of her husband? This analogy is not 
precise. In some respects it is misleading. It 
does not give us an answer to the question we 
raise. Perhaps it may, nevertheless, accurately 
reflect some of our perplexity. 

For these and for many other questions that 
have confronted us in this study we have no 
full or certain answers. We ask ourselves what 
we mean by referring to that which we have 
observed by such a term as multiple person- 
ality? Immediately we face the more funda- 
mental question: What is the real referent of 
this familiar word personality? In ordinary use 
we all encounter dozens of unidentical refer- 
ents, perhaps hundreds of overlapping con- 
cepts, all with vague and elusive areas extend- 
ing indefinitely, vaguely fading out into limit- 
less implications (28). 

Any day we may hear that John Doe has 
become a new man since he quit liquor three 
years ago. Perhaps we tell ourselves that Har- 
vard actually made a different person of that 
boy across the street who used to aggravate all 
the neighbors with his mischievous depreda- 
tions. Many religious people describe the ex- 
perience of being converted or born again in 
terms that to the skeptical often seem chiefly 
fantastic. 

With considerable truth, perhaps, it may be 
stated that after her marriage Mary Blank
changed, that she has become another woman. 
So, too, when a man's old friends say that since 
the war he hasn’t been the same fellow they 
used to know, the statement, however inaccu-
rate, may indicate something real. We hear
that an acquaintance when drinking the other
night was not himself. Another man, we are
told, found himself after his father lost all that
money. Every now and then it is said that a
certain woman’s absorption in her home and
children has resulted in her losing her entire
personality. Though such sayings are never
taken literally, there is often good reason for 
them to be taken seriously. 

Are they not exaggerations or distortions 
used to indicate very imperfectly what is by no
means totally untrue but what cannot be put
precisely, or fully, into words? The real mean- 
ing of such familiar statements, however sig- 
nificant, helps us only a little in explaining 
what we think we have encountered in the case 
reported. Some relation seems likely, as one 
might say there is some relation between 
ordinary vocal memory or fantasy and true 
auditory hallucinations. 

Though often distinguished from each of the 
other terms, “personality” is sometimes used 
more or less as a synonym or approximation for 
“mind,” “character,” “disposition,” “soul,” 
“spirit,” “self,” “ego,” “integrate of human
functioning,” “identity,” etc. In common speech 
it may be said that John has a good mind but 
no personality, or that Jim has a wonderful 
personality but no character, etc. Often this 
protean word narrows (or broadens) in use to 
indicate chiefly the attractiveness, or unattrac- 
tiveness, of some woman or man. In psychiatry 
its most specific function today is perhaps that 
of implying a unified total, of indicating more 
than “intelligence,” or “character,” more than 
any of the several terms referring with various 
degrees of exactness to various qualities, ac- 
tivities, responses, capacities, or aspects of the 
human being. In the dictionaries, among other 
definitions, one finds “individuality,” “quality 
or state of being a person,” “personal existence 
or identity.” 

There is, apparently, no distinct or whole or 
commonly understood referent for our word
“personality.” It is useful to us in psychiatry
despite its elasticity, often because of its elas-
ticity. If they are to be helpful all such elastic
terms must be used tentatively. Otherwise they 
may lead us at once into violent and confused 
disagreement about what are likely to be 
imaginary questions, mere conflicts of arbi-
trary definition (14). Bearing this in mind we
feel it proper to speak of Eve Black, Eve White,
and of Jane as three “personalities.” Perhaps
there is a better term available to indicate the
manifestations of this patient. If so we are
indeed prepared to welcome it, with enthusi-
asm and with relief.

Our study has raised many questions. Even
for us it has settled few if any. The relatively
slight or inconclusive differences between the
personalities of our patient noted electroen-
cephalographically, and in psychometric and
projective tests, are not particularly impressive
beside the profound and consistent differences
felt subjectively in personal and clinical rela-
tions. A well-qualified expert examined for us
the handwriting performed by each Eve.
Though considerably impressed by consistent
and significant differences between the two
productions, it is his opinion that those with
adequate professional training could regularly 
establish sufficient evidence to show both were 
done by the same human hand. After a detailed 
investigation this conclusion was expressed by 
our consultant: 


As a conclusion of the opinions derived from analysis 
of the various handwritings of this multiple personality 
patient, it is believed that the handwriting does not 
undergo complete subordination to each marked change 
of personality, even though each group exhibits evi- 
dence of emotional instabilities. It readily appears the 
handwriting of each personality is of a different person. 
Such apparent or discernible variations may lead the 
untrained observer to believe that the handwriting of 
each personality is completely foreign to the other. 
However, extensive investigation of these handwriting 
materials establishes beyond any doubt that they have 
been written by one and the same individual. Nothing 
was found to indicate a wilful and conscious intent to 
disguise writings executed within a personality or 
between the first and second personalities. 

Ward S. Atherton, Captain, Military Police 
Corps, U. S. A. 

Chief, Questioned Document Section, 

Army Provost Marshal General's Criminal In-
vestigation Laboratory, Camp Gordon, Georgia.


Though unable at present to add anything
significant to the hypotheses that were offered 
in the past by those who have worked with 
similar patients, we find ourselves singularly 
stimulated by our direct experience with this 
case. If we have not so far devised final or even 
fresh answers we have at least been prompted 
to ask ourselves a number of questions. A few 
of these, even when put in verbal forms out- 
wardly familiar, we find to our surprise have 
somehow become new to us and peculiarly 
stimulating. 

Though long acquainted in a general and in- 
direct way with Morton Prince's celebrated 
studies, we both deliberately refrained for 
months after beginning work with our case 
from reading The Dissociation of a Personality 
(23) and Clinical and Experimental Studies in 
Personality (24). We hoped, in this way, to 
avoid projecting the conclusions and concep- 
tions of another into what we encountered. 

After having noted what is recorded here, 
we compared our experience with what Prince
observed and discussed in cogent detail approx-
imately fifty years ago. The popular termi-
nology and theory of psychiatry today differ
considerably from the explanations and hy- 
potheses of behavior offered by the physician 
who wrote so impressively of Miss Beauchamp 
and of other matters. 

Most of us believe, no doubt, that psychiatry 
and psychology have advanced marvelously 
since the turn of the century. In many respects
this belief is unchallengeable. In many respects, 
yes; but in all? 

In this half-century of progress have we not
also developed some habits of thinking that 
may confuse us? Have we perhaps unwittingly 
enshrined as sacred dogma many concepts that 
obscure or distort more than they reveal? Long 
sanctified verbal constructs, flabby theoretical 
abstractions are manipulated with a bold 
flourish in many of our treatises and mono- 
graphs, presumably in the name of science. In 
tedious polysyllabic jargon we read today of 
electrochemical libidos undergoing gelatiniza- 
tion (15), of parental imagos cannibalistically 
devoured per os and sadistically expelled per 
anum (7). Such terms as “proved,” “so-and-
so has established,” “clearly demonstrated,”
etc. have become in our time more popular as
synonyms for fantasy and speculation than
Morton Prince found them (3, 7, 15, 21). 

How much can we congratulate ourselves on 
having advanced in the last fifty years if many 
of our leading authorities still find themselves 
bound to write in ponderous volumes of “actual 
neuroses” and solemnly contrast these revered 
artifacts with “psychoneuroses” (7). Is it prog-
ress if we establish the universality of castra-
tion fear, and its supreme significance, by rede-
fining “castration” to mean all parental and
social forces that tend to restrict or direct 
genital activity (5)? By this method any point 
of doctrine regarded as too holy for questioning 
could indeed be proved valid. But, who will 
say that thereby we have revealed anything 
not already well known to a twelve-year-old
moron? So, too, we can immediately demon-
strate that all women are to a remarkable
extent homosexual if we piously agree that no
impulse to activity, no courageous response,
can be classified as other than purely mascu-
line (10, 18). In recent issues of a reputable
medical journal we read how an adult’s dream
“proves” intrauterine emotional trauma, and
demonstrates profound personal relations be-
tween embryo and placenta. The investigators
warn the reader that “resistance” may cripple
his ability to evaluate the plain evidence pre-
sented, may disqualify him from scientifically
appraising these discoveries (8, 21, 22). Is it
not our responsibility as psychiatrists to ex-
amine frankly such developments as these and
to ask ourselves what sort of progress we are
making?

Who can doubt that since the case of Miss 
Beauchamp was so carefully studied reliable
knowledge in the field of psychiatry has accu-
mulated. Psychologic theory, “dynamic”
interpretation of personality disorder, has
moved to points far more ambitious than those
reached by Morton Prince. One need not deny
that much of this progress has been helpful, a
genuine advance, to wonder if the movement
has not also sometimes veered considerably
from the direction of what is true or even plau-
sible, and even occasionally spent much of itself
in enthusiastic but circular expeditions about
areas scarcely distinguishable from dianetics
and other swamplands of veritable nonsense 
(12). 

Be this as it may. We suggest that further 
direct study of multiple personality and 
reappraisal of Morton Prince's generally neg- 
lected formulations may yet yield to workers 
in our field some promising clue still over-
looked, a clue perhaps to possible discoveries
that may eventually yield insight we need but 
lack today. 


REFERENCES 


1. ALEXANDER, F. The psychoanalysis of the total
personality. New York and Washington: Nervous
and Mental Disease Publishing Co., 1930.
2. ALLEN, C. Modern discoveries in medical psychology.
London: Macmillan, 1937.
3. BERGLER, E. The basic neurosis. New York: Grune
and Stratton, 1949.
4. BRADY, M. E. The strange case of Wilhelm Reich.
New Republic, 1947, 116, 20-23.
5. BROWN, J. F. The psychodynamics of abnormal
behavior. New York: McGraw-Hill, 1940.
6. ERICKSON, M. H., & KUBIE, L. S. The permanent
relief of an obsessional phobia by means of com-
munications with an unsuspected dual per-
sonality. Psychoanal. Quart., 1939, 8, 471-509.
7. FENICHEL, O. The psychoanalytic theory of neurosis.
New York: Norton, 1945.
8. FODOR, N. The search for the beloved. Psychiat.
Quart., 1946, 20, 549-602.
9. FREUD, S. Collected papers. Vol. IV. London:
Hogarth Press, 1946.


10. FREUD, S. An outline of psychoanalysis. New York:
Norton, 1949.
11. HENDERSON, D. K., & GILLESPIE, R. D. A textbook
of psychiatry. (6th Ed.) London: Oxford Univer.
Press, 1947.
12. HUBBARD, L. R. Dianetics. New York: Hermitage
House, 1950.
13. JAMES, W. Psychology. Cleveland: Fine Editions
Press, 1948.
14. KORZYBSKI, A. Science and sanity. Lancaster, Pa.:
Science Press, 1941.
15. LONDON, L. S. Libido and delusion. Washington,
D. C.: Mental Therapy Publications, 1945.
16. McDOUGALL, W. An outline of abnormal psychology.
London: Methuen, 1926.
17. MORGAN, J. J. B. The psychology of abnormal
people. New York: Longmans, Green, 1932.
18. MULLAHY, P. Oedipus, myth and complex. New
York: Hermitage Press, 1948.
19. MUNCIE, W. Psychobiology and psychiatry. (2nd
Ed.) St. Louis: C. V. Mosby Co., 1948.
20. NOYES, A. P. Modern clinical psychiatry. (3rd Ed.)
Philadelphia: W. B. Saunders, 1948.


21. PEERBOLTE, M. L. Psychotherapeutic evaluations
of birth-trauma analysis. Psychiat. Quart.,
1951, 25, 589-603.
22. PEERBOLTE, M. L. Some problems connected with
Fodor's birth-trauma therapy. Psychiat. Quart.,
1952, 26, 294-306.
23. PRINCE, M. The dissociation of a personality. New
York: Longmans, Green, 1906.
24. PRINCE, M. Clinical and experimental studies in
personality. Cambridge, Mass.: Sci-Art Pub-
lishers, 1929.
25. REICH, W. Character analysis. New York: Orgone
Institute Press, 1949.
26. STRECKER, E. A., EBAUGH, F. G., & EWALT, J. R.
Practical clinical psychiatry. (7th Ed.) Phila-
delphia: Blakiston, 1951.
27. TAYLOR, W. S., & MARTIN, M. F. Multiple per-
sonality. J. abnorm. soc. Psychol., 1944, 39,
281-300.
28. WHOLEY, C. C. A case of multiple personality.
Amer. J. Psychiat., 1933, 89.


Received June 22, 1953. Prior Publication. 


CRITIQUES AND NOTES 


EFFECT OF FRUSTRATION ON PROBLEM-SOLVING BEHAVIOR 


S. M. MOHSIN 
University of Patna, Bihar, India 


Repeated experiences of failure in goal-oriented
activity produce frustration. Each failure 
adds to the existing tension aroused by the de- 
privation of the need involved in the given situa- 
tion. The tension increases with progressive 
failures, and finally reaches a limit when the be- 
havior of the individual takes on an altered char- 
acter. The alteration may tend in the direction of 
escape reactions, aggression, regression, or of 
extreme nervous tension expressed in emotional 
seizure, immobility, or stereotypy of behavior 
(4, 5, 9, 10). This condition of highly intensified 
tension, which marks the transition from adaptive 
to nonadaptive behavior, characterizes the ex- 
perience of frustration. Studies of frustration are 
usually concerned with the specific effects of frus- 
tration on the individual's reactions in the frustra- 
tion-inducing situation, or with the over-all 
transformation in the general behavior pattern of 
the frustrated individual. Only a few studies have 
been reported, especially on the human level, on 
the spread or carry-over of the effect of frustration 
induced in one situation or activity to another 
(12). An attempt is made in this study to deter- 
mine the effect of frustration in one task on per- 
formance in the immediately following task. 


METHOD 


Subjects. Twenty undergraduate students of psy- 
chology served as subjects (Ss). The purpose of the 
experiment was unknown to them. It was introduced 
as a part of a long program in the study of personality. 

Procedure. The Ss were first given Alexander’s 
Passalong Test on different days at about the same time 
in the morning and according to the standard instruc- 
tions (1). They were then divided into two groups of 
10 Ss, control and experimental, on the basis of their 
total scores on the Passalong Test. After four months, 
they were called for the second part of the experiment 
in the same order that they had taken the Passalong
Test.

The control group wrote stories on two of Murray’s 
TAT pictures for about 15 minutes. The purpose was 
to engage the control group in free and, relatively 
speaking, nondirected activity with little ego involve- 
ment for the same length of time that the experimental 
group underwent frustration-inducing activity. The
Passalong Test was then reintroduced starting in order 
from test IV. The first three were dropped because they 
were considered too easy; every S had done each within 
the minimum time allotted according to the table of 
norms. The last two were also dropped, for only four 
Ss could do test VIII and none could do test IX. After
the Ss had finished the four remaining tests, IV to VII,
they had a short interview with E. E inquired if they
remembered having done the tests in the previous
session and had realized any gain from the previous
testing. Further, E asked whether they felt that they
had done better the second time than the first time.
In order to induce frustration in the experimental
group, the technique reported by Zeller (12) was used
with slight adaptation. Six one-inch white wooden cubes
were shown to S. Four of them were placed on the
table, one inch apart, E held one in his hand, and the
remaining cube was given to S. The E tapped the four
blocks on the table with the block he held in his hand
in the following order: 1, 3, 2, 4, 2. S was asked to tap
in the same order. A table of norms bearing fictitious
scores and corresponding age levels was on the table
where S could see it. The maximum score shown was
12 for the adult level and the minimum was 6 for the
10-year-old level. The significance of the norms was
explained to S. The E had prepared 20 patterns of
tappings on a separate sheet, each differing in the se-
quence of blocks tapped, e.g., 3, 1, 4, 3, 2, 1, 3, etc. This
was hidden from S. A record sheet lay on the table
within S's view. The E tapped according to a pattern
and S reproduced that pattern. The E recorded S's
actual tapping on a sheet hidden from him and also
wrote on the record sheet, visible to S, the fictitious
scores made by him and the age level corresponding to
that score. The block sequences were too large to be re-
produced correctly, and S was therefore sure to fail.
Even if he succeeded in reproducing one or two pat-
terns, he readily believed E's judgment that he had
not. The S could know from the record sheet the score
he received in each trial and that his performance was
below “normal” in every case. The E also declared now
and then that S was failing regularly, that his per-
formance was “subnormal,” that the test was a very
simple one which required only observation and
memory, that his failure was therefore very strange,
and that he was doing much worse than anyone
E had examined. E maintained throughout an attitude
of concern over S's performance, and at the same time
a stern and rather critical expression. After 15 minutes
E cleared the table with a gesture of disgust and pity at
S's repeated failures, and introduced the Passalong Test.
The procedure for this part of the experiment was the
same as for the control group. The E's attitude of an
interested but dispassionate onlooker was the same for
both groups during the Passalong Test. After the con-
clusion of the experiment, E also briefly interviewed
each experimental S. They were asked questions similar
to those for the control group. The E also inquired
whether S ever thought of the pattern reproduction
test while he was doing the Passalong Test.
E asked how S felt about his failure in the pattern repro-
duction test. Did he feel there was something wrong
with him, with the test, or with E? Before retiring,
S was pledged to secrecy and there was reason to infer
that the pledge was scrupulously and invariably ful-
filled.


RESULTS 


Both groups showed a highly significant average 
gain in the second performance on the Passalong 
Test.¹ The net average gain for the control group
was 160.80 and for the experimental group, 122.30, 
a difference of 38.50 between the two (Table 1). 
There was no significant difference, however, be- 
tween the means of the control and experimental 
groups on either the initial or the final performance. 
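
The arithmetic behind these figures can be checked from the summary values in Table 1. The following is a minimal Python sketch, not the author's own procedure. It assumes that the between-groups t is the difference of the two means divided by the square root of the sum of the squared standard errors; the paper does not state its formula, and the function and variable names are illustrative only.

```python
import math

# Summary values copied from Table 1 (times in seconds).
control = {"initial": {"mean": 314.90, "se": 42.91},
           "final": {"mean": 154.10, "se": 29.21}}
experimental = {"initial": {"mean": 301.30, "se": 57.54},
                "final": {"mean": 179.00, "se": 36.14}}


def net_gain(group):
    """Average gain: initial mean time minus final mean time."""
    return group["initial"]["mean"] - group["final"]["mean"]


def between_groups_t(a, b):
    """ASSUMED formula: mean difference over the root of summed squared SEs."""
    diff = abs(a["mean"] - b["mean"])
    se_diff = math.sqrt(a["se"] ** 2 + b["se"] ** 2)
    return diff / se_diff


control_gain = net_gain(control)
experimental_gain = net_gain(experimental)
gain_difference = control_gain - experimental_gain

t_initial = between_groups_t(control["initial"], experimental["initial"])
t_final = between_groups_t(control["final"], experimental["final"])

print(f"gains: {control_gain:.2f} {experimental_gain:.2f} {gain_difference:.2f}")
print(f"between-groups t: initial {t_initial:.3f}, final {t_final:.3f}")
```

Under this assumption the computed values agree with the tabled gains (160.80, 122.30, difference 38.50) and with the between-groups t values (.189 and .536). The within-group t values cannot be recomputed this way, since they depend on the correlation between initial and final scores, which the summary table does not give.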

The experimental group showed an average gain 
of 18.6 seconds on test IV, while the control group 
showed a gain of 2.7 seconds only; 50 per cent of 
the control and 80 per cent of the experimental Ss 
showed a gain on this test. Further, the control 
group showed a steady increase in average gain 
from test to test in spite of the increasing difficulty 
of the tests (Fig. 1 and 2). (The increasing diffi- 
culty of the tests is supported by the increasing 
percentage of failures in the initial performance of 
the tests within the 180-second time limit scheduled 
for tests IV to VIII. All 20 Ss did test IV; 3 Ss 
failed to do test V; 4 Ss failed to do test VI; and 
7 Ss failed to do test VII. Test VII, further, marks 
a transition stage in the difficulty of the tests. 
The large square block which appeared in tests 
IV to VI was replaced by two rectangular blocks, 
thus increasing the total number of blocks in test 
VII; the same blocks were used in tests VIII and 
IX in the same order but with altered positions.) 
The two groups showed about equal average gain 
for tests V and VI, but for test VII the gain for
the experimental group suddenly dropped from
59.1 seconds in test VI to 17.7 seconds in test VII.
While the two curves for the control group start 
at about the same point but diverge increasingly 
from test to test, the curves for the experimental 
group start at a considerable distance from each
other which continues to increase until test VI, and 
then shrinks to almost the initial level on test VII 
(Fig. 1). Figure 2 shows a progressive rise from 
test to test for both groups, and a steep fall in 
test VII for the experimental group only. 
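
The failure counts and gains quoted in this paragraph can be turned into the percentages and the size of the drop they imply. The sketch below is a minimal Python illustration that uses only numbers stated in the text; the dictionary names are hypothetical.

```python
TOTAL_SS = 20  # all Ss took the initial Passalong Test

# Number of Ss failing each test within the 180-second limit (initial performance).
initial_failures = {"IV": 0, "V": 3, "VI": 4, "VII": 7}

# Percentage of failures per test, used in the text as evidence of rising difficulty.
failure_pct = {test: 100 * n / TOTAL_SS for test, n in initial_failures.items()}
values = list(failure_pct.values())
difficulty_increases = all(a < b for a, b in zip(values, values[1:]))

# Average gains in seconds for the experimental group, as reported in the text.
experimental_gain = {"IV": 18.6, "VI": 59.1, "VII": 17.7}
drop_vi_to_vii = round(experimental_gain["VI"] - experimental_gain["VII"], 1)

for test, pct in failure_pct.items():
    print(f"test {test}: {pct:.0f}% failures")
print("difficulty increases from test to test:", difficulty_increases)
print("drop in experimental gain, VI to VII:", drop_vi_to_vii, "seconds")
```

The failure rates rise from 0 to 15, 20, and 35 per cent across tests IV to VII, and the experimental group's gain falls by 41.4 seconds between tests VI and VII, back below its test IV gain of 18.6 seconds.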


¹ Three tables giving the time in seconds required
in the initial and final performances for the individual
Ss in each group, and the amount of gain in seconds
between the initial and final performances for each S,
have been deposited with the American Documentation
Institute. Order document 4114 from the ADI Auxiliary
Publications Project, Photoduplication Service, Library
of Congress, Washington 25, D. C., remitting in ad-
vance $1.25 for 35-mm. microfilm or $1.25 for photo-
copies legible without optical aid. Make checks payable
to Chief, Photoduplication Service, Library of Con-
gress.

TABLE 1 


COMPARISON BETWEEN EXPERIMENTAL AND CONTROL
GROUPS ON INITIAL AND FINAL PERFORMANCE
ON THE PASSALONG TEST


                 CONTROL GROUP      EXPERIMENTAL GROUP    BETWEEN GROUPS
MEASURE          INITIAL   FINAL    INITIAL   FINAL       INITIAL   FINAL
Mean             314.90    154.10   301.30    179.00
SD               113.54    77.28    152.24    95.62
SE Mean          42.91     29.21    57.54     36.14
Diff. Mean            160.80             122.30           13.60     24.90
t                      3.37               3.08            .189      .536
p                     <.01               <.02             .50       .50
[Figure 1: line graph of time in seconds against Passalong Test number, with separate initial and final curves for the control and experimental groups.]

FIG. 1. AVERAGE TIME IN SECONDS FOR CONTROL AND
EXPERIMENTAL GROUPS IN THE INITIAL AND FINAL
PERFORMANCES ON TESTS IV TO VII


[Figure 2: line graph of gain in seconds against Passalong Test number, with one curve each for the control and experimental groups.]

FIG. 2. AVERAGE GAIN IN SECONDS AS SHOWN BY THE
DIFFERENCE BETWEEN INITIAL AND FINAL PER-
FORMANCES IN TESTS IV TO VII


DISCUSSION


Performance in test IV made no contribution to 
the highly significant over-all gain shown by the 
control group. In the initial performance this test 
was fourth in order so that by the time Ss reached 
it they were fully adjusted to the test situation.
In the final performance test IV was done first.
The Ss were switched to this test from the TAT.
Their need for time to warm up screened the gain
from the initial performance in this test. The Ss
did no better than they had earlier. As the test
progressed, they achieved requisite incitement, and
the gain from the initial testing came out more
and more strikingly.

The experimental Ss did the pattern reproduc- 
tion test in which they encountered successive 
failures. The situation presented a strong threat 
to the ego and intensified the need to overcome the 
threat by the achievement of success. On the 
other hand, the repeated experience of failure im- 
parted an increasingly repelling character to the 
situation. It thus became charged with both a posi- 
tive and a negative valence and took on the typical 
features of a conflict situation. The E's notes on 
Ss’ behavior during the pattern reproduction test 
show that four of the 10 Ss verbally registered their 
desire for escape and requested postponement of 
the test; a few even blurted out “cannot do” 
repeatedly. The E disregarded their requests. The 
Passalong Test was introduced when Ss were in 
this extremely distressful condition. Familiarity 
with the test and its previous association with 
success promised relief to the deflated ego from the 
mounting tension. It provided a substitute goal 
with a strong positive incentive; Ss had a set of 
high motivation on the test. The E's notes on Ss’ 
behavior further enlighten us on this point. “Sign 
of relief at the introduction of the Passalong Test,” 
“became very cautious because thought I had done 
the previous test (pattern reproduction test) 
badly,” “this test (Passalong) was very simple and 
performed better than before because I had failed 
in the preceding (pattern reproduction) test,” 
etc. Unlike the control group, the experimental
group needed no additional incitement and “warm- 
ing up.” The result was the evident gain from the 
initial performance in the very first test (test IV). 
Invariable success in reaching the goal further in- 
creased motivation, and the Ss' performance in 
tests V and VI reached a gain comparable to the 
control group's after the latter was “warmed up.”
On test VII the progressive gain from previous 
experience underwent a reversal. Instead of show- 
ing a rise, it indicated a precipitous fall (Fig. 2). 
Presumably, this was due to an internal inhibition 
which interfered with the expression of gain which 
had hitherto come up unreservedly. As remarked 
earlier, test VII marks a transition in the difficulty 
of the Passalong Test. Perhaps the feeling of in- 
creased difficulty resulted in the revival of the 
ego threat which had gradually waned with the 
smooth and successful progress of the tests, and
its revival produced the inhibition. But with the 
evidence in hand, we can only conjecture that the 
inhibition stemmed from the frustration induced
in the pattern reproduction test, and that the
sudden drop in improvement is due to the carry-
over of the effect of frustration. Our over-all result
supports this conjecture. The difference in total
gain between the experimental and control groups,
despite the initial advantage of high motivation
and adequate incitement to the experimental
group, is in favor of the control group (Table 1).
Analysis of the data for the experimental group
and E's notes on Ss’ behavior provide some useful
indications. In the experimental group, six Ss
(L, P, S, R, T, and W) showed a gain in test VII.
The E's notes read: Subject L—“nonmechanical,
thoughtful performance; failure due to ‘the in-
ability to concentrate attention because not feeling
well’; no feeling of disgust.” Subject P—“very
cautious, deliberative and cautious moves; frus-
tration not strongly induced; apologized for his
failure; commented on the difficulty of E's task in
conducting the investigation; no feeling of disgust
but only surprise at failure to perform better.”
Subject R—“cautious, thoughtful, trial-and-error
but not mechanical or thoughtless moves, though
some signs of regression; ‘cannot concentrate’; not
much shaken by failure; feeling of indifference;
‘had mental preoccupation,’ not strongly frus-
trated.” Subject S—“thoughtful, considered,
deliberative, and nonmechanical moves; not
strongly frustrated; never looked emotionally
disturbed; felt bad about himself, ‘why not doing
satisfactorily?’; left with a smiling face.” Subject
T—“self-critical, deliberative moves; felt bad
about himself, about his memory, ‘thought any-
thing physiologically wrong that caused the
errors’; very inquisitive about the explanation of
his failure, ‘why I did so, what does it actually
mean?’; highly critical and self-possessed;
doubtful if frustration induced.” Subject W—
“considered, deliberative, and nonmechanical
moves; shows frustration tolerance; did not lose
his smile; blamed the test, ‘very confusing’; no
feeling of disgust, ‘got confused,’ ‘very eager to
know why performed badly’; highly intelligent and
self-confident.” It is obvious that with respect to
these Ss, frustration was either not induced at all
or was not induced strongly. Their behavior also
failed to show signs of regression, i.e., lack of con-
structiveness, planning, or guidance from goal in
the choice of responses. A few even attempted a
rationalization of their failure.


Four Ss (M, N, Q, and X) showed either loss or
no gain in test VII. E's notes read as follows:
Subject M—“cautious, alert, deliberative, and
nonmechanical moves; ‘was getting nervous by
the knowledge of the poor performance in the
preceding test (pattern reproduction)’; felt dis-
gusted with the test; desired escape, ‘can it be
tried some time after?’; frustration induced from
the very beginning, so that performance in pattern
reproduction test really deteriorated very much.”
Subject N—“moves becoming thoughtless, un-
planned, and mechanical; preceding test ‘was very
confusing’; felt bad about himself; sought escape
and postponement; very highly frustrated.”
Subject Q—“highly nervous, shaking, mechanical
aimless moves, stereotypy; felt disgusted with
himself; sought escape twice; highly frustrated
and looked highly upset while leaving.” Subject
X—“moves getting mechanical, unplanned, pur-
poseless; sought escape thrice; ‘cannot do’; ‘test
(pattern reproduction) was bad, uninteresting’;
highly frustrated.” It appears that frustration
was strongly induced in all these Ss, and in most
cases the effects of frustration were clearly mani- 
fested; each S repeatedly expressed a desire for 
escape (getting out of the field). 
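
The grouping described in the two preceding paragraphs amounts to a small cross-tabulation of the ten experimental Ss. The Python sketch below tallies it. The labels "weak_or_absent" and "strong" are shorthand introduced here for the text's summary of E's notes, not E's own wording, and the tally is only a restatement of the reported grouping, not an additional statistical test.

```python
from collections import Counter

# (subject, showed a gain on test VII, summary of frustration induced),
# following the grouping given in the Discussion.
observations = [
    ("L", True, "weak_or_absent"),
    ("P", True, "weak_or_absent"),
    ("R", True, "weak_or_absent"),
    ("S", True, "weak_or_absent"),
    ("T", True, "weak_or_absent"),
    ("W", True, "weak_or_absent"),
    ("M", False, "strong"),
    ("N", False, "strong"),
    ("Q", False, "strong"),
    ("X", False, "strong"),
]

# Count Ss in each (gain, frustration) cell of the 2 x 2 layout.
counts = Counter((gained, level) for _, gained, level in observations)

for (gained, level), n in sorted(counts.items()):
    outcome = "gain" if gained else "loss or no gain"
    print(f"{outcome:>15} | {level:<14} | {n}")
```

All ten Ss fall into two cells: six with weak or absent frustration who gained, and four with strong frustration who did not. Because the frustration judgments come from E's own notes, the tally describes the reported pattern rather than confirming it independently.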

It is evident that the experimental Ss’ per- 
formance on the Passalong Test was not affected by 
their failure in the pattern reproduction test in all 
cases; therefore the difference in total gain between 
the experimental and the control groups, though 
in favor of the latter, was not large. The result is 
consistent with other findings. It has been per- 
sistently found in experiments on the dynamics 
of selective recall that there are individual differ- 
ences in response to ego threat (2, 6, 7, 8). Alper 
has drawn a distinction between the strong and the 
weak ego. The strong ego has a “high frustration 
tolerance for failure” and is less easily threatened. 
The weak ego has, on the other hand, “low frus- 
tration tolerance for failure,” and easily breaks 
down in the face of threatening situations (3, 
p. 130). Presumably those Ss in the experimental
group who did not show a carry-over of the effect 
of frustration, as indicated by a loss or no gain in 
test VII, were characterized by high frustration 
tolerance. 

The E's notes also indicated that frustration 
had a purely functional influence on S's per-
formance in the Passalong Test. There was no
evidence of mental preoccupation with the frustrat-
ing situation as a distracting factor that interfered
with S's performance. Only one S admitted that
his thoughts went back to the pattern reproduction
test while he was taking the Passalong Test. The
other Ss became completely absorbed in the
Passalong Test.


CONCLUSIONS 


1. The situation created in the laboratory to
[...] or frustration was strongly distressful and
[...] to the experimental Ss. Retest on the
Passalong provided an escape from the conflict and
worked as a substitute goal.


2. Increase in the difficulty of the Passalong Test
released an inhibition which masked the gain from
past performance. The inhibition may be related 
to the effect of frustration carried over from the 
conflict situation. 

3. Interference with the expression of the gain 
from past performance was not noticeable in all 
Ss. The only Ss who were affected were those who 
appeared to be strongly frustrated and whose per- 
formance betrayed some of the known marks of 
frustration. This result is consistent with the 
general finding regarding individual differences in 
response to ego threat. 

The results yield some indications of spread 
of the effects of frustration induced in one task to 
the performance of another task. 


REFERENCES


1. ALEXANDER, W. P. Intelligence, concrete and abstract.
London: Cambridge Univer. Press, 1935.

2. ALPER, THELMA G. Memory for completed and in- 
completed tasks as a function of personality: an 
analysis of group data. J. abnorm. soc. Psychol., 
1946, 41, 403-421. 

3. ALPER, THELMA G. Memory for completed and in- 
completed tasks as a function of personality: 
correlation between experimental and personality 
data. J. Pers., 1948, 17, 104-137.

4. BARKER, R., DEMBO, TAMARA, & LEWIN, K.
Frustration and regression: an experiment with
young children. Iowa City: Univer. of Iowa Press,
1941.

5. DOLLARD, J., DOOB, L. W., MILLER, N. E., MOWRER,
O. H., & SEARS, R. R. Frustration and
aggression. New Haven: Yale Univer. Press, 1939.

6. ERIKSEN, C. W. Defense against ego-threat in 
memory and perception. J. abnorm. soc. Psychol., 
1952, 47, 230-235. 

7. ERIKSEN, C. W. Individual differences in defensive
forgetting. J. exp. Psychol., 1952, 44, 442-446.

8. ERIKSEN, C. W. Psychological defenses and “ego-strength”
in the recall of completed and incompleted
tasks. J. abnorm. soc. Psychol., 1954, 49,
45-50.

9. LEWIN, K., LIPPITT, R., & WHITE, R. K. Patterns
of aggressive behavior in experimentally created
“social climates.” J. soc. Psychol., 1939, 10,
271-299.

10. MAIER, N. R. F. Frustration: the study of behavior
without a goal. New York: McGraw-Hill, 1949.

11. ROSENZWEIG, S., & SARASON, S. An experimental
study of the triadic hypothesis: reaction to
frustration, ego-defense and hypnotizability. I.
Correlational approach. Charact. & Pers., 1942,
11, 1-19.

12. ZELLER, A. F. An experimental analogue of repression.
The effect of induced failure and success on
memory measured by relearning. J. exp. Psychol.,
1950, 40, 411-422.


Received September 19, 1952.


THE RELATIONSHIP BETWEEN LEADERSHIP AND EMPATHY¹


GRAHAM B. BELL 
Louisiana State University 
AND 
HARRY E. HALL, Jr. 
Florida State University²


Various psychologists have suggested that the
selection of the leader of a group is a function
of his ability to satisfy the needs of the group mem- 
bers (1, 12, 13). 

As a corollary, it would seem to follow that if 
need theory is correct, the person selected as leader 
would have to be a person who was perceptive of 
the needs of group members, and able, because of 
certain personality characteristics, to act in such a 
fashion as to satisfy the greatest number and/or 
the most important of these needs. 

This paper is concerned with the first part of this 
corollary, the relationship between the individual’s 
ability to perceive needs and his leadership posi- 
tion. 

The ability to perceive the needs of others would 
seem to occur through becoming aware of the 
other’s phenomenological field. Most recently 
Dymond (4, 5) and Kerr (10, 11) have suggested 
that this ability might be termed empathy, and 
have proposed tests to measure this ability. 

In view of the need theory stated above and the 
existence of two tests which purport to measure 
the ability to perceive needs, it would seem ap- 
propriate to measure the relationship between 
empathy, as measured by Dymond’s and Kerr’s 
techniques, and leadership. 


PROCEDURE 


One-time, five-member, initially leaderless, 30- 
minute discussion groups were established of male and 
female students in elementary psychology classes. 
Participants were instructed to discuss a problem in 
human relations, and agree, as a group, upon a solution 
to the problem which was posed. Some of the students 
were given the same stories as previously reported in 
Bell and French (3); others were given more general 
problems in human relations. In all, 18 groups were 
established. Owing to conditions over which the experi- 
menter had no control, seven of these groups contained 
only four members. Scores of these four-man groups 
were translated into five-man group scores on the 
Dymond and leadership scales by the technique sug- 
gested by Guilford (7). At the end of the discussion the 
participants were asked to rank the other individuals
in the group in terms of leadership by listing their
preference for leader should the group meet again. At
this same time, the Dymond scales were given according
to her instructions. Kerr test scores were obtained on
Form B in the immediately preceding or following class
period.


1 It is the authors’ wish to express their appreciation
to the Graduate Research Council of L.S.U. for support
in completing part of the statistical analysis.

2 Then at Louisiana State University.


DISCUSSION AND CONCLUSIONS


The raw data for this experiment consisted of the
following information concerning each individual:
leadership rankings in each group, Dymond empathy
scores based upon each five-man group, and
Kerr empathy scores which were independent of
the groupings. In view of Bell and French's (3)
conclusions concerning the consistency of leadership
in one-time, initially leaderless groups, we felt
justified in considering the leadership scores obtained
in each separate group as a reasonable
estimate of the individual’s standing in any such
group, and combined all leadership ratings for
each separate group into one series and correlated
these with the results of the Kerr tests. In view of
Bell’s (2) results concerning the consistency of
Dymond scores in groups of different membership,
we combined the Dymond empathy scores for the
separate groups into a single series and correlated
these with the leadership scores.

The correlation between leadership position, as
measured by peer ratings, and empathy, as measured
by the Kerr, was .25. With 81 degrees of
freedom, this is significantly greater than zero at
better than the 1% level of confidence.

The correlation between Dymond empathy
scores and leadership as measured above was .18.
This falls slightly short of significance at the 5%
level. However, since this is approximately the
same size correlation (.19) that has been reported
by Bell (2), in a very different population of 90,
it would seem safe to estimate that if a large
group had been used, the same correlation would
have continued and significance would have been
demonstrated.

The correlation between Dymond and Kerr
scores appears to be near zero. Hall and Bell (9)
report .02. Gilbert (6) reports -.06.

When the correlations between leadership and
empathy are corrected for attenuation owing to
unreliability of the measures of empathy and
leadership, Kerr and leadership becomes .37; Dymond
and leadership becomes .25.
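
The correction for attenuation mentioned above is conventionally Spearman's formula, in which the observed correlation is divided by the square root of the product of the two reliabilities. The text does not report the reliability coefficients that were used, so the following Python sketch is illustrative only: the function names are hypothetical, and the second function merely back-solves the reliability product implied by the reported pairs (.25 to .37, and .18 to .25).

```python
import math


def correct_for_attenuation(r_xy, rel_x, rel_y):
    """Spearman's correction: observed r divided by sqrt(rel_x * rel_y)."""
    if not (0 < rel_x <= 1 and 0 < rel_y <= 1):
        raise ValueError("reliabilities must lie in (0, 1]")
    return r_xy / math.sqrt(rel_x * rel_y)


def implied_reliability_product(r_observed, r_corrected):
    """Back-solve rel_x * rel_y from an observed/corrected pair of correlations."""
    return (r_observed / r_corrected) ** 2


# Observed and corrected values as reported in the text.
kerr_product = implied_reliability_product(0.25, 0.37)
dymond_product = implied_reliability_product(0.18, 0.25)

print(round(kerr_product, 3))    # 0.457
print(round(dymond_product, 3))  # 0.518
```

On this reading, the reported corrections would correspond to a reliability product of roughly .46 for the Kerr-leadership pair and .52 for the Dymond-leadership pair; how that product divides between the empathy measure and the leadership rankings cannot be recovered from the text.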

For theoretical purposes it might be useful to 
estimate the amount of variance in leadership 
scores that could be attributed to empathic 
ability as measured by these two independent 
tests. A multiple correlation combining these two 
tests of empathic ability yields a correlation of 
.44, using Hall's (8) estimate of the correlation
between Kerr and Dymond. Thus it would seem 
that if our tests of leadership and empathy were 
perfectly reliable, we might expect to account for 
about 20% of the variance in leadership scores, on 
the basis of the empathic ability to understand the 
phenomenological fields of others, and conse- 
quently their needs. 
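
The multiple correlation for two predictors can be sketched with the standard formula R² = (r1² + r2² - 2·r1·r2·r12) / (1 - r12²). The value of Hall's (8) estimate of the Kerr-Dymond correlation is not stated in the text; the Python sketch below assumes .02, the figure attributed to Hall and Bell (9), purely for illustration, and the function name is hypothetical.

```python
import math


def multiple_r_two_predictors(r1, r2, r12):
    """Multiple correlation of a criterion with two predictors.

    r1, r2: correlation of each predictor with the criterion
    r12:    correlation between the two predictors
    """
    if abs(r12) >= 1:
        raise ValueError("predictor intercorrelation must be strictly between -1 and 1")
    r_squared = (r1 ** 2 + r2 ** 2 - 2 * r1 * r2 * r12) / (1 - r12 ** 2)
    return math.sqrt(r_squared)


# Corrected correlations from the text; r12 = 0.02 is an assumption.
R = multiple_r_two_predictors(0.37, 0.25, 0.02)

print(round(R, 2))       # 0.44
print(round(R ** 2, 2))  # 0.2
```

Under that assumption the sketch reproduces the reported multiple correlation of .44, and squaring it gives the "about 20%" of leadership variance mentioned above. Because the two empathy tests are nearly uncorrelated, R² is close to the simple sum of the two squared correlations.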

These findings are interpreted to indicate sup- 
port for the need theory of leadership. In view of 
the rather high correlations reported herein and 
by Van Zelst (14), it would seem appropriate to 
continue research in this general direction aimed 
toward the construction of more effective em- 
pathic measures and toward efforts to identify 
those situations and personal characteristics that 
make the leader able to satisfy or at least appear 
to satisfy the group needs. 


REFERENCES 


1. BARNARD, C. I. The functions of the executive.
Cambridge: Harvard Univer. Press, 1948.

2. BELL, G. B. The relationship between leadership
and empathy. Unpublished doctor's dissertation,
Northwestern Univer., 1952.

3. BELL, G. B., & FRENCH, R. L. Consistency of individual
leadership position in small groups of
varying membership. J. abnorm. soc. Psychol.,
1950, 45, 764-767.

4. DYMOND, ROSALIND F. A scale for the measurement
of empathic ability. J. consult. Psychol., 1949,
13, 228-233.

5. DYMOND, ROSALIND F. Personality and empathy.
J. consult. Psychol., 1950, 14, 343-350.

6. GILBERT, O. E. The relationship of schizophrenia 
and paranoia to empathy. Unpublished master's 
thesis, Louisiana State Univer., 1953. 

7. GUILFORD, J. P. Fundamental statistics in psychology
and education. (2nd Ed.) New York: McGraw-Hill,
1950.

8. HALL, H. E., JR. Empathy, leadership and art.
Unpublished master's thesis, Louisiana State
Univer., 1953.

9. HALL, H. E., JR., & BELL, G. B. The relationship
between two tests of empathy: Dymond’s and
Kerr’s. Paper read at Amer. Psychol. Ass.,
Cleveland, September, 1953.

10. KERR, W. A., & SPEROFF, B. J. Measurement of
empathy. Chicago: Psychometric Affiliates.

11. KERR, W. A., & SPEROFF, B. J. The empathy test:
supplement to manual. Chicago: Psychometric
Affiliates, 1951.

12. KRECH, D., & CRUTCHFIELD, R. S. Theory and
problems of social psychology. New York: McGraw-Hill,
1948.

13. NEWCOMB, T. M. Social psychology. New York:
Dryden, 1950.

14. VAN ZELST, R. H. Empathy test scores of union
leaders. J. appl. Psychol., 1952, 36, 293-295.


Received August 10, 1953. Prior Publication. 


— 


SEXUAL BEHAVIOR IN THE HUMAN FEMALE. By
A. C. Kinsey, W. B. Pomeroy, C. E. Martin,
and P. H. Gebhard. Philadelphia: W. B.
Saunders, 1953. Pp. xxx + 842. Price $8.00.

This volume is the second progress report made
by Kinsey and his research staff based on their
fact-finding investigation of human sex behavior.¹
The purpose of the survey is described as “an 
attempt ...to discover what people do sexually, 
what factors may account for their patterns of 
sexual behavior, how their sexual experiences have 
affected their lives, and what social significance 
there may be in each type of behavior." Most of 
the statistical data presented in the volume (179 
tables and 155 figures) are based on the sex his- 
tories of 5,940 white females. In addition, the sex 
histories of 934 nonwhite females and 915 white
females with prison records and of over 8,000 males
provide data that are used for comparative and 
interpretative purposes. In all, over 16,000 histories 
collected over a 15-year period by the staff of the 
Institute for Sex Research provide the basic re- 
search data on which this report is based. 

The case history data are supplemented with 
information drawn from (a) personal documents re- 
flecting sex behavior (diaries, etc.); (b) observa- 
tional data on sex activities of 14 species of 
mammals; and (c) biologic data from experimental 
investigations. This collation of sexual data from 
all available areas makes this report a much more 
valuable scientific contribution than the first 
report.

In Part I the authors describe in detail their 
method of securing personal histories, their sam- 
pling techniques, and the types of analyses which 
they applied to their data. The reasons they se- 
lected the methodologies and techniques which 
they employed are presented. They repeatedly call 
attention to the limitations and inadequacies of 
their study resulting from design and the popula- 
tion reported upon. They caution readers not to 
make overgeneralizations of their reported find- 
ings. All the sex histories were obtained through 
personal interviews conducted by professional 
members of the research staff. From 300 to 500 
questions adapted to the vocabulary and experi- 
ence of the individual were asked of each 
interviewee. The average interview required about 
two hours. The Kinsey staff feels that, other than 
direct observation and recording of sexual activi- 
ties, the face-to-face interview method produces 
the most reliable and valid data. 


1 First progress report: Kinsey, A. C., et al. Sexual 
behavior in the human male, Philadelphia: W. B. 
Saunders, 1948. 


158 


REVIEWS 


The investigators cite reasons why the usual 
statistical probability method of sampling is not 
practical in a study of human sex behavior. The 
most significant objection to using probability 
sampling in this area is the currently insurmount- 
able problem of getting a sufficiently high 
percentage of lone individuals so selected to give a
complete sex history to a total stranger. Other 
problems which probability sampling incurs are: 
(a) the near impossibility of defining groups which 
could be sampled by some probability scheme; (b) 
greatly increased costs of the investigation; and 
(c) the difficulty of obtaining adequate samples 
of individuals with unique histories (as female sex 
offenders, etc.) by random selection. The Kinsey 
group selected and developed a group sampling 
technique which proved feasible and practical 
for obtaining data on human sex behavior. Operationally
the staff selects certain types of groups
needed in making their total sample more complete
and representative. They find that by contacting
needed types of groups they can ultimately get all
or practically all of the group to give individual
histories. A majority of histories in the present
report was obtained from individuals who were
approached by a group selection method. The investigators
report that they have systematically
refused to accept histories of “stray individuals.”
The data on which the female report is based are
least applicable for groups over 50 years old, of
rural background, Catholic and devout Jewish
adherence, lower educational and economic levels,
and certain geographic locations. From the present
evidence it is doubtful that all of these variables
are significant factors.

The investigators’ keen awareness of the problems
of reliability and validity of their data is
reflected in their many and diverse efforts to check
on this aspect of the study. Certainly no previous
investigators have ever given as much attention
to the problem or devised as many techniques for
testing the validity and reliability of case history
data. Reliability of report is checked by carefully
examining the internal consistencies of the data in
each history, and by securing retakes of histories
after a minimum lapse of 18 months. Validity is
checked by: (a) comparison of the data reported
for the same events by two spouses or by the sexual
partners in any type of sexual activity; (b) comparing
memory for events which have been established
by observational techniques (such as indices
of onset of adolescence, etc.); and (c) comparing
the data from the present study with those obtained
in previous studies reported by other investigators.

Part II presents an analysis of the incidence and
frequency data for eight types of female sexual
activity: preadolescent sex play, masturbation,
nocturnal sex dreams, premarital coitus, marital
coitus, extramarital coitus, homosexual contacts
and responses, and animal contacts. The incidence
[...] adolescence, decade of birth, educational level,
marital status, age at time of marriage, parents'
occupational class, religious adherence, and rural-urban
background. A final chapter in this section
makes a similar analysis of these background
variables against the total sexual outlets (orgasms)
that females had obtained from the eight types of
sexual activity. Throughout this section comparisons
are made of the data derived from the female
with similar data drawn from the volume on
the male.

The variables of religious devoutness and
decade of birth were found to be the most influential
factors related to over-all sexual behavior
patterns of the females. In more select ways,
rural-urban background, age, and educational
level were shown to be related to certain sex
patterns. Contrary to findings for males, educational
level and age were not related to over-all
sexual behavior outlets for the females. The age of
the first orgasm was found to be a good predictor
of subsequent ability to perform sexually both in
solitary and sociosexual behavior. The uniqueness
of each individual's sex history and the tremendous
individual variation which occurs in sexual capacities
and activities is clearly established.

Besides the wealth of original behavioral data
presented in Section II, attention should be called
to the scholarly and exhaustive documentation
made of earlier studies and reports of human
sex activities and attitudes. Materials that are
[...] range from the earliest records of man down
to the most recent scientific reports. The historical
origin and evolution of much of our knowledge
and of our misconceptions regarding sexual behavior
[...] have developed regarding sexual behavior
and attitudes. The report points out some of the
problems which the individual faces in making a
[...] as a result of our cultural [...].

The report makes another valuable contribution
in collecting and comparing the sexual data derived
from [...] mammalians and lower level species
with the human behavioral data. This phylogenetic
[...] proved more valuable in understanding
and interpreting the human sex data than might
have been expected. Anthropological data bearing
upon sexual behavior of preliterate groups were
also employed, but their usefulness was limited
since little of the data in this area have been
collected systematically.

Part III is devoted to a comparison of the male
and female with respect to (a) sexual anatomy; (b) 
the physiology of sexual response and orgasm; 
(c) neural and hormonal factors related to sexual 
response; and (d) certain psychological factors 
regarding sexual arousal and response. The authors 
have performed a very thorough and scholarly task 
in locating, digesting, and correlating the data on 
these various aspects of sexual biology and be- 
havior. A considerable body of data is marshalled 
to support the thesis that sexual arousal and 
orgasm are fundamentally a neuromuscular tension 
response system. The anatomical, neurological, 
physiological, hormonal, and psychological factors 
which appear to be involved in the response pattern 
are considered in light of evidence. The summary 
statements and evaluations of the data bearing 
upon the biologic basis of sexual behavior make a 
valuable contribution to our understanding and a 
stimulus to further research. 

A major psychological finding reported is that 
the male is conditioned by sexual experience much 
more extensively than the female. Most males re- 
spond to a variety of psychological stimuli, while 
only a few females can do so. Consequently sexual 
arousal and response for the male result from both 
physical and psychological stimuli. Sexual arousal 
for most females is dependent upon actual physical 
stimulation. Because this difference between males 
and females is often an unrecognized factor, it is a 
source of much misunderstanding and of problems 
of adjustment for many heterosexual partners. 

It will probably be another generation before 
a good evaluation can be made of the total impact 
of the Kinsey studies upon the scientific and social 
world. Probably one of the major contributions 
will be his breaking through the taboos and mores 
which previously had barred science from investi- 
gating this fundamental aspect of life and social 
living. 


GLENN V. RAMSEY

Austin, Texas


GROUP PSYCHOTHERAPY: STUDIES IN METHODOLOGY
OF RESEARCH AND THERAPY. Report
of a group psychotherapy research project
of the U. S. Veterans Administration.
Cambridge: Harvard Univer. Press, 1953.
Pp. xv + 615. Price $6.50.

It has rapidly become apparent that group 
psychotherapy, begun chiefly as an economical 
substitute for individual treatment, generates a 
unique type of experience, rewarding in itself. The 
complexity and emotional significance of this group 
experience is beautifully brought out in the report 


e 


160 


by Florence Powdermaker, Jerome D. Frank, and 
their collaborators. Sponsored as a forward-looking 
venture in 1947 by the Veterans Administration, 
the project described in this volume was probably 
the first large-scale formal research into the process 
of group psychotherapy. Publication of this volume 
is, therefore, an important milestone in the history 
of a treatment method which, with its spread from 
institutions to private consulting rooms, may soon 
have a marked impact on our social fabric. 

The two major objectives of the investigation 
were to develop methods for studying the process 
of therapy and the specific problems faced by 
therapists, and to develop methods which could 
provide solutions to the problems isolated by the 
analysis of the treatment process. The first objec- 
tive was almost completely exploratory, under- 
taken with a minimum of assumptions other 
than the delimitation of the therapeutic technique 
to one "analytic in orientation," thus ruling out 
a didactic or discussion-group approach of the type 
widely prevalent shortly after the war. The second 
objective, testing outcomes of treatment or di- 
vergent techniques of group composition or dealing 
with problems, involves more formal research 
procedures. 

The book, like the project, has two main divi-
sions, the research with neurotic clinic patients and 
a controlled study of group therapy with hos- 
pitalized schizophrenics. To attain their objectives 
the authors organized a large, interdisciplinary 
research team of psychiatrists, psychologists, and 
social workers. The psychiatrists served as 
therapists while the psychologists and social 
workers functioned as observers in the various 
groups and evaluated progress by testing and 
interviewing patients at various stages of treat- 
ment. The basic data of the research project 
consisted essentially of the observers’ running 
accounts of sessions, checked occasionally against 
tape recordings and, apparently (according to data
presented in an appendix), only minimally dis- 
torted from the original verbalization of the 
group. These running accounts were annotated 
for dynamic interplay following discussion by 
team members. From these running accounts the 
investigators, in accordance with their emphasis 
on the Sullivanian concept of the interpersonal 
situation, have abstracted sequences of events 
"showing cause and effect relationships described 
with reference to the settings in which they oc- 
curred. Such a situation formed a kind of pattern, 
which, when systematically compared, added to 
our understanding of treatment" (p. 24). 

This laborious comparison of situations within 
and between groups serves as the chief means 
by which tentative conclusions are drawn within 
each chapter. Fascinating and instructive as these 
situational analyses are, they must of necessity be 


REVIEWS 


highly selected from a mass of material and they 
involve considerable subjectivity. Compiling them 
into elaborate tables only gives the appearance of 
formal analysis; actually this tabular method only 
slows up the reader who takes the trouble to ex- 
amine them and offers little evidence acceptable 
from a strict scientific viewpoint. Whatever the 
failings of this approach, however, it is clear that 
the investigators are sticking closely to their 
original data in deriving generalizations and are 
providing concrete material from which the reader 
can draw his own conclusions. Such relative ob- 
jectivity is rare indeed in books on group or indi- 
vidual psychotherapy. 

It is in its exploration of the problems and 
complexities of the therapeutic process and of 
research in this area that this book is most satis- 
fying. Among the host of problems analyzed in 
detailed and stimulating fashion are such matters 
as optimal composition of the group, initiating 
group treatment, relative emphasis on individual 
or group problems, dealing with the habitual 
“monopolist,” resolving tension through “rallying 
topics,” dealing with hostility toward the therapist 
or other group members, group silences, intro- 
ducing new members, and the personality of the 
therapist in relation to various aspects of treat- 
ment. These and other problems peculiar to the 
treatment of hospitalized schizophrenics are dis- 
cussed with remarkable clarity, honesty, absorbing 
documentation, and a minimum of dogmatic 
theorizing. A host of practical suggestions which 
cannot fail to be useful even to the experienced
group therapist emerge from consideration of these 
problems. 

Of comparable value for the psychologist con- 
cerned with research in therapy are the discussions 
and exemplifications of the practical and method- 
ological problems faced by the investigators. 
Technical problems discussed include such matters 
as deciding on the proper unit of analysis in the 
group data, selecting appropriate rubrics for 
classifying patients, establishing suitable controls, 
and choosing meaningful criteria of change or improvement.
Practical difficulties such as integrating
group therapy research into the framework of a
service-oriented clinic or hospital, harmonizing the
diverse language patterns of the interdisciplinary
teams, or problems arising from the use of relative
beginners (psychiatric residents) as group
therapists are also brought out and intelligently
discussed. It is when the investigators go beyond
exploration to the testing of specific hypotheses or
determinations of therapeutic outcome that the
present volume becomes least satisfying. The
authors are clearly cognizant of the extensive
social psychological research on the group process
and dynamics, yet they simply report that these
did not prove suitable for their purposes. They
assert that “...in the field of psychotherapy,
preoccupation with controls and experimental 
design is premature and based upon the misappre- 
hension that these methods themselves lead to the 
discovery of significant relationships" (p. 8). To 
this reviewer this disclaimer seems to be a 
rationalization for lack of resourcefulness in devis- 
ing units or dimensions of analysis which could to 
some extent be studied in relative isolation by 
appropriate experimental and statistical methods. 
The work of the nondirective group in analysis of 
individual therapy or of the group dynamics in- 
vestigators has suggested that relevant features 
of an ongoing therapeutic process may indeed be 
handled fairly objectively. As a matter of fact, 
within this volume Maas and Varon’s work on 
analysis of ward behavior, employing fairly ob- 
jective criteria and reliability checks, provides a 
good example of just such a dimensional analysis.

Similarly disappointing to the psychologist are 
the chapters describing research on the outcomes 
of therapy. This is so not so much because the 
results are favorable or unfavorable, but rather 
because they are, in a manner of speaking, so 
decidedly inconclusive. Although the authors have 
studied a host of relevant criteria, the variety of 
conditions treated, the relatively small numbers 
who persisted long enough in treatment, the 
various administrative changes, and possibly the 
extensive reliance on relatively inexperienced or 
unskilled therapists are factors which make a clear- 
cut decision about the long-range benefits of 
therapy impossible from this study. To what extent
the likelihood of such an outcome might have been
averted by initial planning is not clear. It is much 
too easy to second-guess in a rapidly developing 
field such as this, particularly when one cannot 
help but be struck with the practical difficulties 
facing a large-scale evaluative study carried out in 
a setting geared primarily for service. 

Whatever its failings as pure research, there can 
be no question that this is an important and 
useful book. Technically it is well put together, 
beautifully printed, and written in a style that is 
exceptionally clear and free of esoteric terminology. 
Occasionally one might hope for a somewhat more 
explicit statement of the theory undoubtedly un- 
derlying certain interpretations, but this does not 
seem a serious defect. A book like this should 
command a wide audience of research workers 
concerned with problems of group dynamics and 
therapy as well as of experienced and aspiring 
group therapists. This reviewer feels that it can 
ultimately prove most useful as a textbook on 
group psychotherapy for beginners. Read a chapter 
at a time, it should be invaluable for graduate 
courses or in-service seminars on group psycho- 
therapy because of its excellent examples, attention 
to practical problems, and clear summary sections. 
As a pioneer exploration into little-known terrain 
this work will serve as a stimulating reference for 
workers in several disciplines for years to come. 


JEROME L. SINGER 


Veterans Administration Hospital 
Montrose, New York 


NEW BOOKS RECEIVED 


ABRAHAM, K. Selected papers on psychoanalysis.
New York: Basic Books, 1953. No pagination. 
Price $6.00. 

ALEXANDER, F., & Ross, HELEN. (Eds.) Twenty 
years of psychoanalysis. New York: Norton, 
1953. Pp. 308. Price $3.75. 

BOSSARD, J. H. S. Parent and child. Philadelphia:
Univer. of Pennsylvania Press, 1953. Pp. 308. 
Price $5.00. 

Burt, C. Contributions of psychology to social 
problems. London: Oxford Univer. Press, 
1953. Pp. 76. Price $1.00. 

CAVANAGH, J. R., & MCGOLDRICK, J. B. Fundamental
psychiatry. Milwaukee: Bruce, 1953.
Pp. x + 582. Price $5.50.

CONFERENCE ON HEALTH AND HUMAN RELATIONS,
Detmold, Germany. Health and human relations.
Toronto: Blakiston, 1953. Pp. xx + 192.
Price $4.00.

Cressey, D. R. Other people's money. Glencoe, 
Ill.: Free Press, 1953. Pp. 191. Price $3.00. 

Dax, E. C. Experimental studies in psychiatric art. 
Philadelphia: Lippincott, 1953. Pp. 100. 
Price $5.00. 

Dzvrscg, K. W. Nationalism and social communi- 
cation. New York: Wiley, 1953. Pp. x + 292. 
Price $5.00. 

Festincer, L., & Karz, D. Research methods in 
the behavioral sciences. New York: Dryden, 
1953. Pp. xi + 660. Price $5.90. 

GorpHAMER, H., & MARSHALL, —. Psychosis and 
civilization. Glencoe, Ill.: Free Press, 1953. 
Pp. 126. Price $4.00. 

HarHawAY, S. R., & Monacuest, E. D. (Eds.) 
Analysing and. predicting juvenile delinquency 
with the MM PI. Minneapolis: Univer. of Min- 
nesota Press, 1953. Pp. vi + 153. Price $3.50. 

Hemicke, C. & Warme, BEATRICE B. Bibliog- 
raphy on personality and social development in 
the child. New York: Soc. Sci. Res. Coun., 

1953. Pp. viii + 130. Price $1.00. 

Jone, C. G. Psychology and alchemy. New York: 
Pantheon Books, 1953. Pp. xxiii + 563. 
Price $5.00. 

Jung, C. G. Two essays on analytical psychology. 
New York: Pantheon, 1953. Pp. 329. Price 

KALLMANN, F. J. Heredity in health and mental 
disorder. New York: Norton, 1953. Pp. 315. 
Price $6.00. 

KraL, S. S., & Bergius, R. Nationale vorurteile. 
Berlin: Duncker & Humblot, 1953. Pp. 94. 
Price not supplied. 

Kubie, Susan H., & LANDAU, GERTRUDE. Group 
work with the aged. New York: International 
Universities Press, 1953. Pp. 214. Price $3.50. 

Lacey, O. L. Statistical methods in experimentation. 
New York: Macmillan, 1953. Pp. ix + 249. 
Price $4.50. 

Moreno, J. L. Who shall survive—foundations of 
sociometry, group psychotherapy and socio- 
drama. New York: Beacon House, 1953. Pp. 
cxiv + 763. Price $10.00. 

NAUMBURG, MARGARET. Psychoneurotic art: its 
function in psychotherapy. New York: Grune & 
Stratton, 1953. Pp. x + 148. Price $6.75. 

Morse, Nancy C. Satisfaction in the white-collar 
job. Ann Arbor: Univer. of Michigan Press, 
1953. Pp. viii + 235. Price $3.50. 

Nuttin, J. Psychoanalysis and personality. New 
York: Sheed & Ward, 1953. Pp. xiv + 310. 
Price $4.00. 

Peters, R. S. (Ed.) Brett’s history of psychology. 
New York: Macmillan, 1953. Pp. 742. Price 
$7.50. 

Piaget, J. The child's conception of number. New 
York: Humanities Press, 1953. Pp. ix + 248. 
Price $5.00. 

PINTNER MEMORIAL COMMITTEE. Rudolph Pintner, 
Nov. 1884—Nov. 1942. Washington: Gallaudet 
College Press, 1953. Pp. 63. Gratis. 

Powys, J. C. In spite of. New York: Philosophical 
Library, 1953. Pp. 312. Price $5.00. 

Roe, Anne. The making of a scientist. New York: 
Dodd, Mead, 1953. Pp. 244. Price $3.75. 

STEINFELD, Julius I. Contribution to the etiology, 
therapy and prophylaxis of schizophrenia. 
Des Plaines, Ill.: Forest Press, 1953. Pp. 31. 
Price not supplied. 

TRANSACTIONS OF THE FOURTH CONFERENCE. 
Nov. 12, 13, 14, 1952. Adrenal cortex. New 
York: Josiah Macy, 1953. Pp. 165. Price $3.50. 

Viteles, Morris S. Motivation and morale in 
industry. New York: Norton, 1953. Pp. xvi + 
510. Price $9.50. 


von Foerster, H. (Ed.) Cybernetics: circular 
causal and feedback mechanisms in biological 
and social systems. New York: Josiah Macy, 
1953. Pp. xx + 184. Price $4.00. 

Wormhoudt, A. The muse at length: a psycho- 
analytic study of the Odyssey. Boston: 
Christopher, 1953. Pp. 159. Price $2.75. 




THE EFFECT OF SHOCK ON RECOGNITION THRESHOLDS¹ 


MICHAEL M. REECE* 
New York University 


AN INCREASING amount of research 
concerning the nature of perceptual 
processes is currently being done by 
investigators in the fields of social, personality, 
and clinical psychology. 

Several of these investigators (2, 6, 10) 
approach perception from such points of view 
as the relationship of motivation to percep- 
tion, the functional value to the organism of 
perceptual behavior, and the like. A number 
of experiments have resulted in data which are 
interpreted as indicating the importance of 
needs and values in “determining” perception. 

Other investigators decry the lack of ade- 
quate operational definitions in much of this 
kind of perceptual research. They feel that 
the evidence in favor of emotional determi- 
nants of perception is inadequate (12). They 
regard perception from the viewpoint of a 
response system reacting to varying stimulus 
configurations, but they disagree with the 
concepts of the former group concerning the 
kind of variables which may be expected to aid 
in an explanation of the phenomena. These 
workers feel that a great deal of perceptual 
behavior may be more adequately and par- 
simoniously described in terms of established 
principles of “modern association theory” 
(e.g., frequency); that perception is always 
inferred from responses of the organism and 
that these responses are learned (12). For 
them, the analysis of perceptual behavior is, 
therefore, part of the general problem of 
learning in terms of S-R relationships. 

Members of both groups agree that the effect 
of psychological variables (e.g., habit strength) 
on perception should be further investigated. 

A number of studies have utilized recognition 
thresholds as a means of investigating the effect of such 
factors as frequency and value upon percep- 
tion (2, 10, 12). Verbal recognition as a re- 
action to a specific stimulus situation is a 
learned response. As such, its acquisition may 
be assumed to proceed in accordance with the 
general principles of learning, i.e., in the same 
manner as does the acquisition of any response 
in any learning situation. 

¹ This article is based on a dissertation submitted 
to the Graduate School of Arts and Sciences, 
New York University, in partial fulfillment of the re- 
quirements for the degree of Doctor of Philosophy. 
The author is indebted to Drs. L. W. Crafts, M. 
Deutsch, R. Gilbert, L. S. Kogan, and J. Zubin. 
The helpful corrections and criticisms of Dr. J. McV. 
Hunt in the [...] of this study are also gratefully 
acknowledged. 

* Now at Wayne University. 

Among these principles are the association 
of a stimulus situation and the response it 
evokes. In Dollard and Miller's formulation 
(3), the stimulus acquires both a cue function 
and a drive function. Miller (3) also has 
described the manner in which previously 
neutral stimuli acquire a drive function moti- 
vating escape behavior when these stimuli 
are associated with pain and fear. 

In reward learning theory, the association 
of stimulus and response depends upon rein- 
forcement. For Hull (5) and his followers, 
reinforcement occurs through drive reduction. 
In the terms of Dollard and Miller, “a drive 
tends to elicit a considerable variety of re- 
sponses and has a stronger tendency to elicit 
some of them than others" (3, p. 192). Pain 
is included among the primary drives. Mowrer 
has shown how escape from punishing electric 
shock reinforces a response (8). For this study, 
it is assumed that the sudden reduction of a 
drive reinforces the response with which it is 
associated. 

According to reinforcement theory, drive 
impels the subject (S) to respond. In a specific 
situation, stimuli evoke a complex of responses. 
The response which successfully reduces the 
drive is strengthened and subsequently, in a 
similar situation, is more readily evoked in 
the presence of drive. If experimental arrange- 
ments in a learning situation permit a particu- 
lar response (e.g., verbal recognition) to be 
associated with reduction of a drive (e.g., 
pain) for some Ss and prevent such association 
for other Ss, that response would be reinforced 
relatively more in the Ss in whom the drive is 
reduced. 

The present experiment attempts to associ- 
ate pain (induced by shock) with stimuli 
(nonsense syllables) during a learning task. 
This task requires the recognition and pro- 
nunciation of the same stimuli which are used 
in a perceptual recognition situation. 

In the learning conditions which allow the 
escape from the pain, half of the stimuli 
presentations (i.e., stimulus and response 
syllables simultaneously presented) are associ- 
ated with shock. The shock elicits pain which 
motivates escape behavior. The verbal re- 
sponse (i.e., pronunciation) is followed by the 
cessation of shock. Theoretically the escape 
from the shock reduces the pain and reinforces 
the response. With continued training, it is 
predicted that the latency of this response 
will decrease and readiness to respond increase. 

In the nonescape-shock conditions, half of 
the syllable-pairs are associated with shock 
in the same manner as in the escape-shock 
conditions. The shock elicits pain which 
evokes attempts to escape from the pain. 
However, in these conditions, the shock per- 
sists for the entire duration of the shock- 
syllable presentation. The pronunciation re- 
sponse is not followed by the cessation of 
shock and therefore is not reinforced as it is 
in the escape-shock conditions. No reward is 
provided for recognition and pronunciation. 
There may also be an inhibitory effect upon 
pronunciation owing to the competing and 
interfering responses which may result from 
the strong shock. 

From the foregoing, it follows that the 
latency of the pronunciation response should 
be decreased more in the escape-shock condi- 
tions than in the nonescape-shock conditions 
because this response is reinforced more in 
the former conditions. As recognition is 
inferred from pronunciation, the readiness to 
recognize the stimuli associated with shock 
should be relatively greater in the escape- 
shock conditions than in the nonescape-shock 
conditions. 

In an attempt to investigate the effects of 
differences in the acquired drive function of 
stimuli upon the recognition response, the 
experiment included conditions of varying 
potential predictability. The arrangements 
provided predictable shock conditions in which 
discrimination between shock and nonshock 
syllables was considered possible because the 
same syllables were consistently accompanied 
by shock, and other syllables were free from 
shock. As there would be no shock associated 
with certain syllables, the drive to escape 
would not be related directly to these syllables. 
It was felt that the cue for the onset of shock 
and the acquired drive to escape from the 
shock would be restricted to the shock syl- 
lables. 

The design also provided unpredictable 
shock conditions in which no discrimination 
between shock and neutral syllables was pos- 
sible because the shock was administered in 
a random manner. It was felt that every syl- 
lable would become a cue for possible shock 
and that the drive to escape from the shock 
would become associated with each syllable. 

The posttraining tachistoscopic condition 
measures the thresholds of all syllables previ- 
ously used in the learning task. The syllables 
retain the drive and the cue functions which 
they acquired during the training procedure. 
Each syllable evokes the associated response 
characteristics (e.g., decreased or increased 
latency) developed during the learning. 

The basic hypothesis for the study is that 
the mean recognition-report threshold of 
nonsense syllables, half of which were previ- 
ously associated with shock in a learning 
situation permitting escape from the shock, 
will be lower than the mean threshold of 
syllables, half of which were previously associ- 
ated with inescapable shock in a similar 
situation. 

METHOD 

Subjects. The Ss were 70 undergraduate students 
divided into five equal groups. All Ss were volunteers 
and were assigned at random to the various conditions. 

A factorial design was used to investigate the factors 
of escapability and predictability of shock. The groups 
were predictable escape-shock, unpredictable escape- 
shock, predictable nonescape-shock, and unpredictable 
nonescape-shock. A nonshock group was added to test 
the effects of frequency and of pronunciation without 
shock. 

The visual acuity of each S was first determined by 
means of a Snellen Chart. The sequence of conditions 
of the experiment was then as follows: (a) initial ta- 
chistoscopic determination of recognition-report thresh- 
olds of the nonsense syllables, (b) first rating of pain 
induced by shock, (c) learning of the paired associates, 
(d) second rating of pain, (e) postlearning determina- 
tion of recognition-report thresholds. 

Learning procedure. A list of 20 monosyllabic, three- 
letter syllables from Glaze’s list (4) of “zero per cent 
association value” was used to form 10 paired associates. 
Two sets of lists were prepared, with all of the syllables 
which appeared on one set as stimulus syllables used 
as response syllables in the other set, and vice versa. 

The lists were presented in fixed rotating order (i.e., 
for S1 of each group the sequence of lists was 123412341234, 
for S2 of each group the sequence was 234123412341, 
for S3 the sequence was 341234123412, etc.). 
Shock syllables were randomly selected and presented 
to Ss in a rotated order so that each syllable-pair ap- 
peared as a shock-pair. For the predictable shock 
conditions, the same syllable-pairs were consistently 
accompanied by shock and the remaining were con- 
sistently free from shock for any given individual. For 
the unpredictable shock conditions, the shock syllables 
were rotated in each trial so that, for each S, every 
pair of syllables appeared as a shock-pair during the 
learning procedure. 
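
The fixed rotating order of lists described above can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and the assumption that the rotation wraps around after the fourth subject (so that S5 repeats the order of S1) is inferred from the "etc." in the text rather than stated in it.

```python
def list_sequence(subject_number, n_lists=4, n_trials=12):
    """Order of lists over the learning trials for one subject.

    subject_number is 1-based (S1, S2, ...). The starting list is rotated
    by one position for each successive subject; wrap-around after the
    last list is an assumption, not stated in the source.
    """
    start = (subject_number - 1) % n_lists
    return [((start + trial) % n_lists) + 1 for trial in range(n_trials)]


def as_digits(sequence):
    """Render a sequence of list numbers as a digit string, e.g. '1234...'."""
    return "".join(str(item) for item in sequence)


if __name__ == "__main__":
    for s in (1, 2, 3):
        print(f"S{s}: {as_digits(list_sequence(s))}")
```

For S1, S2, and S3 the sketch reproduces the three sequences given in the text.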

A modified Bennett-Brown memory drum was used 
to present the syllables. The anticipation-prompting 
method was used, which involved (a) the presentation 
of the stimulus syllable alone followed by (b) the 
simultaneous presentation of this stimulus syllable and 
its associated response syllable. The Ss were instructed 
to anticipate aloud the response syllable when its as- 
sociated stimulus syllable was presented, regardless 
of whether or not the anticipation was correct. Presen- 
tation time of the syllables was two seconds. Inter- 
syllable-pair interval was two seconds and the inter- 
trial interval was six seconds. Learning was terminated 
at the end of the twelfth trial. 

Shock. The administration and intensity of the 
shock were manually controlled by E. A milliammeter in 
the shock apparatus provided continuous readings of 
intensity. The electroband was fastened on the right 
arm so that the focal zinc electrode was placed on the 
volar surface of the wrist. Contact was reinforced by 
means of electrode paste. 

The shock was administered simultaneously with 
the presentation of the predetermined shock-syllable 
pairs. For the escape-shock groups the pronunciation 
of the response syllable was followed by cessation of the 
shock. For the nonescape-shock groups, shock persisted 
for the entire duration of exposure of the shock-pair 
on the memory drum. The intensity of shock for each 
S was set at the point at which a rating of strong pain 
was elicited. Ratings also determined the point at which 
pain was first felt and the point at which the pain was 
intolerable. The intensity was gradually increased 
during the learning period so that the intensity of shock 
during the last trial was as much as or more than that 
which previously had been rated by S as intolerable. 

Postlearning ratings of shock were obtained in order 
to provide a means of estimating the extent of adapta- 
tion to the shock. 

Recognition tests. The nonsense syllables of the 
learning task were tachistoscopically presented in 
three random orders prior to, and after, the learning. 
The ascending method of limits was used to determine 
the recognition thresholds. The first exposure duration 
was 10 msec. The duration of each succeeding exposure 
thereafter was increased by 20 msec. The exposure 
[...] termed the threshold for that syllable. 
The threshold for a syllable was completely determined 
before proceeding to the next syllable. 
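
As an illustration of the ascending method of limits, the following Python sketch steps the exposure duration upward until a syllable is reported. The starting duration (10 msec.) and the increment (20 msec.) are the values printed in the text; the stopping rule (the first correct report defines the threshold), the upper limit `max_ms`, and the simulated observer are assumptions made only for this example, since the source passage defining the threshold is incomplete.

```python
def ascending_threshold(is_recognized, start_ms=10, step_ms=20, max_ms=2000):
    """Return the first exposure duration (msec.) at which the syllable is
    reported correctly, or None if max_ms is exceeded.

    is_recognized: callable taking a duration in msec. and returning True
    when the observer reports the syllable correctly (hypothetical).
    """
    duration = start_ms
    while duration <= max_ms:
        if is_recognized(duration):
            return duration
        duration += step_ms
    return None


if __name__ == "__main__":
    # Hypothetical observer who can report the syllable at 75 msec. or longer.
    observer = lambda duration_ms: duration_ms >= 75
    print(ascending_threshold(observer))
```

Because exposures rise in fixed steps (10, 30, 50, 70, 90, ...), the measured threshold is the first step at or above the observer's true limit, so the procedure can overestimate that limit by up to one step.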
Apparatus. An electronically controlled tachisto- 
scope of the projection type was used which permitted 
[...] of exposure durations. Slides (2 in. X 2 
in.) of the syllables were projected by means of a TDC 
projector on the back of a Trans-Lux screen, with 
[...] opening and closing of the tachistoscope shutter which 
[...] the projected beam of light. The brightness 
of the screen was one foot candle (measured by the 
light target held perpendicular to the front of the screen 
at a distance of 6 in.). Eye-level illumination at S’s 
position, 10 feet from the screen, was five foot candles. 

During the postexperimental interview, S was asked 
to state his interpretation of the shock. Other questions 
concerning his reactions to the experimental conditions, 
feelings, attitudes, etc. were included. The interview 
was clinically oriented, and particular care was taken 
to allay any fears which may have resulted from the 
experience of shock. 


RESULTS 

Adaptation to Shock 

A necessary condition for this study was 
the experience of pain by the Ss during the 
learning task. An administration of electric 
shock does not automatically assure an 
experience of pain since individuals vary 
greatly in their reactions to shock. Hence it 
was necessary to obtain an initial rating from 
each S of the pain induced by the shock in 
order to assure a roughly equivalent degree 
of pain in all shock Ss. However, repeated 
shocks tend to result in a lessened reaction to 
the shock; adaptation to shock is evidenced. 
The conditions of the present experiment 
necessitate a minimum of such adaptation. 

In order to provide an estimate of adapta- 
tion to the shock, postlearning ratings were 
also obtained and were compared with the 
prelearning ratings. Only three Ss (5.4 per 
cent) gave postlearning ratings of little or no 
pain, yielding a decrease of more than two 
units on the rating scale. Of these Ss, one was 
in the predictable nonescape-shock group and 
two were in the unpredictable nonescape- 
shock group. Of the other Ss (94.6 per cent), 
not one gave a postlearning rating of less 
than moderate pain; one S showed a decrease 
of two scale units, 13 Ss showed a decrease 
of one scale unit, and 32 Ss showed no decrease. 

These data suggest that the other results 
should not have been markedly affected by 
adaptation to the shock. 


Recognition-Report Thresholds 

Means. A comparison of the differences in 
thresholds obtained before and after the learn- 
ing (shock) sessions would be misleading. An 
individual who tends to have “high” thresh- 
olds can have greater absolute differences 
than the individual who exhibits “low” 
thresholds although the reduction between 
prelearning and postlearning thresholds may 
be proportionately the same. The use of 
proportions as scores tends to make such 
differences more comparable. 
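
The argument for ratio scores can be illustrated with a short Python sketch. The threshold values below are hypothetical and chosen only to show the point: two Ss with very different absolute reductions can have the same proportional reduction.

```python
def threshold_ratio(pre_ms, post_ms):
    """Ratio of postlearning to prelearning threshold.

    Values below 1 indicate a reduction in threshold after learning.
    """
    if pre_ms <= 0:
        raise ValueError("prelearning threshold must be positive")
    return post_ms / pre_ms


if __name__ == "__main__":
    # Hypothetical Ss: one with "high" thresholds, one with "low" thresholds.
    high_pre, high_post = 200, 150
    low_pre, low_post = 80, 60

    print("absolute reductions:", high_pre - high_post, low_pre - low_post)
    print("ratios:", threshold_ratio(high_pre, high_post),
          threshold_ratio(low_pre, low_post))
```

In this example the absolute reductions differ (50 msec. against 20 msec.), while both ratios are .75.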


[TABLE 1: mean recognition-report threshold ratios (postlearning to prelearning) by group, with rows for Stimulus syllable and Response syllable; cell values not recoverable] 


TABLE 2 
EFFECT OF SHOCK ON RECOGNITION-REPORT THRESHOLDS 

SOURCE OF VARIATION      MEAN SQUARE    df     F      p 
Shock escapability          105.44       1    7.32   .01 
Shock predictability          5.12       1     .36 
Interaction, E X P            6.95       1     .48 
Indiv. diff.                 14.41      52 


Table 1 shows the scores for mean recogni- 
tion-report thresholds for the groups expressed 
as ratios of postlearning thresholds to pre- 
learning thresholds. Bartlett’s test for ho- 
mogeneity of variances (11) was applied to 
these data and indicates that these groups 
are comparable in variance. 

It is noted that the ratios of all groups are 
less than one. The escape-shock groups have 
the lowest scores, indicating the greatest 
reduction between prelearning and postlearn- 
ing thresholds. The nonescape-shock groups 
have the highest scores, showing the least 
reduction in thresholds. 

Shock- and neutral-syllable thresholds. It was 
expected that stimulus generalization effects 
would occur to some extent. In order to evalu- 
ate these effects, the recognition thresholds 
of shock and neutral syllables were compared 
by means of the analysis of variance tech- 
nique. This treatment is applicable only to 
the data of the predictable shock groups for 
which the specific shock syllables and neutral 
syllables were always the same. 

The analysis of variance of stimulus- and 
of response-syllable thresholds did not yield 
any significant differences between the thresh- 
olds of shock syllables and neutral syllables 
within the escape-shock or within the non- 
escape-shock conditions. It is possible that 
stimulus generalization prevented any differ- 
ence from appearing in the thresholds between 
the shock and neutral syllables within each 


group. 


Shock predictability and escapability.² Since 
there is no significant difference between the 
shock- and neutral-syllable thresholds in the 
predictable shock groups, the mean thresholds 
for the entire list of syllables can be compared 
for the predictable shock and the unpredict- 
able shock groups. 

The factorial arrangement of the shock 
groups allows the use of the double classifica- 
tion analysis of variance method (7) to evalu- 
ate the factors of predictability (predictable 
and unpredictable shock) and of escapability 
(escape and nonescape shock). 

This technique applied to the stimulus 
syllables yields an F of 1.91 for predictability, 
an F of 1.36 for escapability, and an F of 2.37 
for the interaction of predictability and es- 
capability. None of these F’s is significant. 

Table 2 gives the results for the response 
syllables. It may be seen that the predictability 
factor yields no significant difference. But the 
escapability factor shows a difference which 
is significant (p = .01). The hypothesis that 
there is no difference between these groups 
except that due to chance can be rejected 
with confidence. The nonescape-shock groups 
show higher threshold ratios than the escape- 
shock groups. 

Ignoring the factor of predictability (shown 
to be insignificant) and regrouping the data 
of the shock groups allow comparisons among 
these three groups: the escape-shock group, 
the nonescape-shock group, and the nonshock 
group. A significant t of 2.75 (p < .01) appears 
for the difference between the threshold ratios 
of the nonescape-shock group and the escape- 
shock group, with the former showing the 
higher thresholds. A t of 2.03 (p < .05) is 
obtained for the difference between the thresh- 
old ratios of the nonescape-shock group and 
the nonshock group. But there is a t of only 
.42 (not significant) for the difference between 
the thresholds in the escape-shock and the 
nonshock conditions, although the thresholds 
of the former appear to be lower than those 
of the latter. 
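
The F ratios reported in Table 2 can be checked against its mean squares. The Python sketch below assumes the conventional procedure of dividing each effect mean square by the mean square for individual differences (the error term); the source does not spell this step out. The degrees-of-freedom check uses the 70 Ss in five equal groups described under Method.

```python
# Mean squares as printed in Table 2 (response syllables).
effect_mean_squares = {
    "Shock escapability": 105.44,
    "Shock predictability": 5.12,
    "Interaction, E X P": 6.95,
}
error_mean_square = 14.41  # "Indiv. diff." row of Table 2

# Assumed computation: F = effect mean square / error mean square.
f_ratios = {
    source: round(ms / error_mean_square, 2)
    for source, ms in effect_mean_squares.items()
}

# Error df for four shock groups of equal size: N - number of groups.
subjects_per_group = 70 // 5
n_shock_groups = 4
error_df = n_shock_groups * subjects_per_group - n_shock_groups

if __name__ == "__main__":
    for source, f in f_ratios.items():
        print(f"{source}: F = {f:.2f}")
    print("error df =", error_df)
```

The computed values (7.32, .36, and .48, with 52 error degrees of freedom) agree with the entries in Table 2.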


² [...] was unavoidable for all shock Ss. 


Interpretation of Shock 


Each S who experienced shock during the 
learning task was asked in the interview to 
state his interpretation of the shock. These 
data can be classified in the broad categories 
of punishment, emphasis, and interference. 
The category of punishment includes interpre- 
tations of the shock as punishment for error, 
for failure to respond quickly, for failure to re- 
spond at all—and as nonspecific punishment. 
The category of emphasis refers to the stressing 
of certain syllables. The category of inter- 
ference includes interpretations of the shock 
as deliberate attempts on the part of E to 
distract, disrupt, or interfere with S. 

In order to compare the interpretations of 
the escape-shock Ss with the nonescape- 
shock Ss, the data were divided into two groups 
categorized as “punishment” and “other.” 
The latter category combines interpretations 
of emphasis and interference. The results 
were evaluated by the chi-square method. 
Table 3 indicates that the obtained differences 
are highly significant. The Ss in the learning 
conditions which enabled them to escape from 
the shock tended to interpret the shock as a 
kind of punishment. The Ss of the conditions 
which did not permit escape from the shock 
tended to interpret the shock as something 
other than punishment, and the greatest 
proportion of them felt the shock was a 
deliberate interference by E. 


Learning 


The degree of learning of nonsense syllables 
was measured by the number of correct 
anticipations of the response syllables during 
the twelfth (final) trial. Since the shock was 
not specifically related to correct or incorrect 
responses, these results are not considered to 
be related directly to the recognition thresh- 
olds. Nevertheless, it may be stated paren- 
thetically that the learning task proved to be a 
difficult one for all Ss. The nonshock group 
achieved a higher degree of learning than the 
[...] group. Predictability of shock 
produced only [...]. The obtained differences 
are attributed to the effects of the escapability 
factor. Analysis and discussion of the results 
for the [...] condition will be considered 
in another article. For the present, it 
is noted that the number of trials, which was 
limited by the intensity and the number of 
shocks, yielded a relatively low degree of 
learning in this experiment. 


TABLE 3 
INTERPRETATION OF SHOCK 

[Table body not recoverable.] Chi square = 15.16; p = .001 

* Interference 1; emphasis 8. 
** Interference 18; emphasis 5. 

The learning which is relevant to the 
thresholds in this experiment is concerned with 
learning the “pattern” of shock (i.e., whether 
shock was administered only with certain 
syllables or was given randomly) and with the 
decrease in latency of pronunciation of the 
response syllable. Data concerning the latter 
would provide direct and independent evi- 
dence about the latency of recognition in the 
learning situation. Unfortunately, due to 
practical limitations, E was unable to obtain 
such data concerning the latency of pro- 
nunciation. 

However, estimates of learning of the 
shock “pattern” were obtained. No S learned 
the “pattern” of shock, i.e., no S was able to 
correctly state that the shock accompanied 
only certain syllables or was randomly ad- 
ministered. These estimates were obtained 
from the interview data. 

Qualitative Observations 

Reaction to shock. The range of reactions to 
the repeated administration of shock was 
great. Although relatively few Ss admitted 
feelings of anxiety, sudden, overt, and profuse 
perspiration was a common occurrence. Be- 
havior, such as mispronunciation of the syl- 
lable, frequent clearing of the throat, shuffling 
of feet, and voice tremor, was also common. 

Another kind of behavior which merits 
comment appeared to be a kind of “dissocia- 
tion” from the shock. For example, one S 
held her right arm (electrode was on the right 
wrist) extended in an awkward position at 
shoulder level during the entire period of 
learning. She made no attempt to lower her 
arm and, when questioned, appeared to be 
unaware of this behavior. 

Another factor which emerged from the 
interview data is that every S, without ex- 
ception, expressed the belief that the shock 
was included for a definite purpose and that 
the purpose could not be harmful to S because 
that would not be socially permissible. The 
reaction to the shock was therefore tempered 
by such an attitude. 

It is also noteworthy that several Ss were 
reminded of past experiences of punishment 
when they were shocked. 

The tachistoscopic situation. A noticeable 
feature of the tachistoscopic situation was 
the tendency on the part of a number of Ss 
to react with feelings of frustration. Other 
reactions were feelings of negativeness and 
hostility toward E, which may have been 
related to the administration of the painful 
shock. These feelings generally were expressed 
freely during the interview. 


DISCUSSION 


Predictability of shock. Administering shock 
consistently with the same syllables or in 
random manner did not result in significantly 
different recognition-threshold ratios among 
the groups. However, Ss’ failure to learn the 
“pattern” of shock indicates that the pre- 
dictability of shock was not effectively estab- 
lished. It appears that this experiment did 
not provide an adequate test of the shock- 
predictability factor. 

Escapability from shock. The data from the 
stimulus syllables showed only chance differ- 
ences in thresholds for all groups. The escape- 
shock Ss tended to recognize the response 
syllables relatively more readily in the post- 
learning tachistoscopic situation than the 
nonescape-shock Ss. There appeared to be 
only chance differences between the thresholds 
of the nonshock group and the escape-shock 
group. 

The major factor to which the differences of 
recognition-threshold ratios among the shock 
groups can be attributed is escapability from 
shock. The effect of this factor seems to be 
restricted to the response syllable, and it 
seems to generalize from shock syllables to 
neutral syllables. 

The results can be interpreted in terms of 
reinforcement theory as previously presented, 
or in terms of interference or inhibition due 
to shock. 

The inhibitory quality of strong shock has 
frequently been noted. In this study, the dis- 
turbing characteristic of the strong shock is 
attested by the behavior of Ss during the 
learning, by the interpretations of the shock, 
and by the comments elicited during the 
interview. Moreover, the escape-shock group 
was not free from the inhibitory effects of 
the shock. Although these Ss could terminate 
the shock after it occurred, the shock was 
unavoidable at the moment it was adminis- 
tered. 

It can be argued that where the conditions 
allow no escape, shock has an inhibitory effect 
which shows both in the amount of learning 
accomplished during the 12 trials and also in 
the change in the recognition thresholds. 

Assuming that the extent of inhibition varies 
directly with the duration of shock, the 
escape-shock group has more interference 
with the pronunciation response than the 
nonshock group. The nonescape-shock group 
has the most interference, and the nonshock 
group the least interference. If the recognition 
thresholds vary directly with the extent of 
inhibition, it follows that the nonshock group 
should have the lowest recognition thresholds, 
the escape-shock group should have higher 
thresholds than the nonshock group, and the 
nonescape-shock group should have the 
highest thresholds. This is not the case. The 
nonshock group did not obtain lower thresholds 
than the escape-shock group. 

In terms of reinforcement, it appears that 
the recognition and pronunciation of the 
response syllables should have been more 
rewarding for the escape-shock group than 
for the nonshock group, for the shorter the 
delay in pronunciation the shorter was the 
duration of the shock. The nonshock group 
did not have this source of reward for recog- 
nition. However, the recognition thresholds 
in the escape-shock group need not be lower 
than in the nonshock group, even though 
recognition has been reinforced more in this 
shock group. Although the escape-shock 
group was subject to greater reinforcement, 
it was also subject to more inhibition (due to 
the unavoidable shock) than the nonshock 
group. Thus, any actual differences between 
these groups resulting from reinforcement 
would be cancelled. 

In short, if shock has an inhibitory effect, 
escape from it must be reinforcing to some 
degree or the change in recognition thresholds 
for the escape-shock group would be less than 
that for the nonshock group. Since this is not 
the case, the results can probably be explained 
by the interpretation that escape from shock 
is reinforcing and that this tends to overcome 
the inhibitory effects of shock. If this latter 
interpretation is true, then the general theo- 
retical thesis of the study is supported. 

The results show that only the response 
syllables elicited thresholds which were 
significantly different for the various groups. 
This is not surprising in the light of the low 
degree of learning (i.e., number of correct 
anticipations) achieved by Ss. Although 
both the stimulus and response syllables were 
exposed when shock was administered, the 
instructions and conditions were calculated 
to establish, and apparently resulted in, a set 
to react primarily to the response syllable, 
and the stimulus syllable was largely dis- 
regarded. With a greater degree of learning 
one might expect the effects of shock associated 
with the response syllable to be transferred 
to the stimulus syllable since the latter was a 
cue for the response syllable. 

The failure of the predictable shock groups 
to discriminate between shock and neutral 
response syllables within a given list can be 
attributed to the inadequate number of learn- 
ing trials and the intensity and number of 
shocks. The number of items involved would 
contribute to the difficulty of such discrimina- 
tion since half were neutral and half were 
shock syllables. 

Relation to perceptual defense. Previous
studies have invoked the concept of perceptual
defense as an explanation of the “raised”
recognition thresholds that were obtained (2,
[illegible]). In this experiment, one group which was
subjected to an unavoidable noxious situation
from which there was no escape showed relatively
high recognition thresholds. Can these
values be interpreted as indicating perceptual
defense?

The perceptual defense concept (6) assumes
that the organism responds with defense
reactions to anxiety-provoking stimuli, and
that delayed recognition (inferred
from relatively higher thresholds) is one
such defense. This assumption is fully
[illegible] to the theoretical context of this
study, i.e., responses elicited by the desire
to [illegible] the pain may be considered in
the defense of the organism. It must be
noted, however, that this experiment presented
[illegible] with the same noxious stimulus,
providing some of them with the means of
escaping from the pain, and preventing escape
from the pain by others. If one accepts the 
assumption that noxious stimuli evoke defense 
reactions, it follows that both the relatively 
higher thresholds of the group which was not 
allowed to escape from the pain, and the rela- 
tively lower thresholds of the group which 
could terminate the shock may be considered 
as defense reactions. No valid reason is seen 
for restricting the notion of defense to the 
nonescape Ss. 

In terms of this study, it is not necessary to
conceive of perceptual defense as active 
repression by the organism. On the contrary, 
relatively high thresholds may be considered 
as resulting from a failure to establish a 
response which can effectively decrease the 
anxiety (or pain) associated with the perceived 
stimulus situation. Conversely, stimuli previ- 
ously associated with such anxiety may yield 
relatively low thresholds if recognition has 
been rewarded by escape from the anxiety. 

Other variables. Although the effects of 
frequency of presentation were relatively well 
controlled (i.e., all Ss received the same num- 
ber of learning trials of material with which 
they had had no previous direct experience), 
there was no control of the frequency of letters 
in the initial and final positions in the syllables. 
No quantitative data were obtained in regard 
to this factor, but observation of Ss left a 
definite impression that it was important in 
determining presolution responses. 

The most obvious example was seen in 
reaction to the syllable, XAT. Very frequently 
Ss, after responding correctly to this syllable 
once, expressed doubt concerning their re- 
sponse and changed it at the next exposure 
of the syllable. It is suggested that such re- 
actions are related to the relatively infrequent 
occurrence in the English language of words 
beginning with X. 

Although the ratings indicated that minimal 
adaptation to shock occurred within the groups 
in this experiment, some adaptation appears 
to have occurred, and it may be attributed to 
the fact that attention was not focused on the 
source of the pain (as shown by Ss' behavior 
and comments) during the learning trials as it 
was during the ratings. 

Certain personality characteristics tend to 
result in higher recognition thresholds in a 
tachistoscopic situation regardless of the 
effect of other variables such as shock. In the 
present situation, it was very evident that
individuals who expressed doubt concerning
their accuracy of perception and appeared to 
lack confidence in their ability to accomplish 
the task, or who did not like to guess, were the 
ones with the highest recognition thresholds 
(prelearning thresholds as well as postlearn- 
ing). A strong impression was gained that a 
high positive relationship exists between 
these characteristics and high recognition 
thresholds. 
SUMMARY 


The purpose of this study was to test 
deductions from reinforcement learning theory 
concerning visual recognition thresholds. The 
general hypothesis was that the recognition 
of nonsense syllables previously associated 
with electric shock will occur more readily 
if the experimental conditions enable S to 
escape from the shock than if escape is not 
possible.

The sample consisted of 70 undergraduate 
college students equally divided into five 
groups. Strong shock was administered to 
S while learning a list of paired nonsense 
syllables by the anticipation-prompting
method which necessitated the pronunciation
of all response syllables. Four groups were
used in a factorial design to investigate the
factors of shock “predictability” and “escapability.”
Pronunciation of the response
syllable was followed by cessation of shock 
in the escape-shock group. The nonescape- 
shock group endured the shock for the entire 
duration of the syllable's exposure on the 
memory drum. In the predictable shock group, 
the shock consistently accompanied the same 
syllable-pairs and the remaining syllables were 
free from shock. In the unpredictable shock 
group, shock was administered in random 
order. A nonshock group was also included. 
Recognition thresholds of the same syllables 
were tachistoscopically determined prior to, 
and after, the learning. 

The statistical analysis indicates that the 
factor of escapability from shock was a sig- 
nificant factor in the determination of the 
postlearning-recognition thresholds of the 
response syllables. The escape-shock group 
evidenced relatively lower thresholds than 
the nonescape-shock group. No significant 
differences were found between the thresholds 
of the escape-shock and the nonshock groups. 

Although the inhibitory effects of the strong
shock complicate the situation, these results
are interpreted as tending to support the
proposed hypothesis.

The relationship of the findings to the concept
of perceptual defense and other variables is
discussed.

CONCLUSIONS 


The following conclusions are drawn from 
the results of this study. 

1. It is suggested that deductions based
upon the principles of reward learning theory
can effectively predict differences in visual
recognition thresholds.

2. The results suggest that the factor of 
escapability from shock in a noxious situation 
is a significant determinant of subsequent 
recognition thresholds of stimuli associated 
with that situation. 

3. The experiment did not provide the 
conditions requisite for the adequate investiga- 
tion of the effect of shock “predictability” 
during a learning task upon subsequent 
recognition thresholds of stimuli associated 
with that situation. 


REFERENCES 


1. Bruner, J. S. Personality dynamics and the process
of perceiving. In R. R. Blake & G. V. Ramsey
(Eds.), Perception: an approach to personality.
New York: Ronald, 1951. Pp. 121-147.

2. Bruner, J. S., & Krech, D. Perception and personality:
a symposium. Durham, N. C.: Duke
Univer. Press, 1950.

4. Glaze, J. A. The association value of nonsense
syllables. J. genet. Psychol., 1928, 35, 255-261.

5. Hull, C. L. Principles of behavior. New York:
Appleton-Century, 1943.

6. McGinnies, E. Emotionality and perceptual defense.
Psychol. Rev., 1949, 56, 244-251.

7. McNemar, Q. Psychological statistics. New York:
Wiley, 1949.

8. Mowrer, O. H. Learning theory and personality
dynamics. New York: Ronald, 1950.

9. Postman, L. Toward a general theory of cognition.
In J. H. Rohrer & M. Sherif (Eds.), Social psychology
at the crossroads. New York: Harper,
1951. Pp. 242-272.

10. [illegible] 1951, 58, 271-284.

11. Snedecor, G. W. Statistical methods. Ames: Iowa
State Coll. Press, 1946.

12. Solomon, R. L., & Howes, D. H. Word frequency,
personal values, and visual duration thresholds.
Psychol. Rev., 1951, 58, 256-270.


Received September 20, 1952. 


THE LACK OF GENERALITY IN DEFENSE MECHANISMS AS
INDICATED IN AUDITORY PERCEPTION¹


SHABSE H. KURLAND²
Columbia University


On the basis of the many studies of
perceptual behavior which have appeared
in the literature (2), one can
conclude that the “emotionality” attached to
a stimulus may have a significant effect on the
recognition threshold at which the name of a
stimulus object can be correctly reported.
Thus for some subjects (Ss) the threshold is
lowered (vigilant behavior) and for others it
is raised (defensive behavior) as a function of
the “emotionality” of the stimulus (3). One
theory suggested to account for this observation
holds that the direction of change in
threshold is a function of the perceiver’s adjusting
to the anxiety aroused by the stimulus
objects. It has been further proposed that this
adjustive behavior can be related significantly
to “defense” mechanisms characteristic of an
individual's behavior in nonperceptual situations.
From this line of reasoning it has been
deduced that those Ss who typically use intellectualizing
defenses (obsessive-compulsive)
and are thereby able to cope freely with emotional
stimuli should show vigilant perceptual
behavior; and those Ss who use repression
(hysterical) as the typical defense should continue
to avoid anxiety-arousing stimuli and
react with defensive perceptual behavior (7).
Lazarus, Eriksen, and Fonda, in testing this
hypothesis, found that such a relationship
does occur and that patients with intellectualizing
mechanisms perceive threatening material
with significantly greater accuracy than those
with repressing mechanisms (7).
1 This study is based on a doctoral dissertation completed
at Columbia University. The writer wishes to
[text missing] Meeting of the American Psychological Association,
Washington, D. C., September 1952.

2 Now at Veterans Administration Regional Office,
Baltimore, Maryland.

The logic of their reasoning implies that S
uses one typical kind of defense mechanism
whenever he is confronted with anxiety re- 
gardless of the nature of the situation or the 
source of the threat. In other words, knowing 
an S's adaptive response to anxiety in one 
situation, one can predict his response in other 
anxiety-arousing situations. Evidence from 
both the clinical study of patients (10) and 
from experiments calls this generalization into
question. For example, Goldstein, in studying
the consistency of defense mechanisms in
normal Ss, found that while there was a small 
group of Ss who showed such a consistency, 
there was a much larger number who used 
different mechanisms in relation to different 
impulses (4). Belmont and Birch, who studied 
learning, recall, and relearning using nonsense 
syllables some of which had been associated 
with shock, found no consistency in Ss' use 
of repression in the various tasks employed (1). 
It would therefore seem worth while to re- 
examine the hypothesis of a relationship be- 
tween Ss' perceptual mechanisms when they 
are confronted with anxiety-arousing stimuli 
and the type of mechanism they typically use 
to cope with anxiety in nonperceptual situ- 
ations. 

If the assertion that defense mechanisms 
are general is valid, then one would expect the 
perceptual behavior of patients characterized 
by their therapists as using obsessive-compul- 
sive mechanisms (intellectualization and simi- 
lar defenses) to differ from the perceptual 
behavior of patients who are characterized as 
using hysterical mechanisms (repression and
avoidance). Specifically, the obsessive-compulsive
group should tend to perceive emotional
words at lower intensities than the
hysterical group.

METHOD

Stimulus words. About 150 words selected from the
literature on perception and word association were
given to six judges to be sorted into three categories
(neutral, emotional, and undecided) on the basis of
their conception of how a group of neurotics would
react to them. By this operation, 24 neutral and 16
emotional words on which there was unanimous agreement
and eight emotional words on which five judges
were in agreement and one was doubtful were obtained.
The 48 words were then divided into three lists. Each
consisted of eight neutral and eight emotional words,
tape recorded at intervals of three seconds. Using this
master recording, it was possible to replay each list 22
times, decreasing the attenuation first in 11 one-decibel
steps and then in 11 two-decibel steps. Thus, a record
was made on which each of the three lists was repeated
identically 22 times at increasing degrees of loudness,
starting from a level where the words were not audible
to a point where they were clearly perceptible. Recognition
thresholds for each word were thereby obtained
by an ascending method of limits.

TABLE 1
NEUTRAL AND EMOTIONAL STIMULUS WORDS USED

NEUTRAL STIMULI    EMOTIONAL STIMULI
Robin              Kill
Stone              Raped
Paper              Agony
Curtain            Failure
Music              Death*
Broom              Vagina
White              Intercourse
Table              Suicide
Chair              Cheat
Trade              Breast*
Shoe               Crazy
Rug                Homosexual*
Dog                Kiss
Grass              Worry
Book               Orgasm
House              Blame
Radiator           Mourn*
Hat                Flunk
Ocean              Fight
Bell               Penis
Farm               Love
Shade              Blood
Lamp               Anger*
Pencil             Sex

* Words on which there was 83 per cent agreement among
judges. On all other words there was 100 per cent agreement.

TABLE 2
THE THREE LISTS OF STIMULUS WORDS

Robin          Cheat          Mourn*
Kill           Breast         Radiator
Stone          Chair          Flunk
Raped          Crazy          Hat*
Agony          Trade          Fight
Paper          Homosexual     Ocean
Curtain*       Shoe           Bell
Music          Rug            Penis
Failure        Dog            Farm
Broom*         Kiss           Love
Death*         Grass          Shade
Vagina         Worry          Blood
White          Orgasm         Lamp
Table          Book           Anger
Intercourse    House          Pencil
Suicide        Blame*         Sex

* Omitted from calculation of results for all Ss since they were
frequently reported as not heard or incorrectly heard.

Subjects. A patient group consisting of 37 persons
was selected.³ Fifteen were psychiatrically judged as
using predominantly repressive mechanisms and 22 as
using obsessive-compulsive mechanisms for handling
anxiety. The therapist who was seeing the patient for
intensive therapy three times weekly made the decision
whether or not the patient satisfied the criterion. A
nonpatient group of 21 individuals who had never had
any psychiatric or psychological treatment and were
apparently functioning satisfactorily was also tested.
As indicated in Table 3, there were no significant differences
between patient and normal groups with respect
to age, sex, intelligence, or education.

TABLE 3
THE MATCHING OF PATIENT AND NORMAL GROUPS*

VARIABLE                     HYSTERICS     OBSESSIVE-COMPULSIVES   COMBINED PATIENTS   NORMALS
Sex
  Males                      4             14                      18                  9
  Females                    11            8                       19                  12
Median age                   [illegible]   26                      27                  28
Median IQ                    110           115                     113                 119
Median years of education    12            13                      12                  12

* [illegible] chi-square test was employed [illegible]
of statistical significance, sex must be regarded as essentially
uncontrolled.

Procedure. The patients reported to the psychology
office of the hospital and were asked to participate in a
psychological test which had been recorded on a tape.
They were asked to repeat just what they heard. The
normal Ss were told that the examiner (E) was working
on a research project in clinical psychology. After
the introduction, the procedure was identical for all
Ss, and E's instructions to report what was heard and
to guess if the sounds were not clearly perceptible were
repeated on the tape of stimulus words. The patient
listened to the recording with a pair of earphones. Between
the first and second and the second and third
lists, a short rest period was given. Using a stopwatch
and a prepared time schedule, E always knew to what
word S was responding or failing to respond. All responses
were recorded.
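The presentation schedule described under Stimulus words (22 replays of each list, with the attenuation reduced first in 11 one-decibel steps and then in 11 two-decibel steps) can be sketched as follows. This is only an illustration: the function name is hypothetical, and it assumes that the attenuation is reduced by one step before each replay, which the text does not state explicitly.

```python
def presentation_gains(one_db_steps=11, two_db_steps=11):
    """Cumulative gain in dB, relative to the starting attenuation, for each replay.

    Assumption (not stated in the paper): one step of reduced attenuation
    precedes every replay, so the first replay is already 1 dB louder than
    the starting level.
    """
    gains = []
    gain = 0
    for step in [1] * one_db_steps + [2] * two_db_steps:
        gain += step          # reduce attenuation, i.e. raise loudness
        gains.append(gain)
    return gains


if __name__ == "__main__":
    schedule = presentation_gains()
    print(len(schedule), schedule)
```

Under this assumption the 22 replays span a total range of 33 decibels; if the first replay were instead played at the starting level, the range would be correspondingly smaller.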


3 The patients were all hospitalized in Hillside Hospital,
Glen Oaks, Long Island, New York. The author
wishes to acknowledge his debt of gratitude to Dr.
Joseph S. A. Miller, Medical Director of the hospital,
and Milton S. Gurvitz, Chief Psychologist, and to the
other staff members who cooperated in this project.


At the conclusion of the auditory experiment, the
Hillside Short Form of the Wechsler-Bellevue Intelligence
Scale was administered (5). Other relevant data
such as age and education were also obtained at this
time.

RESULTS

The criterion threshold values used in computation
are based on the decibel level at which
S made a correct response to the stimulus
word twice successively, the lower of the two
values being selected.⁴ For each S a perceptual
score was then computed by subtracting the
median threshold value for S's neutral words
from the median threshold value for his emotional
words. To control for the variations in
range of recognition thresholds among Ss,
this difference in medians was divided by the
average of the interquartile ranges for the
neutral and emotional word lists. The perceptual
score, because of this use of the difference
in medians for critical and neutral
lists, does not reflect individual differences in
auditory acuity, attention, or similar factors
which may raise or lower the threshold independently
of the nature of the stimulus word.
Since the perceptual scores of one group of Ss
are compared with those of a different group,
it was not necessary to equate the two word
lists in terms of word length, word frequency,
or phonetic structure. These factors are constants
for both groups and cannot, therefore,
contribute to any difference in perceptual
scores which may exist for the two groups.
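The scoring rule just described can be written out as a short Python sketch. It is an illustration under stated assumptions, not the author's computing procedure: the function names are hypothetical, the levels are assumed to be listed in ascending order of loudness, and the quartiles are taken with the default method of the standard `statistics.quantiles` function, since the paper does not say how the interquartile range was computed.

```python
from statistics import median, quantiles


def criterion_threshold(levels, correct):
    """Level at which S first responds correctly on two successive presentations.

    levels  : presentation levels in dB, in order of presentation (assumed ascending)
    correct : booleans, True where the word was reported correctly
    Returns the lower of the two levels, or None if the criterion is never met.
    """
    for i in range(len(levels) - 1):
        if correct[i] and correct[i + 1]:
            return min(levels[i], levels[i + 1])
    return None


def iqr(values):
    """Interquartile range (assumption: default 'exclusive' quartile method)."""
    q1, _, q3 = quantiles(values, n=4)
    return q3 - q1


def perceptual_score(neutral, emotional):
    """(median emotional - median neutral) / mean of the two interquartile ranges."""
    spread = (iqr(neutral) + iqr(emotional)) / 2
    return (median(emotional) - median(neutral)) / spread


if __name__ == "__main__":
    # Hypothetical thresholds for one S, for illustration only.
    neutral = [10, 12, 14, 16, 18, 20, 22, 24]
    emotional = [8, 10, 12, 14, 16, 18, 20, 22]
    print(perceptual_score(neutral, emotional))
```

A negative score means that the emotional words were recognized at lower levels than the neutral words (vigilance), and a positive score the reverse (defense). The sketch does not guard against a zero interquartile range, which would have to be handled separately.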
It was hypothesized that if there is generality
of application of defense mechanisms,
then the perceptual scores of the obsessive-compulsive
group would be lower than those
of the hysterical group. Using the Mann-Whitney
U test for data not normally distributed
(8), the null hypothesis of no difference
in perceptual scores for the two patient
groups cannot be rejected. Also, there are no
significant relationships between perceptual
scores and sex (run test⁸), age, intelligence, or
education (rank-order correlation, Table 4).
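For readers unfamiliar with the Mann-Whitney U test, the statistic can be computed by direct pair counting, as in the sketch below. The paper gives no computational details, so this is a generic textbook form rather than the author's procedure; the function names are hypothetical, and the normal approximation shown omits any correction for ties.

```python
from math import sqrt


def mann_whitney_u(x, y):
    """Return (U_x, U_y), where U_x counts the pairs in which an x score
    exceeds a y score (ties count one half) and U_x + U_y = len(x) * len(y)."""
    u_x = 0.0
    for xi in x:
        for yj in y:
            if xi > yj:
                u_x += 1.0
            elif xi == yj:
                u_x += 0.5
    u_y = len(x) * len(y) - u_x
    return u_x, u_y


def normal_approx_z(u, n1, n2):
    """Large-sample z for U (no tie correction; an approximation only)."""
    mean_u = n1 * n2 / 2
    sd_u = sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    return (u - mean_u) / sd_u


if __name__ == "__main__":
    # Hypothetical perceptual scores for two small groups, for illustration only.
    group_a = [-0.4, -0.1, 0.0, 0.3]
    group_b = [-0.5, -0.3, -0.2, 0.1, 0.2]
    u_a, u_b = mann_whitney_u(group_a, group_b)
    print(u_a, u_b, normal_approx_z(min(u_a, u_b), len(group_a), len(group_b)))
```

With groups of 15 and 22 patients, as in this study, the normal approximation would ordinarily be considered adequate; for very small groups exact tables of U are used instead.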


4 A table of threshold values for each S has been
deposited with the American Documentation Institute.
Order Document No. 3982 from the ADI Auxiliary
Publications Project, Photoduplication Service, Library
of Congress, Washington, D. C., remitting in advance
$1.25 for photoprints or $1.25 [illegible].
Make checks payable to Chief, Photoduplication
Service, Library of Congress.


These negative results suggest a lack of re- 
lationship both between obsessive-compulsive 
mechanisms and vigilant perceptual behavior 
in the auditory mode and between hysterical 
defenses and repressive perceptual responses, 
at least to auditory stimuli. The question
remains, however, whether or not patients 
differ in their perceptual behavior from Ss who 
are not psychiatrically deviated. If the patients 
are characterized by a greater degree of anxiety 
than the “normals,” and if vigilance and de- 
fense are common forms of coping with anx- 
iety, then the patients should show a greater 
incidence of these forms of perceptual response 
than the nonpsychiatric Ss. 

To attack this problem, the two patient 
groups, since they did not differ in their per- 
ceptual scores, were combined and compared 
with the normal group. Neither the Moses
test (9)⁷ nor the run test (8)⁸ demonstrates
that the patients react vigilantly or defensively 
with greater magnitude than the normals. 

The perceptual scores, however, are significantly
lower (p < .01) for the patients
than for the normals, thus indicating that the 
hysterical as well as the obsessive-compulsive 
patients were more vigilant in their auditory 
perceptual reaction to the emotional words 
than were the normal Ss. 

6 The actual medians of the perceptual scores for
the various groups were as follows: obsessive-compulsive
patients, -.08; hysterical patients, -.27;
normal Ss, .00. Thus, the obtained median of the
threshold intensities for the hysterical group is lower
than that for the obsessive-compulsive group, a difference
in the direction opposite to that which was
hypothesized. By sign test logic, therefore, the results
obtained in this experiment cannot be accumulated with
those of Lazarus, Eriksen, and Fonda (7) to support
the idea of generality in defense mechanisms; they are
actually opposed to the hypothesis tested.

7 The Two-Sample Test or Moses test is designed to
test the null hypothesis that two groups come from a
common population against the alternative that the
scores in one are congested at both the upper and
lower extremes of the distribution of scores. The test
is nonparametric and can reject the null hypothesis
even if the means of the two groups do not differ.

8 The run test is used to determine if two groups are
drawn from a common population and can reject the
null hypothesis no matter how the populations differ.
Since it is sensitive to all kinds of differences (dispersion,
median, skewness, etc.), it is not powerful against
any single alternative.

TABLE 4
RANK-ORDER CORRELATIONS FOR PERCEPTUAL SCORES
WITH AGE, INTELLIGENCE, AND EDUCATION

VARIABLE    HYSTERICS    OBSESSIVE-COMPULSIVES    NORMALS
[Row labels and part of the entries are illegible; the surviving
entries, row by row, are: 04, 35; 04, [illegible]; .34, .21.]

A comparison of the recognition thresholds
for each of the emotional words for the patient
and normal groups was also made. First, individual
variation in acuity among Ss was controlled
by expressing the thresholds of each
S for each emotional word as a deviation from
his own median threshold on the neutral list.
The corrected values were then compared for
the two groups for each word (Mann-Whitney
U test). Seven of the 21 emotional words
showed a significant difference (p < .02) in
thresholds between the combined patient
group and the normal group (agony, suicide,
worry, homosexual, love, orgasm, and vagina).
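The acuity correction used in this word-by-word comparison amounts to subtracting each S's median neutral threshold from each of his emotional-word thresholds. A minimal sketch follows; the function name and the data layout (a dictionary from word to threshold) are assumptions made for illustration and do not come from the paper.

```python
from statistics import median


def corrected_word_thresholds(emotional_by_word, neutral_thresholds):
    """Express each emotional-word threshold as a deviation from the
    S's own median threshold on the neutral words."""
    baseline = median(neutral_thresholds)
    return {word: value - baseline for word, value in emotional_by_word.items()}


if __name__ == "__main__":
    # Hypothetical thresholds (dB) for one S, for illustration only.
    print(corrected_word_thresholds({"agony": 12, "love": 20}, [10, 14, 16]))
```

The corrected values for a given word can then be collected across Ss in each group and compared with a two-sample rank test, as the text describes.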

The effect of age, education, and intelligence 
upon the recognition thresholds for the normal 
Ss was studied, and no significant relationships 
were found. Normal female Ss, however, 
showed both vigilant and defensive perceptual 
behavior of greater magnitude than normal 
male Ss.


DISCUSSION 


From these results, there is no support for
the hypothesis that membership in obsessive-compulsive
or hysterical categories is related to
vigilant and defensive modes of perception in
the auditory mode. That such a phenomenon
as vigilant perception does occur, however, 
is indicated by the fact that the median thresh- 
olds of our Ss are significantly higher in the 
normal group than in the patient group. Since 
these two groups are equated for sex, intel- 
ligence, education, and age, and since the 
statistical procedures controlled for variations 
in acuity, the obtained difference may be at- 
tributed to the complex of factors involved 
in an emotional disturbance severe enough to 
require hospitalization. This result was not 
predicted, however, so this suggested rationale 
amounts to only an untested hypothesis. 

Although the data give no support to the 
hypothesis of generality in the application of 
defense mechanisms, they may be analyzed 
in another fashion to eliminate the possibility
that a more limited type of relationship between
adjustive mechanisms in perceptual
and nonperceptual situations may obtain.
The stimuli in the emotional word list represent
a variety of areas of anxiety. It may be
that S uses a generally characteristic response
to anxiety dealing with words representing
one particular area of anxiety, whereas with
words from other areas, he may use other types
of perceptual mechanisms.

To explore this possibility, those emotional
words which could be so classified were divided
into three specific areas: sex, aggression, and
social failure.⁹ The median recognition thresholds,
after correction for acuity, were calculated
for each patient in the two groups for
each of the three areas. Again, the two groups
do not differ significantly in their recognition
thresholds for any of the three groups of
words, indicating that in this experiment there
was no relationship between membership in
the obsessive-compulsive or hysterical group
and the auditory perceptual behavior studied.
Thus, the present study supports the notion
of specificity rather than generality of defense
mechanisms.

One possible rationale to explain the finding
that the combined patient group was more
rapid ("vigilant") in its perceptual reaction
to the emotional words is based on word frequency.
It is known that there is a significant
relationship between recognition thresholds
and the frequency or popularity of a stimulus:
the higher the frequency, the lower the threshold
(6). Since the patient groups were being
seen for intensive psychotherapy and were
living in a hospital environment, the frequency
of emotional stimuli may have been artificially
raised. No such factor was operating in the
nonpatient group. This greater exposure to
emotional words could result in a lowering of
the thresholds for the patients and thereby
account for the finding of vigilant behavior
in this group.

An alternative explanation is that patients,
without regard to diagnostic category, typically
show vigilant behavior as an adjustive response
to the anxiety that may be associated with
auditory stimuli. Whether this explanation is
more reasonable than one based on frequency
cannot be determined without additional
study.

9 The words in each area were as follows: sex:
vagina, intercourse, breast, homosexual, kiss, orgasm,
penis, love, and sex; aggression: kill, fight, blood, and
anger; social failure: failure and flunk.

In view of the conflicting results in the
literature on perceptual defense, it might be
worthwhile to attempt to resolve the problem
of the relationship between defense mechanisms
in perceptual and nonperceptual situations
by using another approach. It would
seem that a basic requirement in continuing
this line of investigation is to determine if
vigilant or defensive perceptual behavior is 
characteristic of certain individuals. If it is 
possible to form groups of vigilant and de- 
fensive perceivers, then one can investigate the 
personality correlates of such behavior and its 
situational and historical antecedents. From 
the obtained results it would be possible to for- 
mulate hypotheses to account for the interre- 
lationships of perception and other variables on 
a sounder basis than has been advanced so far. 


SUMMARY 


Recent studies of perceptual behavior have
demonstrated that there is a change in recognition
threshold as a function of the “emotionality”
attached to a stimulus. It has been
proposed that the change in threshold is a
function of the type of mechanisms used to
cope with anxiety generally. Since there is
reason to doubt the generality of this assertion,
this study was undertaken to test the
hypothesis that those patients who use intellectualization
as the preferred type of mechanism
to handle anxiety will perceive emotional
words at lower thresholds than those patients
who use predominantly repressive mechanisms.

Twenty-two hospitalized patients using
obsessive-compulsive mechanisms, 15 patients
using hysterical mechanisms, and 21 normal
Ss were presented with neutral and emotional
words which had been recorded on a magnetic
tape with increasing degrees of loudness so
that the recognition thresholds could be obtained
using an ascending method of limits.

There was no difference in the perceptual
recognition thresholds for the two groups of
patients. The combined patient groups, how-
ever, perceived the emotional words at sig- 
nificantly lower thresholds than the normal 
Ss. It is concluded that further study is neces- 
sary before one can accept with confidence 
the assertion that an S will respond to anxiety 
in a perceptual situation with the same type 
of defense that he employs in a nonperceptual 
situation. 


REFERENCES 


1. Belmont, L., & Birch, H. G. Re-individualizing
the repression hypothesis. J. abnorm. soc.
Psychol., 1951, 46, 226-235.

2. Blake, R. R., & Ramsey, G. V. (Eds.) Perception:
an approach to personality. New York: Ronald,
1951.

3. Bruner, J. S., & Postman, L. Emotional selectivity
in perception and reaction. J. Pers., 1947, 16,
69-77.

4. Goldstein, S. A projective study of psychoanalytic
mechanisms of defense. Paper read at Amer.
Psychol. Ass., Washington, D. C., September,
1952.

5. Gurvitz, M. S. The Hillside Short Form of the
Wechsler-Bellevue. J. clin. Psychol., 1951, 7,
131-134.

6. Howes, D. H., & Solomon, R. L. Visual duration
threshold as a function of word-probability. J.
exp. Psychol., 1951, 41, 401-410.

7. Lazarus, R. S., Eriksen, C. W., & Fonda, C. P.
Personality dynamics and auditory perceptual
recognition. J. Pers., 1951, 19, 471-482.

8. Moses, L. E. Non-parametric statistics for psychological
research. Psychol. Bull., 1952, 49, 122-143.

9. Moses, L. E. A two sample test. Psychometrika,
1952, 17, 239-247.

10. Staveren, H. Suggested specificity of certain
dynamisms in a case of schizophrenia. Psychiatry,
1947, 10, 127-135.


Received October 21, 1952. 


THREAT-EXPECTANCY, WORD FREQUENCIES, AND PERCEPTUAL
PRERECOGNITION HYPOTHESES¹


EMORY L. COWEN²
University of Rochester

AND

ERNST G. BEIER
University of Utah


Considerable research interest in recent
years has centered around the study
of the influence of threat on perception.
Though patent conclusions are by no
means available, the fruitfulness of early
work is amply demonstrated in the burgeoning
of research and theoretical contributions in
this area by clinically as well as experimentally
oriented psychologists. Behind this state of
affairs lies a growing recognition of the need
for a unified psychology (7), a need which has
found expression more frequently and more
specifically in terms of the desirability of a
greater rapprochement between experimentally
oriented and clinically oriented psychologists.
The study of perception, as Werner (19) and
Howie (6) have pointed out, provides an ideal
common meeting ground for these heretofore
largely "isolated disciplines." When, in addition
to this, we consider an important trend
within the field of clinical psychology toward
fuller utilization of traditional scientific
methodology in testing clinical hypotheses
(14), the recent widespread interest in the
study of the motivational aspects of perception
seems readily understandable.

Investigations to date, by Shafer and
Murphy (16), Postman and Bruner (13),
Eriksen (3), Rosenstock (15), and others, have
led to a tentative generalization that threat
tends to disrupt both the accuracy and speed
of perceptual report. These studies have,
individually and collectively, evoked stimulating
criticism (6, 9, 12) with respect to explanatory
theory as well as methodology.

1 This investigation was supported in part by a research
grant to the senior author from the National
Institute of Mental Health of the National Institutes
of Health, Public Health Service. The specific study
reported here is the second in a series of papers deriving
from a broader research program in the area of
sociopsychological and personality correlates of psychological
rigidity.

2 Portions of this paper were presented to the Division
of Personality and Social Psychology of the
American Psychological Association at the meeting in
Washington, D. C., September 1952.



Even more controversial, perhaps, has been 
McGinnies’ attempt to demonstrate the 
existence of autonomic discrimination of threat 
before perceptual report in terms of certain 
objective and qualitative indices of “defense” 
(11). McGinnies’ study has been singled out
for criticism at both the experimental and the 
theoretical level by Howes and Solomon (5) 
and Solomon and Howes (17). The objections 
which have been advanced may be summarized 
primarily on two major grounds: first, the 
possible intervention of a conscious inhibition 
between initial perception and report of per- 
ception, and second, the fact that the results 
may perhaps be accounted for more parsi- 
moniously by a word-frequency hypothesis 
(i.e., the selected neutral words generally are 
more common ones; hence, seeing them more
rapidly is interpreted as a function of the 
subject's [S’s] greater past experience with 
them). 

Recently, several research inquiries have
been directed at the criticisms raised by
Howes and Solomon (5, 17). McCleary and
Lazarus (10) and Lazarus and McCleary (8),
using a conditioning technique with nonsense
syllables in order to control for the factors of
differential word frequencies and inhibition
of report of threat words, found a significantly
higher prerecognition GSR deflection for
shocked than for nonshocked syllables. From
these data, which are generally consistent
with the earlier McGinnies findings, they
conclude that the autonomic responses indicate
that discrimination is occurring before conscious
recognition, a process which they refer
to as “subception.”

Cowen and Beier (2) and Beier and Cowen
(1) have used a threat-expectancy situation
(i.e., alerting Ss beforehand to the imminent
exposure to threat words) in order to reduce
the likelihood of inhibition of report. Under
these conditions they have demonstrated that
Ss tend to require a greater number of trials
and more time for correct report of threat
words than neutral ones.


THREAT, WORD FREQUENCIES, AND RECOGNITION 


The specific purpose of the present investiga- 
tion is to determine whether delayed perceptual 
report of threatening words will occur under 
conditions of threat-expectancy in an experi- 
mental setup which allows us to examine more 
precisely the operation of possible conscious 
inhibition as well as differential word
frequencies.


METHOD 


Fifty-nine college undergraduates, 29 male and 30 
female, served as Ss in the present investigation. The 
Ss were tested individually by one of three examiners 
(Es) (two male and one female). The E began the test- 
ing session by reading a list of 71 five-letter words to 
S. Twenty-three of these words were prejudged to be 
more or less threatening in our culture, while the re- 
maining 48 were thought to be essentially neutral. 
The S was informed that he would be asked to decipher
some of these words later.

The Ss were then shown a series of booklets, each
consisting of 30 carbon copies of a single typed word (in
capital letters). The 30 copies, which had been typed 
on an electric typewriter, were arranged in order from 
the most blurred to the clearest copy. This technique 
is described more fully elsewhere (2, 20). After two 
practice trials using the words PAPER and NIGHT, 
Ss were asked to decipher eight neutral and eight threat 
words chosen from the initial reading list and presented 
one by one in a randomly selected but fixed order, as 
follows: MAGIC, DIRTY,* TULIP, WHORE,* 
TABLE, PENIS,* CANDY, RAPED,* SUGAR, 
CHEAT,* BREAD, DEATH,* URINE,* PLANT, 
STORE, BITCH.* (Asterisks indicate threat words.) 

In the administration of the test series of words,
E did not permit Ss to spend more than three seconds
on any given copy of any of the words. The E recorded
the number of trials and total time required for correct
report of each word. Guessing was encouraged, and
all guesses made prior to accurate report of the
stimulus word were recorded.


RESULTS AND DISCUSSION


Perception of threat and neutral words. The
first datum to be examined is the total number
of trials required for accurate report of threat
words as compared to neutral ones. For each
S, a difference score was computed based on
the mean difference in number of trials required
for accurate report of the two types of words:
a positive difference indicates that more trials
were required for the eight threat words than
the eight neutral ones; a negative difference
indicates the opposite. Fifty-three of the 59 Ss
were found to have positive difference scores.
For the group as a whole, the obtained mean
difference score was +4.049 trials. In examining
the departure of this value from a hypothetical
mean difference score of zero (which
we would predict if our findings were to be
explained solely on the basis of chance factors),
the computed t ratio is 9.38, which is significant
at beyond the .001 level. An analysis of our
data in terms of time scores indicates essentially
similar results. Since in an earlier study
(2), a high correlation was found between
trials and time, we cannot regard the latter
measure as an independent one.
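
The test against a hypothetical mean difference of zero can be sketched as a one-sample t ratio computed on the per-subject difference scores. The snippet below is only an illustration under stated assumptions, not the authors' computation: the function names and the five example scores are hypothetical, and only the mean (+4.049), the t ratio (9.38), and the N (59) come from the text. The second helper back-calculates the standard deviation that those three reported values would imply.

```python
import math
from statistics import mean, stdev


def one_sample_t(diffs, mu0=0.0):
    """t ratio for the mean of per-subject difference scores against mu0."""
    n = len(diffs)
    standard_error = stdev(diffs) / math.sqrt(n)  # stdev divides by n - 1
    return (mean(diffs) - mu0) / standard_error


def implied_sd(mean_diff, t_ratio, n):
    """Standard deviation implied by a reported mean, t ratio, and N."""
    return mean_diff * math.sqrt(n) / t_ratio


# Hypothetical difference scores (threat minus neutral trials) for five Ss.
example_diffs = [1.0, 2.0, 3.0, 4.0, 5.0]
print(round(one_sample_t(example_diffs), 3))

# Values reported in the text: mean = +4.049, t = 9.38, N = 59.
print(round(implied_sd(4.049, 9.38, 59), 2))
```

Under these assumptions, the reported figures imply a standard deviation of roughly 3.3 trials for the difference scores.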

Conscious inhibition. Several checks were 
made to study the possible influence of con- 
scious inhibition on the above findings. Post- 
test inquiry with this relatively sophisticated 
group revealed few admitted instances of 
deliberate inhibition. In addition to this, 
however, two statistical checks, based on an 
antecedent logical analysis, were utilized. It 
was reasoned that since the threatening words, 
to a considerable extent, were sexually oriented,
if conscious inhibition were operating one might 
expect female Ss to show it more than male Ss. 
A t ratio based on mean difference between
threat and neutral words for male and female 
Ss indicated no differences between the groups. 
As a matter of fact, male Ss actually showed 
a somewhat larger mean difference between 
threat and neutral words, but this difference 
did not approach statistical significance. 

Another way of approaching this same prob- 
lem derives from the assumption that Ss' sex 
per se is not the crucial factor in conscious 
inhibition. Instead, it can be argued that the 
S-E constellation is a more basic determinant 
of whether or not conscious inhibition of 
response will occur. From this line of reasoning, 
one would hypothesize that Ss who were 
examined by Es of the opposite sex would show 
a greater discrepancy in perceptual response 
between threat and neutral words than Ss 
examined by Es of the same sex. To study this 
variable, the pooled records of male Ss with 
female Es and female Ss with male Es were 
compared to the pooled records of male Ss 
with male Es and female Ss with female Es. 
Once again the obtained t ratios indicated no
significant differences between the groups, nor 
were any major trends toward significance 
observable. 

Word frequencies. A second major criticism
which has been directed toward this type of
study is that the “threat” words have commonly
been of lower word frequency than
control words, and for this reason, slower
response could be explained on the basis of
the S's being less familiar with them. Part of
the logic of a threat-expectancy situation is
that by reading a list of words to Ss before the
basic perceptual experiment, actual discrepancies
in word frequencies will tend to be, at
least situationally, reduced. This of course is
just an assumption, one which the present
Es thought desirable to check.

In order to do this, frequencies were obtained
from the Thorndike-Lorge word-frequency list 
(18) for each of the 16 test words used. Two 
of the threat words which did not appear in 
the list were assigned arbitrary word fre- 
quencies of one. Next, the average number of 
trials required for each word by the group as 
a whole was determined. The range here was 
from a mean of 7.7 trials for BREAD to a 
mean of 22.5 trials for WHORE. If word 
frequency is a crucial determinant of per- 
ceptual response under the present experi- 
mental conditions, one would hypothesize a 
significant negative correlation between it and 
average number of trials required for correct 
report of the test words. The obtained Pearson 
r for this relationship was +.003. Since the 
suggestion has been advanced (5) that a truer 
picture of the relationship may be obtained 
by using the logarithm of the word frequency, 
this was also done. The resulting nonsignificant 
correlation coefficient was —.08. 
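
The two correlations described above (trials against raw word frequency, and trials against the logarithm of word frequency) can be sketched as follows. This is a minimal illustration only: the function names and the three data points are hypothetical, and the per-word Thorndike-Lorge frequencies are not given in the text. Note that a word assigned the arbitrary frequency of one contributes a log frequency of zero.

```python
import math


def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def frequency_correlations(freqs, mean_trials):
    """Correlate mean trials with raw and with log10 word frequency."""
    raw = pearson_r(freqs, mean_trials)
    logged = pearson_r([math.log10(f) for f in freqs], mean_trials)
    return raw, logged


# Hypothetical word frequencies and mean trials (not the study's data).
raw_r, log_r = frequency_correlations([1, 10, 100], [3.0, 2.0, 1.0])
print(round(raw_r, 3), round(log_r, 3))
```

A word-frequency explanation would predict clearly negative values from both computations; the coefficients reported in the text are both close to zero.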

Prerecognition hypotheses. Several investi- 
gators have presented data, either objective 
or qualitative, bearing on the prerecognition 
hypotheses of their Ss (3, 11, 13). Such data 
are seemingly useful in attempting to infer 
differential reactions of Ss to threatening and 
neutral words before they are correctly reported.

In the present study, a total of 1,133 prerecognition
guesses were recorded, 507 for
neutral words and 626 for threat words. This
difference, while statistically significant, disappears
when the variable of number of trials
for threat and neutral words is partialled
out.

Each prerecognition hypothesis was classified
independently by two judges3 in terms of
five content categories, as follows:


1. Structural: Responses containing three or four
letters of the stimulus word in their proper location
(e.g., place or plane for plant).

2. Meaningless: Responses without a dictionary
meaning (e.g., bryny for dirty or caleba for table).

3. Neutral: Meaningful words, essentially nonthreatening
in nature (e.g., plate for death or wagon for
magic).

4. Emotional: Meaningful aggressive, escape, or
sexual responses (e.g., breast for cheat or phobia for
penis).

5. Marginal: A wastebasket category for borderline
hypotheses, not clearly classifiable either as neutral
or emotional (e.g., screw for cheat or fairy for penis).4

3 The authors wish to express their appreciation to
Judith Hess of the University of Rochester for her
contribution to the analysis of the prerecognition
hypotheses.


Agreement on classification between the
two judges exceeded 94 per cent. Most of the
disagreements occurred in assignments to the
marginal category. Following the original
independent ratings, in order to facilitate subsequent
statistical analysis, the judges discussed
items for which agreement had been
lacking, arriving thereby at a final assignment
of each item to one of the five content
categories. Of the total of 1,133 responses, 28.3
per cent were classified structural, 2.4 per cent
meaningless, 49.6 per cent neutral, 11.7 per
cent emotional, and 8.0 per cent marginal. An
over-all chi-square test, comparing the frequencies
of prerecognition guesses in the five
content categories for threat and neutral
stimulus words, indicates that the observed
frequencies are close enough to the expected
frequencies to be accounted for solely on the
basis of chance variation. This analysis was
extended so that each of the five individual
content categories could be examined against
all others. Here, significantly more (p < .05) of
the prerecognition hypotheses classified as
structural were given to neutral words than
to threatening ones. In no other instance was
a significant difference observed.

Of the five classification categories used,
four were based on content, while the fifth,
the structural, was determined entirely by
formal properties of the prerecognition hypotheses.
For this reason it was possible to
reclassify structural responses into one of the
four other categories according to the content
of the responses. The results of this secondary
analysis of prerecognition hypotheses, together
with the chi-square value obtained from tabulation
of observed and theoretical frequencies,
are presented in Table 1. Further, each of the
three classification categories was tested
against the other two combined; it was found
that significantly more neutral hypotheses

4 A later impression of the authors was that most of
the words classified as marginal probably approximated
emotional guesses more closely than neutral ones.




TABLE 1

CHI-SQUARE TEST BASED ON RECLASSIFICATION OF
STRUCTURAL RESPONSES*

CONTENT CATEGORY           NEUTRAL        THREAT        TOTAL
Neutral                    127 (106)**    85 (106)      212
Emotional and marginal†    18 (38)        58 (38)       76
Meaningless                15 (16)        17 (16)       32
Total                      160            160           320

* χ² = 29.51, df = 2, p < .01.

** Numbers in parentheses indicate expected frequencies.

† These two categories have been pooled, in accordance with
the notation in footnote 4 in the text, because the expected
frequencies for the category marginal were extremely small.


were given for the neutral stimulus words,
while significantly more emotional and marginal
guesses occurred in the case of threat
words.5 (In each instance p < .01.)
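
The chi-square value attached to Table 1 can be checked from the observed frequencies printed in the table. The sketch below is an illustration rather than the authors' procedure: the function and variable names are hypothetical, while the six observed counts are taken directly from Table 1. Expected frequencies are derived from the row and column totals in the usual way.

```python
def chi_square(observed):
    """Pearson chi-square for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)
    # Expected count for each cell: row total * column total / grand total.
    expected = [[r * c / grand_total for c in col_totals] for r in row_totals]
    statistic = sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(observed, expected)
        for o, e in zip(obs_row, exp_row)
    )
    df = (len(observed) - 1) * (len(col_totals) - 1)
    return statistic, df, expected


# Observed frequencies from Table 1: columns are neutral and threat words.
TABLE_1 = [
    [127, 85],  # neutral hypotheses
    [18, 58],   # emotional and marginal hypotheses (pooled)
    [15, 17],   # meaningless hypotheses
]

statistic, df, expected = chi_square(TABLE_1)
print(round(statistic, 2), df, expected)
```

The recomputed statistic comes to about 29.50 with 2 degrees of freedom, which agrees with the tabled value of 29.51 to within rounding, and the expected frequencies match those shown in parentheses in the table.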


CONCLUSIONS AND IMPLICATIONS 


The significantly greater number of trials
required for accurate report of threatening
words as compared to neutral ones constitutes
a verification of our earlier finding that perceptual
response appears to be less effectual
under conditions of threat-expectancy as well
as direct threat. That these data may not be
explained on the basis of a word-frequency
hypothesis is indicated by an absence of correlation
between word frequency and number
of trials required for accurate report under
the threat-expectancy condition. Moreover, an
explanation of the present data in terms of
conscious inhibition does not appear to be
entirely satisfactory. For one thing, in the
threat-expectancy situation, by alerting Ss
to the impending occurrence of threatening
stimuli, the credibility of their subsequent
appearance is increased and their verbalization

5 ... is simply a function of the fact that S had actually
per... from the threatening stimulus (e.g., ... for penis,
naked for raped). Even with such cases discarded from our
analysis, the significance of the results presented in
Table 1 holds up.

during the test series presumably facilitated. 
The absence of admitted instances of deliberate 
inhibition, as determined by posttest inquiry, 
supports this point of view. Further, we have 
been able to demonstrate an absence of dif- 
ferential inhibition as a function either of the 
sex of the S or of the sex similarity or dissimilarity
of the S-E constellation.

We interpret the findings of the present 
research as indicative of the probable operation 
of a perceptual defense process. To the extent 
that one is willing to accept as an index of 
defense the significantly higher incidence of 
emotional prerecognition hypotheses for threat 
words than for neutral ones, which was ob- 
served in the reclassification of structural 
hypotheses, this position is strengthened. 

Alerting Ss to impending threat in the threat- 
expectancy situation creates an opportunity 
for innumerable perceptual and behavioral 
adaptations. As a next step in this area of 
research it seems feasible to study system- 
atically correlates of the various adaptations 
made by Ss in terms of other personality vari- 
ables. Understanding the behavioral correlates 
of adaptation to threat is important to both 
the theoretician and the clinician. The fruitful- 
ness of studying individual differences in 
reactivity to perceptual threat has already been 
demonstrated (4). The potential value of 
placing such information at the service of the 
clinician has been suggested (8). 


SUMMARY 


In order to investigate whether delayed 
perceptual report of threatening words would 
occur under conditions of threat-expectancy 
in an experimental setup which makes possible 
examination of the operation of possible 
conscious inhibition as well as differential 
word frequencies, 59 Ss were given a series 
of 16 booklets, each containing 30 carbon 
copies of one five-letter stimulus word, to 
decipher. Eight of these words were considered 
neutral and eight threatening. 

Significantly more trials were required for 
correct report of threat words as compared 
to neutral ones. No correlation between num- 
ber of trials required for correct identification 
of the test words and word frequency was
found. Several logical and statistical checks 
were used to reduce the likelihood of explaining 
the results on the basis of conscious inhibition. 
The findings are interpreted as consistent with 
a concept of perceptual defense. This interpretation
was supported by an analysis of the prerecognition
hypotheses.

Some possible implications for personality 
theory and clinical practice, as well as sugges- 
tions for further research, have been proposed. 


REFERENCES 


1. Beier, E. G., & Cowen, E. L. A further investigation
of the influence of “threat-expectancy” on
perception. J. Pers., 1953, 22, 254-257.

2. Cowen, E. L., & Beier, E. G. The influence of
“threat-expectancy” on perception. J. Pers.,
1950, 19, 85-94.

3. Eriksen, C. W. Perceptual defense as a function
of unacceptable needs. J. abnorm. soc. Psychol.,
1951, 46, 557-564.

4. Eriksen, C. W. Defense against ego-threat in
memory and perception. J. abnorm. soc. Psychol.,
1952, 47, 230-235.

5. Howes, D. H., & Solomon, R. L. A note on
McGinnies’ “Emotionality and perceptual defense.”
Psychol. Rev., 1950, 57, 229-234.

6. Howie, D. Perceptual defense. Psychol. Rev., 1952,
59, 308-315.

7. Krech, D. Notes toward a psychological theory.
J. Pers., 1949, 18, 66-87.

8. Lazarus, R. S., & McCleary, R. A. Autonomic
discrimination without awareness: a study of
subception. Psychol. Rev., 1951, 58, 113-122.

9. Luchins, A. S. An evaluation of some current
criticisms of Gestalt psychological work on
perception. Psychol. Rev., 1951, 58, 69-95.

10. McCleary, R. A., & Lazarus, R. S. Autonomic
discrimination without awareness: an interim
report. J. Pers., 1949, 18, 171-179.

11. McGinnies, E. Emotionality and perceptual
defense. Psychol. Rev., 1949, 56, 244-251.

12. Pastore, N. Need as a determinant of perception.
J. Psychol., 1949, 28, 457-475.

13. Postman, L., & Bruner, J. S. Perception under
stress. Psychol. Rev., 1948, 55, 314-323.

14. Raimy, V. C. Training in clinical psychology. New
York: Prentice-Hall, 1950.

15. Rosenstock, I. M. Perceptual aspects of repression.
J. abnorm. soc. Psychol., 1951, 46, 304-315.

16. Schafer, R., & Murphy, G. The role of autism
in a visual figure-ground relationship. J. exp.
Psychol., 1943, 32, 335-343.

17. Solomon, R. L., & Howes, D. H. Word frequencies,
personal values and visual duration thresholds.
Psychol. Rev., 1951, 58, 256-270.

18. Thorndike, E. L., & Lorge, I. The teacher's word
book of 30,000 words. New York: Columbia
Univer. Press, 1944.

19. Werner, H. Introductory remarks. In symposium
“Interrelationships between perception and
personality.” J. Pers., 1949, 18, 2-5.

20. White, M. A. A study of schizophrenic language.
J. abnorm. soc. Psychol., 1949, 44, 61-74.


Received October 27, 1952. 




PREDICTING LANGUAGE BEHAVIOR FROM OBJECT SORTING1


LAURENCE S. McGAUGHRAN 
University of Houston 


Despite its historical and theoretical
significance, the area of conceptual
behavior is almost virgin territory in
psychological research. The work which has
been done in this area has been mainly concerned
with abnormalities in concept formation.
These studies have generally dichotomized all
conceptual behavior into “concrete” and “abstract”
levels or “attitudes.” The terms
concrete and abstract have generally been used
by Goldstein and others to differentiate be- 
tween the conceptual behavior of normal 
adults and that of such deviant groups as the 
brain-damaged, schizophrenics, and children. 
According to this analysis, an abstract attitude 
may be achieved by normal adults; individuals 
outside this class are characteristically concrete 
in their conceptualizations. 
Goldstein defines concrete and abstract at- 
titudes as follows: 


We can distinguish normally two different kinds of
attitudes we call the concrete and the abstract. In the
concrete attitude we are given over passively and bound
to the immediate experience of unique objects or
situations. Our thinking and acting are determined by the
immediate claims made by the particular aspect of the
object or situation... [in the abstract attitude] we
transcend the immediately given specific aspect of
sense impressions, we detach ourselves from the latter
and consider the situation from a conceptual point of
view and react accordingly. Our actions are determined
not so much by the objects before us as by the way
we think about them; the individual thing becomes a
mere accidental example or representative of a
“category” (3, p. 6).


Goldstein (2) clearly states his assumption
that these attitudes are dichotomous. Others
(e.g., 1, 4, 9) have attempted to spread out
the concrete-abstract dichotomy into a continuum.
However, these attempts have also

1 ... of Philosophy. ... who directed the ...
encouragement and support during this study.


involved research with deviant groups, and 
appear to have been directed toward breaking 
down one member of the polarity or the other, 
but not both in the same terms. In other words, 
there has not been developed a unitary set of 
concepts which may be used to dimensionalize 
an entire range of conceptual behavior in 
general psychological terms. 

The apparent lack of exact meaning for the 
concept, “concrete,” makes it especially dif- 
ficult to dimensionalize conceptual levels or 
attitudes. As originally used by Goldstein 
(see 3) to characterize conceptual behavior in 
brain-damaged patients, concrete attitude 
seemed to mean an inability (or, at least, a 
special difficulty) in breaking down perceptual 
wholes into parts. That is, an object was not 
seen by the patient as a "carrier" of a number 
of attributes, but as a nonanalyzable whole. 
Thus, when the patient grouped objects into 
categories, he did so in such a way that the 
whole-character of the objects was maintained 
(e.g., grouping on the basis of identity instead 
of similarity). Elaboration of this use of the 
term, concrete, in subsequent investigations 
has introduced a number of other criteria for 
concrete behavior which are more or less con- 
sonant with this original observation; these 
have included an inability to "shift" conceptu- 
ally from one attribute to another with the 
same group of objects, a lack of “spontaneity” 
in grouping, and “narrow” conceptual limits. 

However, as subsequent studies introduced 
additional experimental groups, such as chil- 
dren, aments, neurotics, and schizophrenics, 
additional criteria for concretism were sug- 
gested which do not seem to correspond with 
the original meaning of the term. Examples 
of these are the conceptual grouping of objects 
on the basis of past personal experience, nar- 
rative sequences, and private symbolic mean- 
ings. 

An analysis of all of the criteria for concretism
cited in the literature suggests that
there are at least two broad categories of
conceptual behavior to which concrete concepts
may be assigned. The first more or less coincides
with the meaning which is inferred to
have been originally intended by Goldstein by 
the term, concrete, as discussed above. The 
second category of conceptual behavior to 
which a number of concrete concepts can be 
assigned might be designated as that involving 
“private,” or “nonpublic,” principles of collec- 
tion. Thus, when the individual groups objects 
on the basis of differences in his feelings toward 
them or the symbolic significance they hold for 
him, the observer without foreknowledge of 
these private events is unable to predict the 
limits of the conceptual grouping. Such group- 
ings do not, however, necessarily reflect an 
inability to break down wholes into parts. 

The experimental variable employed in the 
present research is an elaboration of this dis- 
tinction made between two categories of 
concrete conceptualization. 


DEFINITIONS 


To set up a system of analysis of conceptu- 
alization which would “cut through" the 
previously employed abstract-concrete dichot- 
omy, it seemed necessary to make a number of 
distinctions which have not heretofore been 
expressed. The following definitions are set 
forth to convey these distinctions; the terms 
employed are, of course, constructs limited in 
application at the present time to the opera- 
tions of the present experiment. 

1. A concept is a term identifying the principle
employed by an individual to collect two
or more objects (or events) to which he in
some manner intends to respond similarly.
The nature of a concept necessarily implies
that some other abstractible qualities of the
single objects are disregarded by the individual
when the objects are so collected.

The total of conceptual-group-memberships
is the sum total of all possible abstractions
which could be made from objects collected
within a specific conceptual grouping.

2. Conceptual freedom is a postulated dimension
of conceptualization which describes the
extent to which an individual concept permits
the potential inclusion of additional
conceptual-group-memberships within its limits. The degree
of conceptual freedom is inversely related to
the complexity (i.e., multiplicity of attributes)
of the collecting principle for the specific
concept. (For example, a concept of “white,
square objects” would by definition exclude
“white, not-square” and “square, not-white”
objects, and would, therefore, have a lesser
degree of freedom than either “white objects”
or “square objects” as separate concepts.)
The terminals of this dimension are designated
“closed” and “open-ended,” respectively.
Closed concepts tend toward restrictiveness,
and open-ended concepts tend toward infinite
freedom, in the variety of objects (and thus
additional conceptual-group-memberships) that
would be included within their limits.

3. Conceptual extensionality is a postulated
dimension of conceptualization which describes
the extent to which an individual concept
permits potential public prediction of conceptual
group limits. The degree of conceptual
extensionality expresses the extent to which
the principle underlying the individual concept
is shared and freely communicated by the
majority of persons within the same culture.
(For example, the principle for “triangularity”
is culturally denotable; the principle for
“attractiveness” is considerably less so.) The
terminals of this dimension are designated
“public” and “private,” respectively.

4. Conceptual area is a broad class of concepts
grouped together by empirical definition.
Definition of the limits of a conceptual area
is an arbitrary procedure based upon the
setting of a guessed cutting point for two or
more dimensions of conceptualization to form
an assumed axis, for the purpose of testing a
hypothesis concerning the “best fit” for the
axis. The term conceptual area is used to
contrast the present formulation with previous
ones using conceptual level as a basis for
classification. Classification by conceptual
level represents an attempt to locate concepts
at points along a single continuum; classification
by conceptual area implies the use of
two (or more) continua as coordinates to locate
concepts within a space common to both (or
to all) of them. Three conceptual areas are here
hypothesized: hypostatic, autistatic, and
metastatic (for definitions, see below). The
first two of these areas are those employed in
the present experimental groups.

5. The hypostatic2 conceptual area (containing
“H concepts”) lies within the quadrant between
the closed and public terminals of the
dimensions of conceptual freedom and conceptual
extensionality, respectively (cf. Fig. 1).
This area consists of those concepts which
have a relatively low potential for additional
conceptual-group-memberships and a relatively
high potential for public prediction of conceptual
group limits. Conceptual groupings
within this area would be closed and public;
that is, the complex principle used to group
the objects would limit the kinds of objects
that could be included in a group but would
be readily understood by others.

2 The dictionary definition for hypostasis is a
“substance, subsistent principle, or essential nature of
anything; a subject in which attributes are conceived ...”

H concepts are complex concepts. A com- 
plex concept is a term whose principle of collec- 
tion “freezes” two or more abstractible items 
within the total of its conceptual-group-mem- 
berships into a fixed relationship. The principle 
of collection for an H concept is based upon 
the identity, species-identity, or quasi-identity, 
of the objects within its group. Identity is 
assumed exact replication of objects within a 
conceptual group. A species is a group of ob- 
jects, each of which is “called,” or usually 
identified, by the same name (for example, 
knives, pencils). Quasi-identity is the partial 
identity of objects within a group induced 
through the “freezing” of two or more items 
into a fixed relationship upon which the 
concept is based (for example, “red, round, 
rubber objects”). 

6. The autistatic3 conceptual area (containing
“A concepts”) lies within the two quadrants
formed by the entire conceptual freedom
dimension and the private terminal of the
conceptual extensionality dimension (cf. Fig.
1). This area consists of those concepts which
have a relatively low potential for public
prediction and an undetermined potential for
additional conceptual-group-memberships.
(Informal observation in this particular experimental
situation suggests that the quadrant
of low potential for additional
conceptual-group-memberships and low potential for
public prediction would be a relatively small
one, at least for the population from which
the present sample was drawn. It might be
hypothesized that such responses would frequently
be psychotic, or at least markedly
[...]ive, in nature. This is, of course,
nothing more than a speculation at this point,
since this present experiment was not designed
to test this hypothesis.)

3 The term, “autistatic,” was coined by the writer
to convey the meaning of a form of conceptualization
based upon a private or noncommunicated principle
of collection.

[Figure: two crossed axes, one running from Public Concepts
to Private Concepts and the other ending in Open-End Concepts;
regions labeled Metastatic Conceptual Area (M Concepts),
Hypostatic Conceptual Area (H Concepts), and Autistatic
Conceptual Area (A Concepts).]

FIG. 1. REPRESENTATION OF CONCEPTUAL AREAS
FORMED BY AN ASSUMED AXIS OF THE DIMENSIONS
OF “CONCEPTUAL FREEDOM” AND “CONCEPTUAL
EXTENSIONALITY”

7. The metastatic4 conceptual area (containing
“M concepts”) lies within the quadrant between
the open-ended and public terminals
of the conceptual freedom and conceptual
extensionality dimensions, respectively (cf.
Fig. 1). This area consists of those concepts
which have a relatively high potential both
for additional conceptual-group-memberships
and for public prediction.5
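
The way the three conceptual areas partition the space formed by the two dimensions can be sketched as a small classification rule. This is a hypothetical illustration only: the 0-to-1 numeric scales, the function name, and the default cutting point of 0.5 are assumptions introduced here and do not appear in the study, which describes the cutting point simply as a guessed, arbitrary one.

```python
def conceptual_area(freedom, extensionality, cut=0.5):
    """Assign a concept to a conceptual area from two dimension scores.

    freedom: 0.0 (closed) to 1.0 (open-ended); hypothetical scale.
    extensionality: 0.0 (private) to 1.0 (public); hypothetical scale.
    cut: arbitrary cutting point applied to both dimensions (assumed).
    """
    if extensionality < cut:
        # Private concepts are autistatic regardless of conceptual freedom.
        return "autistatic"
    if freedom < cut:
        # Public and closed.
        return "hypostatic"
    # Public and open-ended.
    return "metastatic"


# Hypothetical scores for a public, closed concept such as
# "red, round, rubber objects".
print(conceptual_area(freedom=0.2, extensionality=0.9))
```

The private half of the space is treated as a single area spanning both quadrants, which mirrors the statement that the autistatic area has an undetermined potential for additional conceptual-group-memberships.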


STATEMENT OF OPERATIONS AND HYPOTHESIS 


A method frequently used in studies of con- 
ceptualization (e.g., 1, 3, 9, 10, 11) involves 
the task of sorting commonly known, “everyday”
objects, such as toys, tools, smoking,
writing, and eating equipment, etc., into groups 
into which they “belong.” This method, with 
some alterations in administration and scoring 
procedures as discussed below, was adopted 
in the present study to provide a measure of 


4 The dictionary definition for metastasis is a “change 
of state, substance, or form.” 

5 The concept, metastatic conceptual area, is em- 
ployed in the present experiment only to exclude certain 
individuals within the sample from the experimental 
groups. With some exceptions, the concepts falling 
within this area correspond to those designated as on 
an “abstract level” by previous investigators. 




the criterion of hypostatic vs. autistatic con- 
ceptualization. 

The field of individual differences in language 
behavior appears to be a rich source of data 
for psychological research, but the results of 
investigations in this area have frequently 
seemed cloudy. One possible reason for this 
may be that the experimental criteria used 
to predict language behavior have most fre- 
quently lacked clear definition (e.g., psy- 
chiatric categories). Since the present study 
attempts a clearer delineation of behaviors 
collected under the term concretism, and since 
language behavior, itself, involves conceptu- 
alization, it seemed particularly appropriate to 
use language analysis as a method to test the 
validity of the separation of concrete con- 
ceptualization in an object-sorting task into 
hypostatic and autistatic areas. 

The hypothesis is formally stated as follows: 
A significant relationship exists between con- 
ceptual performance in an object-sorting situa- 
tion, as expressed by dominant “conceptual 
area,” and language performance, as measured 
by an analysis of language usage in a picture- 
interpretation situation. 


METHODS
Preliminary Studies 


Four preliminary studies using a total of 35 sub- 
jects (Ss) were conducted to develop administration 
and scoring manuals (7) for the object-sorting situation 
(OSS) as a criterion task and the language-behavior 
situation as a predictor task. The first three of these 
studies were largely empirical explorations of methods 
and measures; the fourth preliminary study was more 
formally designed to choose an adequate measure in the 
sorting task and to eliminate all language measures ex- 
cept those which permitted the most objective scoring 
and which seemed to fit within a coherent rationale. 
To simplify presentation, final administration and 
scoring procedures for both tasks are described at this 
point. In actual fact, some alterations to provide this
[...] for both procedures during
the item analysis of the fixed-design [...]

Development of procedures for scoring the OSS. The 
32 objects used in the sorting task corresponded roughly 
to those used by previous investigators, although no 
special attempt was made to match them exactly be- 
cause no substantial body of normative data is avail- 
able to warrant rigidly fixed procedures. The modifica- 
tions in procedure introduced in the present study were 
in methods of administration and scoring. As regards 
administration of the OSS, instead of presenting all 
of the objects to S simultaneously, E handed over each 
object separately and in a fixed sequence designed to 
force a choice between alternative groupings for each 




L. S. McGAUGHRAN 


object presentation. When these sortings were com-
pleted, the objects were shuffled into one pile and S
was requested to try to resort them in a different way.
Responses for both phases (sorting and resorting) were
used in computing the criterion score for each S.
In scoring the OSS, E categorized each of S's sort-
ings in this task as occurring within the hypostatic,
autistatic, or metastatic areas of conceptualization.
A list of the subareas of classification employed in mak-
ing these designations is shown in Table 1, together
with an example for each class of conceptual grouping.
The criterion measure was termed the hypostatic-
autistatic index (H-A index). It was obtained for each
S by subtracting the percentage of his autistatic con-
cepts from the percentage of hypostatic concepts when
either figure was higher than his percentage of meta-
static concepts. Those Ss who did not meet this cri-
terion (that is, Ss whose percentage of metastatic con-
cepts exceeded the percentage of concepts in each of the
other two areas) were excluded from the present study.
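
The H-A index described above can be sketched in a few lines. This is only an illustration under stated assumptions: the function name `ha_index`, the one-letter area labels, and the treatment of ties (a subject is excluded only when the metastatic percentage strictly exceeds both of the others) are not taken from the scoring manual.

```python
from collections import Counter
from typing import Optional, Sequence


def ha_index(areas: Sequence[str]) -> Optional[float]:
    """Return the H-A index for one subject, or None if the subject is excluded.

    `areas` holds one label per scored grouping: "H" (hypostatic),
    "A" (autistatic), or "M" (metastatic). Labels are hypothetical.
    """
    if not areas:
        return None
    counts = Counter(areas)
    n = len(areas)
    pct = {k: 100.0 * counts.get(k, 0) / n for k in ("H", "A", "M")}
    # Exclusion rule: metastatic percentage exceeds each of the other two.
    # Ties are kept in the sample here; that choice is an assumption.
    if pct["M"] > pct["H"] and pct["M"] > pct["A"]:
        return None
    # Index: percentage hypostatic minus percentage autistatic.
    return pct["H"] - pct["A"]


if __name__ == "__main__":
    print(ha_index(["H"] * 6 + ["A"] * 2 + ["M"] * 2))  # 40.0
    print(ha_index(["M"] * 5 + ["H"] * 3 + ["A"] * 2))  # None (excluded)
```

A positive index places S toward the hypostatic side and a negative index toward the autistatic side.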
Development of procedures for the analysis of language
behavior. Data from the language-behavior situation
consisted of narrative interpretations to a series of 12
pictures presented in sequence to each S.⁶ Instructions
to S for this task corresponded, in general, to those
usually employed in administering the TAT. These
interpretations were electrically recorded and tran-
scribed to typewritten protocols; each protocol was
proofread at least twice before the language analysis.
To explain the rationale used to develop and to group
the language measures, it seems necessary to discuss
briefly the general behavior tendencies which were as-
sumed, a priori, to be associated with regnant hypo-
static and autistatic conceptualization. When the indi-
vidual tends to require complete identity as the basis
for his sorting of objects in the OSS, he maintains a
high degree of order and precision in his conceptual
organization. His conceptual distinctions are based
upon a fine discrimination of perceptual dissimilarity
between one complete object and another (that is,
between one complex of conceptual-group-member-
ships and another); this efficiency in conceptual or-
ganization is maintained so long as the parameters
of conceptual-group-memberships remain relatively
“fixed,” but it tends to break down when changes or
ambiguities are introduced into the stimulus field. On
the other hand, when S consistently uses the objects
in the OSS simply to “trigger” a series of private as-
sociations, his conceptual organization may have little
or nothing to do with the perceptual similarity of ob-
jects within his groups. The introduction of changes or
ambiguities, as perceived by E, in the stimulus field
may be completely disregarded by S unless these
changes coincide with his own autistic shifts in asso-
ciation.
An individual's conceptual system is his organization
of his field of awareness. If he consistently employs a
particular form of conceptual organization under one


6 Nine of these pictures were taken from the TAT
series (cards 1, 2, 4, 10, 11, 13, 14, 16, and 20). The
other three were taken from a set prepared by Julian
B. Rotter, Ohio State University. Card 1 was used as
a “warm-up” task only; narratives for this card were
not used in the language analysis.


TABLE 1

AREAS AND SUBAREAS OF CONCEPTUALIZATION SCORED IN THE OBJECT-SORTING SITUATION
WITH EXAMPLES OF GROUPINGS AND DEFINITIONS FOR EACH SUBAREA

AREA AND SUBAREA OF CONCEPTUAL GROUPING             EXAMPLE OF OBJECTS GROUPED                           EXAMPLE OF STATED BASIS FOR GROUPINGS

HYPOSTATIC AREA
H-1 Concepts of identity
  a. Absolute identity                              2 corks                                              “corks—the same thing”
  b. Species identity                               toy fork, fork                                       “forks go together”
  c. Identity of function                           toy hatchet, toy hammer                              “hatchet and hammer—serve same purpose”
H-2 Concepts of induced partial identity
  a. Direct multiple restriction                    2 sugar cubes, file card                             “white and rectangular”
  b. Indirect multiple restriction                  toy tools, tools                                     “children's tools here—real tools there”
  c. Juxtaposition of similarity and dissimilarity  green square, red circle, file card                  “same material and different colors”
  d. Common internal variation                      pipe, spoon                                          “stems with depression at base”
H-3 Concepts of situational co-occurrence of species
  a. Reciprocal cofunctionality                     pipe, matches                                        “light pipe with matches”
  b. Specific mediation                             file card, blotter                                   “write on with ink, need blotters”
  c. Unstructured situational co-occurrence         pencil, eraser                                       “seem to go together”

AUTISTATIC AREA
A-1 Physiognomic concepts
  a. Judgmental                                     (several)                                            “objects that please me”
  b. Subjective elaboration                         lock, toys                                           “keep people from stealing”
  c. Experiential contiguity                        blotter, cigarettes, matches                         “have on my desk at home”
A-2 Concepts of construction and design             green square, red circle, ball (spatially arranged)  “fellow with ear muffs”
A-3 Chain and radial concepts                       block, sink stopper, eraser, red circle (3 groups)   “these have projections (bl, ss); the stopper also goes with eraser because rubber; and circle with stopper—round”
A-4 Representational concepts                       matches, cork                                        “burnt cork for minstrel”
A-5 Narrative and conditional concepts              2 corks                                              “close whiskey bottle with one, have another if get drunk and lose it”
A-6 Concepts of implicit and loose mediation
  a. Implicit mediation                             tableware, sugar, smoking equipment                  “to eat and smoke”
  b. Loose (synecdochal) mediation                  matches, toy hatchet                                 “matches from trees and ax chops trees down”
  c. Heterogeneous concepts                         (several)                                            “all are made of either wood or metal or both”
  d. Nonce concepts                                 (several)                                            “rest over there—don't know why”

METASTATIC AREA
M-1 Concepts of unitary abstraction                 red square, ball, and eraser                         “they're red”
M-2 Concepts of superordination—nominal             several toys                                         “all these are toys”
M-3 Concepts of superordination—implicit            several toys                                         “all of these you use for play”



set of stimulus conditions, it might be hypothesized 
that he would show similar tendencies under other 
conditions. This is really nothing more than a specific 
assumption of the existence of relatively general be- 
havior potentials (or traits) within a particular area 
of personality organization. On an a priori basis, an 
assumption was made that the individual's consistent 
use of hypostasis as a principle of conceptual organiza- 
tion might be associated with a more general behavioral 
potential toward “reality-fixing,” or “stimulus-bound-
ness.” Similarly, it was assumed that the consistent
grouping of objects in the OSS on the basis of private, 
nonextensional meanings to the individual might be 
associated with a more general behavior potential 
toward "autism." 

If these consistencies in behavior in the sorting task 
actually are associated with more general behavior po- 
tentials of reality-fixing vs. autism, it might be ex- 
pected, again on an a priori basis, that individuals under 
different stimulus conditions would show other con- 
sistent differences in behavior in more specific terms 
such as rigidity-fluidity, inhibition-spontaneity, pas- 
sivity-autonomy in reality structuration, vigilance- 
laxness in reality-testing, etc. Such concepts as these, 
lacking precise definition, were used rather loosely as 
guides in the selection and grouping of the measures of 
language behavior elicited in the picture-interpretation 
task. It might be added that this projective task seemed 
admirably suited for the present experimental problem 
because its purpose is to require an attempt to interpret 
ambiguous stimulus material. 

The 39 frequency measures finally selected for lan- 
guage analysis are shown in Table 2. A brief descrip- 
tion and an illustration are provided for each measure. 
The 39 separate measures were grouped into eight 
classes of common rationale in order to test both specific 
and more general hypotheses of relationship between 
behavior potentials. The specific and general hypotheses
for the measures in each class are set forth below in 
terms of the predicted performance for the hypostatic 
group (H group). 

1. Class A (language flow). The general hypothesis
was that the H group would show less spontaneity, 
more inhibition in picture interpretation. The specific 
hypotheses for each measure were: (a) Measure 1— 
low total verbal production for H group. (b) Measure 
2—low amount of speech interference (i.e., “concept 
rivalry") for H group. (c) Measure 3—high fluency in 
produced speech for H group. 

2. Class B (maintenance of autonomy). The general
hypothesis was that the H group would show less 
autonomy in imposing new structure upon the experi- 
mental situation. The specific hypotheses for the sepa- 
rate measures were: (a) Measures 4 and 6—fewer 
“free” questions and fewer remarks out of context of 


7 A more recent test of this assumption was made by
Jeffreys (5). He found a low but significant relationship
between hypostatic vs. autistatic conceptualization
in an object-sorting situation, and range of experienced
movement in the phi phenomenon and “form-bound
vs. form-labile” performance on the Rorschach test.
The latter two situations were introduced by Klein
and Schlesinger (6) to demonstrate the generality of 
“perceptual attitudes” which they term “resistance to 
instability” and “tolerance for instability.” 



the experimental task by the H group. (b) Measures 
5 and 7—more requests for reassurance, and more re- 
marks protesting difficulty in interpreting ambiguous 
details in the stimulus cards by the H group.

3. Class C (flexibility of conceptual organization).
The general hypothesis was that Ss in the H group,
tending toward description rather than interpretation
of the narrative cards, would show a greater initial
precision in their choice of words; that is, their verbal
responses would have a more deliberative, either-or
character. The specific hypotheses for the separate
measures were: (a) Measures 8, 9, 10, and 11—fewer
slips of speech, and fewer corrections and elaborations
of verbal constructions once produced by the H group.
(b) Measure 12—more attempts by the H group to
emphasize “psychological distance” between the nar-
rator and the stimulus card by substituting a modal
auxiliary (e.g., could be) for a copulative (is) form of
predication.

4. Class D (maintenance of narrator role). The general
hypothesis was that the H group, showing less spon-
taneity, free imagination, and autonomy in projective
interpretations, would attempt to maintain psycho-
logical distance by literalizing the experimental task
of “making up a story.” The specific hypotheses for
the separate measures were: Measures 13, 14, and 15—
more references by the H group to the artificiality of
the stimulus material and to the fictional nature of
their narrative responses.

5. Class E (reality-modifying expressions). The
general hypothesis was that when the H group did
qualify or extend their interpretations, it would be in
the direction of either greater precision of description
or less positiveness of an assertion. The specific hy-
potheses for the separate measures were: (a) Measures
16, 17, and 18—fewer qualifying terms by the H group
to convey allness, completeness, certainty, or limitless
progression. (b) Measures 19 and 20—more qualifying
terms by the H group to convey exactitude or attenua-
tion of the “reality” of their story interpretations.

6. Class F (reality-structuring expressions). [...]
The specific hypotheses for the separate measures were:
(a) Measures 21, 22, 23, and 24—more constructions
by the H group assessing the possible or probable mean-
ings “resident” in the stimulus card. (b) Measure 25—
fewer constructions by the H group referring to S's
own experience as the source of meaning for the narra-
tive content.

7. Class G (resistance to the experimental task). Since
the number of rejections of stimulus cards (Measure
26) was the only measure in this class, the only hy-
pothesis was that the H group would be less able to
carry out the experimental task of narration.

8. Class H (summation within-group measures). These
[...] measures within each class listed above. They are added
to provide a test for the validity of the common ra-
tionale underlying each general hypothesis.


PREDICTING LANGUAGE BEHAVIOR FROM OBJECT SORTING 189 


TABLE 2

CLASS, SPECIFIC TITLE, DESCRIPTION, AND ILLUSTRATION OF EACH FREQUENCY MEASURE
USED IN THE ANALYSIS OF LANGUAGE BEHAVIOR DURING A PICTURE-INTERPRETATION TASK

Each entry gives, in order: CLASS, NUMBER, AND SPECIFIC TITLE FOR EACH LANGUAGE MEASURE; BRIEF DESCRIPTION OF MEASURE; ILLUSTRATION OF LANGUAGE CONSTRUCTION.

Language flow (Class A). Measures of verbal production and fluency.
  1. Total verbal production
     Description: Total number of words used in interpretation task.
     Illustration: (none)
  2. Interference noises
     Description: Nonmeaningful sounds.
     Illustration: “uh,” “ah,” “er.”
  3. Word contractions
     Description: Contractions increasing fluidity of speech.
     Illustration: “This man's going to...”

Maintenance of autonomy (Class B). Measures of verbal behavior outside of the context of the narration task.
  4. Autonomous questions
     Description: All questions by S except requests for reassurance.
     Illustration: “Which card was it that looked like this one?”
  5. Nonautonomous questions
     Description: Requests for reassurance or for E's evaluation at completion of story.
     Illustration: “Is that story alright?”
  6. Autonomous asides
     Description: “Aside” remarks by S not immediately related to the experimental task of narration.
     Illustration: “Such a person always gets by.”
  7. Nonautonomous asides
     Description: “Aside” remarks by S protesting some feature of the picture or of the narration task.
     Illustration: “The background is too shadowy, I don't know what he's doing.”

Flexibility of conceptual organization (Class C). Measures of S's correction or elaboration of words or phrases already uttered.
  8. Permeability
     Description: Correction of slips of speech, such as a jumbling of words or an “anticipation” of a word before its usual order in the sentence.
     Illustration: “The books she hand—has in her hand...”
  9. Enhancement
     Description: A break in continuity in which a word or a phrase is substantially repeated, but with elaborations or amendments.
     Illustration: “...listening to the music—to the tones of the music.”
  10. Mirroring
     Description: An abrupt shift from a positive to a negative form (or v.v.) with retention of essentially the same meaning.
     Illustration: “I don't think—I think...”
  11. Parenthetical constructions
     Description: Sequential phrases joined by such connectives as “rather,” “or,” etc. in which a refinement of the original statement seems intended.
     Illustration: “...aspect—rather, attitude...”
  12. Reality reductions
     Description: [...] in which there is an apparent attempt to reduce the positiveness of an interpretative assertion.
     Illustration: “He is—seems to be angry.”

Maintenance of narrator role (Class D). Measures of S's presumed attempt to maintain “psychological distance” through continued references to the “props” of the [...]
  13. Direct references to stimulus material
     Description: [...] picture, painting, or card during the interpretation.
     Illustration: “This picture reminds me...”
  14. Direct references to narrating terminology
     Description: [...] presumably emphasizing the fictional nature of the interpretative content (e.g., [...]
     Illustration: “It could be a scene in which [...]”
  15. Pronominal references to stimulus material
     Description: Substitution of pronouns within the same or a succeeding sentence for terms scored in Measure 13.
     Illustration: “This picture has nothing on it. It does not look...”

Reality-modifying expressions (Class E). Measures of the usage of terms serving to strengthen or weaken the positiveness or extensiveness of an assertion.
  16. Allness expressions
     Description: Terms conveying completeness or entirety of a narrated event (e.g., completely, totally, utterly).
     Illustration: “He is completely exhausted.”
  17. Expressions of certainty
     Description: Terms conveying the certainty of a narrated event (e.g., positively, absolutely, definitely).
     Illustration: “She is definitely dead.”
  18. Open-ended expressions
     Description: Phrases which serve to “hold open” a phrase or clause for unspecified additions (e.g., and so on, and everything, and all that).
     Illustration: “She's trying to comfort him, and all that, and...”
  19. Expressions of exactitude
     Description: Terms conveying exactitude or singularity in the sense of an emphasis upon specification (e.g., only place, strictly speaking, exact time).
     Illustration: “It's the only place it could happen.”
  20. Attenuating expressions
     Description: Phrases which tend to reduce or weaken the meaning or effect of the words they modify (e.g., more or less, a little bit, something like).
     Illustration: “He probably is more or less wishing that...”

Reality-structuring expressions (Class F). Measures intended to estimate the degree to which S accepts or evades the responsibility for the “reality-structuring” of his narrative.
  21. Externalization of narrative “reality” (verbal)
     Description: Modal auxiliaries conveying fixity of the plot or of the characterizations as “resident” in the stimulus card.
     Illustration: “This might be a mother's situation.”
  22. Externalization of narrative “reality” (adverbial)
     Description: Adverbial terms or phrases conveying fixity of the plot or of the characterizations as “resident” in the stimulus card.
     Illustration: “It possibly is a question of...”
  23. Probability hypostasis (verbal)
     Description: Modal auxiliaries or other verbal forms conveying probable fixity of the plot or of the characterizations as “resident” in the stimulus card.
     Illustration: “This picture seems to indicate that...”
  24. Probability hypostasis (adverbial)
     Description: Adverbial terms or phrases conveying probable fixity of the plot or of the characterizations as “resident” in the stimulus card.
     Illustration: “He probably is the kind of fellow who...”
  25. Internalization of narrative “reality”
     Description: Terms or phrases referring to S's own reported experience as the focus or medium for his interpretation (e.g., I see, I associate).
     Illustration: [...]

Resistance to experimental task (Class G). A measure of the failures to carry out the task of “making up a story.”
  26. Theme avoidance
     Description: A response to a stimulus card in which there is only description, without narration or depiction of characters (as behaving-experiencing organisms).
     Illustration: “This means nothing—there's a bridge and some kinds of animals, but you can't tell what's happening.”

Raw score summations within groups of language measures (Class H). Measures based upon raw score (frequency count) summations of the foregoing measures grouped together on the basis of a common rationale.
  27. Autonomous balance—questions: Measure 4 minus Measure 5.
  28. Autonomous balance—asides: Measure 6 minus Measure 7.
  29. Autonomous balance—total: Measures 4 plus 6, minus 5 plus 7.
  30. Conceptual fluidity: Summation of raw scores for Measures 8, 9, 10, and 11.
  31. Conceptual fluidity—fixity balance: Measure 30 minus Measure 12.
  32. Total narrator distance: Summation of raw scores for Measures 13, 14, and 15.
  33. Narrator distance (direct reference): Summation of raw scores for Measures 13 and 14.
  34. Narrator distance (reference to stimulus material): Summation of raw scores for Measures 13 and 15.
  35. Reality-expanding expressions: Summation of raw scores for Measures 16, 17, and 18.
  36. Reality-narrowing expressions: Summation of raw scores for Measures 19 and 20.
  37. Reality narrowing—expanding balance: Measure 36 minus Measure 35.
  38. Hypostatic reality structuration: Summation of raw scores for Measures 21, 22, 23, and 24.
  39. Hypostatic-autistatic balance in reality structuration: Measure 38 minus Measure 25.
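
The Class H measures in Table 2 are simple sums and differences of the raw scores for Measures 1 through 26, so they can be written out directly. The sketch below is illustrative only: the function name `class_h_measures` and the dict layout (measure number mapped to raw score) are assumptions, and Measure 29 is read as (4 + 6) minus (5 + 7).

```python
from typing import Dict


def class_h_measures(m: Dict[int, float]) -> Dict[int, float]:
    """Derive Measures 27-39 from the raw scores of Measures 1-26.

    `m` maps a measure number to its raw score for one subject.
    """
    s: Dict[int, float] = {}
    s[27] = m[4] - m[5]                      # autonomous balance, questions
    s[28] = m[6] - m[7]                      # autonomous balance, asides
    s[29] = (m[4] + m[6]) - (m[5] + m[7])    # autonomous balance, total
    s[30] = m[8] + m[9] + m[10] + m[11]      # conceptual fluidity
    s[31] = s[30] - m[12]                    # fluidity-fixity balance
    s[32] = m[13] + m[14] + m[15]            # total narrator distance
    s[33] = m[13] + m[14]                    # narrator distance, direct reference
    s[34] = m[13] + m[15]                    # narrator distance, stimulus material
    s[35] = m[16] + m[17] + m[18]            # reality-expanding expressions
    s[36] = m[19] + m[20]                    # reality-narrowing expressions
    s[37] = s[36] - s[35]                    # narrowing-expanding balance
    s[38] = m[21] + m[22] + m[23] + m[24]    # hypostatic reality structuration
    s[39] = s[38] - m[25]                    # hypostatic-autistatic balance
    return s


if __name__ == "__main__":
    # Dummy raw scores: measure k simply has score k.
    demo = {k: k for k in range(1, 27)}
    print(class_h_measures(demo))
```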


Fixed Design for the Formal Experiment 


A “fixed-design” experiment was conducted to pro-
vide a formal derivation and test for the general and
specific hypotheses listed above. It consisted of two
parts: (a) an item analysis to derive an exact basis
for the prediction of differences in language behavior
between the experimental groups; and (b) a cross-
validation study to test the occurrence of these dif-
ferences as a function of the variable of hypostatic vs.
autistatic conceptualization.

The sample drawn for the fixed-design study con-
sisted of a group of 74 college students enrolled in ele-
mentary psychology classes at Ohio State University;
each S was placed in the object-sorting and picture-
interpretation situations within a single experimental
session which averaged about an hour in duration.
The Ss who met the experimental criterion (i.e.,
those whose conceptual groupings were predominantly
hypostatic or autistatic) were distributed into the
following subgroups: (a) item analysis study—hypo-
static group 12, autistatic group 11; (b) cross-validation
study—hypostatic group 14, autistatic group 14.

These subgroups were matched for age, educational 
level, test intelligence, and sex; an analysis of variance 
indicated no significant group differences for any of 
these control factors. 

Since the same E scored both procedures, it was 
necessary to set up a number of controls to prevent 
contaminations in prediction. These preventive steps 
(e.g., coding of names, avoidance of halo effect) are
discussed in (7). Except where a simple frequency count 
was more appropriate (e.g., Measure 26, theme avoid- 
ance), the frequency for each of the language measures 
was converted into a ratio of the number of occurrences 
per 1,000 words of protocol. To distinguish the fre- 
quency count and the frequency ratio from the language 
scores, to be discussed below, both of the frequency 
expressions are called “raw scores.”
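
The frequency ratio mentioned above is a plain rate per 1,000 words of protocol. A minimal sketch, with a hypothetical function name and made-up counts:

```python
def per_thousand_words(count: int, total_words: int) -> float:
    """Convert a frequency count into occurrences per 1,000 words of protocol."""
    if total_words <= 0:
        raise ValueError("protocol must contain at least one word")
    return 1000.0 * count / total_words


if __name__ == "__main__":
    # Illustrative numbers only: 6 occurrences in a 2,400-word protocol.
    print(per_thousand_words(6, 2400))  # 2.5
```

Expressing each measure as a rate keeps talkative and terse Ss comparable, which a bare count would not.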


TABLE 3
DISTRIBUTION AND SIGNIFICANCE VALUES FOR TOTAL “LANGUAGE SCORES” IN THE ITEM ANALYSIS OF
LANGUAGE MEASURES FOR HYPOSTATIC (H) AND AUTISTATIC (A) GROUPS
(N = 23)

                                                        TOTAL NUMBER    EFFECTIVENESS OF CUTTING POINT
PREDICTIVE CATEGORY OF                                  OF LANGUAGE     H GROUP          A GROUP
LANGUAGE MEASURES                                       MEASURES        RIGHT   WRONG    RIGHT   WRONG    χ²

I.  Total—major-predictor measures                      15              11      1        11      0        15.54
II. Total—raw score summation (Class H) measures*       12              12      0        10      1        15.49

* Class H measures are those used to test the general hypotheses of language behavior (cf. Table 2).


RESULTS 


Item analysis. The raw scores for each lan- 
guage measure were placed into parallel fre- 
quency distribution tables for hypostatic and 
autistatic groups, and a cutting score for that 
measure was set at the point of greatest separa- 
tion between these groups. Each measure was 
then treated as a separate item in a language- 
behavior "test." To accomplish this, it was 
necessary only to assign a consistent direction 
(i.e., “pass” or “fail”) for each measure to the
expected performance for one of the two 
groups of Ss. The expected performance for 
the hypostatic group was arbitrarily assigned 
in the direction of passing. Thus, an S in 
either group whose frequency count for any 
measure placed him within the range of per- 
formance considered to be characteristic of the 
hypostatic group was scored as passing that 
item. "Scores" were computed for each S 
based upon the number of passes for all of 
the items within each category of language 
measures; high scores were considered as the 
expected performance of the hypostatic group. 
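
The item-analysis scoring just described can be sketched as follows. The text does not say how the “point of greatest separation” was located, so this sketch assumes it is the cut, taken midway between adjacent observed raw scores, that classifies the largest number of Ss correctly. All names (`best_cut`, `passes`, `language_score`) and the sample numbers are illustrative.

```python
from typing import Dict, Sequence, Tuple


def best_cut(h_scores: Sequence[float], a_scores: Sequence[float]) -> Tuple[float, bool]:
    """Return (cut, h_is_high): the cutting score and whether the hypostatic
    side lies above it. Assumed criterion: most Ss classified correctly."""
    values = sorted(set(h_scores) | set(a_scores))
    best = None
    for lo, hi in zip(values, values[1:]):
        cut = (lo + hi) / 2
        for h_is_high in (True, False):
            right = sum((x > cut) == h_is_high for x in h_scores)
            right += sum((x > cut) != h_is_high for x in a_scores)
            if best is None or right > best[0]:
                best = (right, cut, h_is_high)
    if best is None:
        raise ValueError("need at least two distinct raw scores")
    return best[1], best[2]


def passes(raw: float, cut: float, h_is_high: bool) -> bool:
    """An item is 'passed' when the raw score falls on the hypostatic side."""
    return (raw > cut) == h_is_high


def language_score(raw: Dict[str, float], cuts: Dict[str, Tuple[float, bool]]) -> int:
    """Number of items passed by one S within a category of measures."""
    return sum(passes(raw[k], *cuts[k]) for k in cuts)


if __name__ == "__main__":
    cut, h_is_high = best_cut([1, 2, 2, 3], [3.5, 4, 5, 6])
    print(cut, h_is_high)  # 3.25 False
```

With cuts fixed per measure, a high `language_score` corresponds to the performance expected of the hypostatic group.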

Distribution and significance values for total
“language scores” are shown in Table 3.
“Predictive category of measures” refers to
the three categories in which the language
measures were grouped according to their
separate purposes in testing the specific and
general hypotheses listed above. Category I,
major-predictor measures, is comprised of 15
separate language measures whose proportion
of discrimination between hypostatic and
autistatic groups equalled or exceeded 70:30
in the item analysis; this group of measures
constitutes the major predictor for the specific
hypotheses of language behavior.⁸

Category II, group summation measures, is
composed of 12 of the 13 measures listed in
Class H in Table 2; as raw score summations
per individual S of the separate measures
within each of the common rationale groups,
these Category II measures, together, consti-
tute the predictor for the general hypotheses
of language behavior for hypostatic and
autistatic groups of Ss. Each of the language
measures in Category II also provided a
proportion of discrimination between the cri-
terion groups equalling or exceeding 70:30
in the item analysis study.⁹ The remaining 12
language measures out of the 39 used were
[...] study. The language scores for the Category
III measures are not shown for the item analy-
sis, but were computed in the cross-validation
study and are shown in Table 4.

All chi-square values (for both item analyst 
and cross-validation studies) were corrected 
for continuity by Yates's correction (8). 

While these values seem high, they, of course,
do not constitute evidence for or against a
hypothesis of relationship between the criterion

8 The separate language measures in this category
are Nos. 2, 3, 4, 6, 7, 9, 12, 13, 14, 16, 19, 21, 23, [...]
and 26. For a description of each of these measures,
see Table 2.

9 Measure No. 36 was omitted from Category II
as the only raw score summation measure which pro-
vided a proportion of discrimination less than 70:30
in the item analysis study.




TABLE 4

DISTRIBUTION, SIGNIFICANCE, AND CORRELATION ESTIMATE VALUES FOR TOTAL “LANGUAGE SCORES” IN THE
CROSS-VALIDATION OF LANGUAGE MEASURES FOR HYPOSTATIC AND AUTISTATIC GROUPS
(N = 28)

                                                                  TOTAL NUMBER    EFFECTIVENESS OF PREDICTIONS
PREDICTIVE CATEGORY OF                                            OF LANGUAGE     H GROUP          A GROUP
LANGUAGE MEASURES                                                 MEASURES        RIGHT   WRONG    RIGHT   WRONG    χ²      p       r(bis)

I.   Total—major-predictor measures                               15              10      4        12      2        7.15    <.01    .69
II.  Total—raw score summation (Class H) measures*                12              9       5        11      3        3.65    <.10    .46
III. Total—minor-predictor measures                               12              6       8        12      2        1.58    <.30    .20
IV.  Summation—language scores in Categories I and II†            27              10      4        11      3        5.17    <.05    .59
V.   Summation—language scores in Categories I and III†           27              10      4        12      2        7.15    <.01    .62
VI.  Summation—language scores in Categories I, II, and III†      39              10      4        11      3        5.17    <.05    .58

* Class H measures are those used to test the general hypotheses of language behavior (cf. Table 2).
† Refers to the summation per individual of language scores in the respective categories of measures.


and predictor measures beyond the limits of 
the item analysis sample. 

Cross-validation. In the cross-validation
analysis, the cutting score used for each
language measure was the mean frequency
for the total group of Ss in the cross-validation
study. Thus, once the direction of the predic-
tion for each measure had been determined, a
simple comparison of results for each measure
against assumed 50:50 chance distributions
could be made.

Table 4 shows the results of these predictions
of language behavior for the hypostatic and
autistatic groups. The entries in this table cor-
respond to those of Table 3, except for the
added summations of the
language scores grouped into Categories I, II,
and III; the intent of these summations was
to provide an additional rough test of the 
interrelatedness of all behavior potentials 
subsumed under all of the specific and general 
hypotheses concerning language behavior. 

An examination of Table 4 indicates that 
the chi-square values obtained for the language 
measures in Category I (the major predictor) 
and for Category V (a summation of all of the 
separate measures) would have occurred by 
chance less than one time in a hundred. 
Similarly, the results for Category IV (sum- 
mation of the measures initially providing the 
greatest discrimination between criterion 
groups) and Category VI (summation of lan- 
guage scores for all measures) have p values of
less than .05. The chi square for the results of
the measures in Category II (raw score sum-
mation of groups of measures with a common
rationale) falls just below the value (3.841)
required for a .05 level of confidence. The chi
square for Category III (minor predictors) is 
not sufficiently high to support a generaliza- 
tion. The biserial correlations between the 




criterion and each category of language 
measures range from .20 to .69, varying con- 
sistently with the chi-square distribution. 
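The 3.841 criterion quoted above is the .05 critical value of chi square with one degree of freedom. As a minimal sketch, assuming each comparison is a fourfold (one degree of freedom) table, the following shows how a chi-square value maps to a probability. It uses only the Python standard library; the function name is illustrative and is not taken from the article.

```python
import math

def chi2_sf_df1(x):
    """Upper-tail probability of a chi-square variate with 1 degree of freedom.

    For 1 df, chi square is the square of a standard normal deviate, so
    P(X > x) = P(|Z| > sqrt(x)) = erfc(sqrt(x / 2)).
    """
    return math.erfc(math.sqrt(x / 2.0))

# Critical values for the .05 and .01 levels (1 df).
print(round(chi2_sf_df1(3.841), 3))  # 0.05
print(round(chi2_sf_df1(6.635), 3))  # 0.01
```

A chi square just below 3.841, as reported for Category II, therefore corresponds to a probability slightly above .05.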
These results are based upon a prediction 
of both direction and absolute distribution 
values obtained for each measure. If one 
imposes the somewhat less stringent require- 
ment to examine only for direction (i.e., for 
the total number of measures which provided 
some amount of correctly predicted discrimina- 
tion), he finds that 32 of the 39 measures were 
successful; that is, only seven measures had 
distributions opposite to that predicted, or 
provided no discrimination at all.¹⁰ The t value
for the difference between this proportion of
.82 and a hypothetical proportion of .50 is
4.00, which is well above the 2.714 t value
required for a p of .01. A similar result would
be obtained for the proportion of measures
showing a difference between means of the
criterion groups in the predicted direction. 
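The arithmetic behind this comparison can be checked with a short sketch. It assumes the standard error is computed under the hypothetical proportion of .50, which the article does not state explicitly; the function name is illustrative.

```python
import math

def proportion_ratio(successes, n, p0=0.5):
    """Return the observed proportion and its critical ratio against p0.

    The standard error is taken under the null proportion p0 (an assumption).
    """
    p_hat = successes / n
    se = math.sqrt(p0 * (1.0 - p0) / n)
    return p_hat, (p_hat - p0) / se

# 32 of the 39 language measures discriminated in the predicted direction.
p_hat, ratio = proportion_ratio(32, 39)
print(f"{p_hat:.2f} {ratio:.2f}")  # 0.82 4.00
```

Under that assumption the sketch reproduces both the proportion of .82 and the reported value of 4.00.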


Discussion 


Methodological considerations. A preliminary 
study (7) of the OSS pointed to a relatively 
high consistency in the subareas of con- 
ceptualization employed by Ss in the sorting 
and resorting phases of the task. That is, 
when E instructed Ss, after the completion of
the initial phase in the OSS, to try to resort 
the objects in a different way, they usually 
revised their groupings by forming others 
which were predominantly within the same 
conceptual area. However, these results were 
not considered as conclusive because of the 
small N employed. 

No direct estimates of reliability for the 
language measures were attempted. However, 
there was indirect evidence of consistency not 
only from the proportionate drop in the dis- 
criminative power of each of the separate 
language measures from the item analysis to 
the cross-validation studies, but also from the 
relative chi-square values obtained for the 
several predictive categories; thus, the lower
the discriminative power in the item analysis 
study for each predictive category of measures, 
the lower the obtained chi squares tended to 
be in the cross-validation study. 

The results obtained in the OSS did not 


10 These seven language measures were Nos. 3, 5,
15, 16, 24, 27, and 37.


L. S. McGAUGHRAN 


, The obtained results for language measures! 


meet the assumptions of normal distribution
and continuity upon which the biserial correla-
tion technique is based, nor was the N of
appropriate size. For a number of reasons,
though, this technique seemed to provide a
somewhat more appropriate estimate of degree
of relationship than other correlation tech-
niques which would have been operationally
possible. However, under these circumstances
the test of the null hypothesis was based upon
chi-square values, and the correlation esti-
mates are considered only roughly descriptive
of the obtained group differences.

Theoretical considerations. The degree of
success in predicting differences in language
behavior between hypostatic and autistatic
groups suggests that “conceptual area” might
be more useful for some purposes than “con-
ceptual level,” in the sense that the latter
term would not have supported such a predic-
tion; that is, a classification of the object
sortings into concrete and abstract levels
would not have differentiated between closed
public and private concepts, since both types
of concepts would have been classified in
most cases as concrete. Further, since there
was not a significant difference in test intel-
ligence among hypostatic, autistatic, and
metastatic groups in the sample used in this
study, it might be inferred that the demon-
strated differences in conceptual behavior


conceptual levels, or abilities, among the
groups.

An a priori assumption was made that the
consistent differences in conceptual behavior
might be associated with more general behavior
potentials—perhaps defensive in nature—of
reality-fixing and autism. Other, more specific
behavioral terms were assumed to be associated
with reality-fixing and autism and were used
as loose guides in setting up general and specific
hypotheses of differences in language behavior.


Predictive Categories I and III were used to
test the specific hypotheses; the general hy-
potheses were tested by evaluating the results
for the Category II measures. The summations
of language scores in Categories IV, V, and
VI were intended to provide an over-all test
of interrelatedness for all of the language be-
haviors which were measured.
While the results seem to warrant a conclusion


PREDICTING LANGUAGE BEHAVIOR FROM OBJECT SORTING


of considerable interrelatedness of conceptual
behaviors (including language usage)
within the experimental groups, it is felt that 
certain limitations exist which would restrict 
the interpretation of the rationale underlying 
the language measures. 

1. The behavior potentials cited in each 
hypothesis of language behavior are not pre- 
cisely defined beyond the operations of the 
present study. 

2. Since the language measures were not 
separately validated, some of the measures
may bear no relationship at all to the criteria;
the probability is, however, that most of them 
do because all but seven of the measures 
showed some amount of predicted discrim- 
inative value. 

3. Each measure, as but a single hypothetical 
expression of a more general assumption of 
behavioral consistency, does not in itself 
constitute a full test of that assumption. 


SUMMARY 


Two dimensions of conceptualization, free- 
dom and extensionality, were postulated; 
from these postulations, it was hypothesized 
that the type of conceptual behavior pre- 
viously identified as concrete could be demon-
strated to occur in two more or less mutually
exclusive conceptual areas—hypostatic and 
autistatic. With the use of these constructs, 
two experimental groups were separated on 
the basis of consistent differences in concept 
formation in an object-sorting situation. The 
validity of this separation was tested by 
statistically evaluating the significance of
predicted group differences in language usage.
Since significant differences in language be-
havior were found to exist as predicted, it was


concluded that conceptual area might be a 
more useful concept for some purposes than 
conceptual level in that the latter term would 
not have supported such a prediction. The 
degree of interrelatedness which was found to 
exist between conceptual and language be- 
haviors and among the language behaviors 
themselves suggested the existence of more 
general behavior potentials, which were 
termed reality-fixing and autism. 


REFERENCES 


1. Bolles, Marjorie. The basis of pertinence. Arch.
Psychol., 1937, No. 212.

2. Goldstein, K. Human nature in the light of psycho-
pathology. Cambridge: Harvard Univer. Press,
1947.

3. Goldstein, K. Language and language disturbances.
New York: Grune & Stratton, 1948.

4. Hanfmann, Eugenia, & Kasanin, J. Conceptual
thinking in schizophrenia. Nerv. ment. Dis.
Monogr., 1942, No. 67.

5. Jeffreys, A. W., Jr. An exploratory study of per-
ceptual attitudes. Unpublished doctor's disserta-
tion, Univer. of Houston, 1953.

6. Klein, G. S., & Schlesinger, H. J. Perceptual
attitudes toward instability: I. Prediction of
apparent movement experiences from Rorschach
responses. J. Pers., 1951, 19, 289-302.

7. McGaughran, L. S. Dimensions of conceptualiza-
tion: a preliminary experimental analysis of
relationship between "conceptual areas" and
language behavior. Unpublished doctor's dis-
sertation, Ohio State Univer., 1950.

8. McNemar, Q. Psychological statistics. New York:
Wiley, 1949.

9. Rapaport, D. Diagnostic psychological testing.
Vol. 1. Chicago: Year Book Publishers, 1945.

10. Thompson, J. Ability of children of different grade
levels to generalize on sorting tests. J. Psychol.,
1941, 11, 119-126.

11. Weigl, E. On the psychology of so-called proc-
esses of abstraction (trans. by M. J. Rioch).
J. abnorm. soc. Psychol., 1941, 36, 3-33.


Received December 13, 1952. 


THE GENERALIZATION OF EXPECTANCIES¹


RICHARD JESSOR²
University of Colorado 


The general purpose of the present study
was to determine the conditions under 
which a change in expectancy in a given 
situation tends to generalize to other situations. 
The hypothesis, derived from the social learn- 
ing formulations of Rotter (10, 11), was that 
the generalization of changes in expectancy 
from one situation to other situations de- 
pended, at least in part, on the goal-relatedness 
of these situations to each other. The greater 
the commonality of goals which any two situa- 
tions have, the greater the generalization of 
changes in expectancy from one situation to 
the other. In Rotter's system, the occurrence 
of a response in a specific situation is assumed 
to be a function, in part, of the expectation 
that it will be followed by or lead to the oc- 
currence of certain goals or external reinforce- 
ments. If the goals do occur or are attained 
(success), the expectancy for that response is
increased. If the goals do not occur or are not 
attained (failure), the expectancy decreases. 
Expectancy refers to the subject's (S’s) in-
ternal probability that a certain event (rein-
forcement) will occur and has been opera-
tionally defined in a series of studies by such
referents as a verbal statement by S (7), and 
the observed betting behavior of S in a gam- 
bling situation (2). The similarity of these 
views to those of Tolman (13), Brunswik (1), 
and to Lewin's concept of subjective proba- 
bility (8) may be readily seen. 
If success and failure change expectancies in
a specific situation, the question might be
asked whether such an effect is limited to
that specific situation alone. We see, in every-
day life, examples of a person doing well or
poorly in one situation and then expecting to
do well or poorly in others; e.g., the college
student who is refused a date may expect to
be rejected in other social situations.

To deal with such generalization phenomena


tional relatedness* of behaviors or responses.
Responses are considered to be functionally re-
lated when they have led in the past to the


functional relatedness of responses is presumed
to mediate the generalization of changes in ex-
pectancies when any given response is rein-
forced. A parallel concept in a nonexpectancy
theory is, of course, Hull's (6) habit-family
hierarchy wherein the reinforcement of one
member of the family is shared by the other
members.

¹ This article is based upon a portion of a disserta-
tion submitted in partial fulfillment of the require-
ments for the degree of Doctor of Philosophy in the
Department of Psychology at The Ohio State University.
The author wishes to express his indebtedness to his
advisor, Dr. Julian B. Rotter, for his constant stimula-
tion and helpful criticism throughout the course of the
research. Thanks are due also to Drs. Delos D. Wickens
and Boyd R. McCandless for their assistance during the
investigation. A summary of this paper was read at the
1952 meeting of the American Psychological Associa-
tion in Washington, D. C.
² The author was a predoctoral Fellow of the Social
Science Research Council during the period in which
this experiment was carried out.

The problem investigated was the influence
of success and failure (positive and negative
reinforcement) upon expectancies in the same
situation and in related situations. The results
to be presented will deal with the change in ex-
pectancies in related situations after success or
failure in the original situation. On the basis
of the foregoing theoretical considerations, the
hypothesis was formulated that the degree


vary with the degree of goal-relatedness of the
subsequent situations to the original reinforced
situation.

METHOD


Subjects. A total of 132 male Ss, selected from intro-
ductory classes in psychology, speech, and E.
performed the experiment. The majority of the students
were freshmen. Seven Ss were discarded on the basis
of criteria which preceded the introduction of the in-
dependent variable. Only males were used because one
of the tasks in the experiment involved physical skill
and it was felt that success or failure in this area would
be rather a peripheral experience for most female
college students.


* The nature of the responses which are functionally
related for any individual depends upon his past learn-
ings and is not intrinsic to the responses themselves.
While ultimately it is necessary to study functionality
for each individual, the problem may be approached
at this stage by assuming, for certain groups, a
similar functionality based upon some degree of uni-
formity of social learning.




Tasks. The selection of four differentially goal- 
related tasks constituted one of the problems of the 
research. For experimental purposes it was decided to 
deal with subgoals rather than higher-order goals or 
needs. The following subgoals were employed: recog-
nition for academic skill, recognition for physical skill,
and love and affection from the opposite sex. They were 
chosen because of their meaningfulness to a male col- 
lege population and to enable comparison with a pre- 
vious study by Crandall (4). 

With the above goals in mind, four tasks were se- 
lected, each of which might be expected to involve 
primarily, for the groups being used, one of these goals. 
In addition, special instructions were utilized in de- 
scribing each task to structure it around the specific 
subgoal desired. The first task was a series of eight 
verbal arithmetic problems typed on a sheet of paper. 
The problems were of increasing difficulty and required 
only arithmetical computation. It was felt that this 
task, by its learned cultural implications for an aca- 
demic group, and by virtue of the instructions accom- 
panying it, e.g., “this task predicts success in college," 
involved the goal of recognition for academic skill. The
arithmetic task was used as the original task for all Ss
and as the task on which success or failure was manipu-
lated.

The three remaining tasks constitute the generaliza-
tion tasks. Of the three tasks, one was intended to in-
volve the same subgoal as the reinforced arithmetic
task. This task was described as a measure of vocabu-
lary knowledge and consisted of a deck of cards with
anagrams of increasing difficulty on them. Since it is
of a traditional academic nature, and by virtue of in-
structions, e.g., “this task measures your general
knowledge,” it was presumed that this task would also
involve the need for recognition for academic skill and,
thereby, be most goal-related to the arithmetic task.

The second generalization task consisted of an epi-
cyclic pursuit rotor whose speed could be controlled.
This task, by requiring manual participation and by
use of instructions, e.g., “this is a test of physical co-
ordination and athletic skill,” was presumed to involve
the goal of recognition for physical skill.


an adjoining,


from the opposite


was selected and structured not to involve,
or, at least, minimally to involve, the goal of


the basis of the foregoing
considerations concerning


it was predicted
that reinforcement on the arithmetic
task should result


from most to least in the
order: arithmetic task, vocabulary task, pur-


The level-of-aspiration para-


this experiment. The aim of the


study was described to S as an attempt to find out how 
accurately people can predict their performance in 
a variety of situations. The four tasks were described 
in detail, including the information that possible scores 
on each task ranged from 0 to 50. The S was asked to 
predict, as accurately as possible, the score he would 
get on each task and was told that he would have an 
opportunity to make new predictions for all tasks when- 
ever he finished one of them. 

The S first performed the arithmetic task; he was 
given a predetermined score which was interpreted by 
means of a table of false norms, and was told, “Well, 
let’s pause for a moment and go through and make these 
predictions again for each task. You predicted —— 
last time. You got ——. What do you predict now?" 
This question was asked for each task in the order 
arithmetic, pursuit rotor, vocabulary, and social task. 

At this phase all the relevant data for the experi- 
ment were completed, that is, predictions of per- 
formance scores on four tasks before and after re- 
inforcement on one of them, the arithmetic task. It is 
necessary at this point to indicate the relationship of 
such data to the purpose of the experiment. 

The S's answer to the request that he predict his 
score is the referent for the construct of expectancy.
It is, however, an indirect and relative measure of that 
construct, i.e., it does not provide us with a probability 
value. Instead, the score which S predicts out of a range 
of 0 to 50, while under instructions to be accurate, may 
be considered the score in that range which has, for 
S, the highest subjective probability of occurrence. 
While we do not know the absolute value of the ex- 
pectancy, we know it is higher than the expectancy for 
any other score. After success or failure on the arith- 
metic task, the subject may predict a different score 
for some of the tasks. We assume from this that this 
new score now has the highest expectancy of occur- 
rence. The prediction of a different score than the first 
score predicted, after success or failure, is the referent 
for the measure of expectancy change. It is the generali- 
zation of these changes which is the problem under in- 
vestigation. 
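The measure described above reduces to a simple comparison of each S's two predictions for a task. A minimal sketch follows; the prediction pairs are hypothetical values invented for illustration, and the function names are not from the article.

```python
def changed(before, after):
    """An expectancy change is scored when the second prediction differs from the first."""
    return before != after

def proportion_changed(pairs):
    """Proportion of Ss whose predicted score changed after reinforcement."""
    return sum(changed(before, after) for before, after in pairs) / len(pairs)

# Hypothetical (before, after) predictions on one task, each within the 0 to 50 range.
vocabulary_predictions = [(30, 30), (35, 28), (40, 40), (25, 20)]
print(proportion_changed(vocabulary_predictions))  # 0.5
```

Note that this scoring records only whether a change occurred, not its direction or size, which matches the indirect and relative character of the measure.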

General procedure. Each S was seen individually.
The S was seated at a table on which were the pursuit 
rotor, two decks of cards containing the anagrams, and 
two sheets of paper containing arithmetic problems.
The E sat at another table, part of which was shielded 
from S’s vision. After random assignment of S to one 
of the experimental groups, E explained the procedure 
and read the instructions. After stating an expectancy 
score for each of the four tasks, Ss performed the arith- 


* The S was also asked at the outset what score was 
the lowest one he would still be satisfied in getting.
This score was utilized by E as the basis for manipulat- 
ing success and failure and is called the minimal goal 
score. Depending on what reinforcement group he was 
assigned to, S was told a false performance score which 
was a specified number of points above or below his 
minimal goal. In the experiment there were a success 
group, a weak failure group, and a strong failure group, 
all defined in terms of the direction and size of the dis- 
crepancy of the reported performance score from the 
minimal goal score. This procedure was an attempt to 
assure that success or failure was defined within S's 
own framework rather than arbitrarily. For purposes
of the hypothesis dealt with in this article, these groups 
are combined and treated as a whole. 




metic task and received predetermined scores accord- 
ing to the group to which they were assigned. Scores 
were manipulated by false timing and the use of com- 
plex looking tables of norms. The Ss then gave the 
second set of expectancy scores. This completed the
experimental aspect of the situation. All Ss then per- 
formed either the pursuit rotor or vocabulary task or 
both and were allowed to succeed on these in order to 
overcome the effects of the prior arithmetic failure. 
This did not, however, constitute part of the experi- 
mental procedure. That the failure conditions were 
meaningful was quite apparent from the distress evi- 
denced by most Ss. 

After this experience of success, S was told that the 
half hour was up, and, since another S was already 
waiting, it would be necessary to omit the social at- 
tractiveness task. The S was asked not to discuss the 
experiment with anyone, and no S admitted to prior 
knowledge of the experiment when questioned. 


RESULTS 


The analysis of the data was made in terms 
of number of Ss, out of the total group of 125, 
who changed their expectations on a given 
task. The chi-square test for correlated pro- 
portions (9, p. 206) was used to test the sig- 
nificance of the differences between propor- 
tions that changed on the generalization tasks. 
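The test for correlated proportions depends only on the Ss who changed on one task but not on the other. The sketch below uses McNemar's formula without a continuity correction, which is an assumption about the exact computation used here. The article reports marginal proportions only, so the discordant counts are hypothetical; they were chosen so that their difference, 11, equals the .088 difference between the vocabulary and pursuit rotor proportions multiplied by N = 125.

```python
import math

def mcnemar_chi2(b, c):
    """Chi square (1 df) for correlated proportions.

    b: Ss who changed on task A but not on task B
    c: Ss who changed on task B but not on task A
    """
    return (b - c) ** 2 / (b + c)

def chi2_sf_df1(x):
    """Upper-tail probability of chi square with 1 degree of freedom."""
    return math.erfc(math.sqrt(x / 2.0))

# Hypothetical discordant counts (not reported in the article).
b, c = 32, 21
chi2 = mcnemar_chi2(b, c)
p = chi2_sf_df1(chi2)
print(round(chi2, 2), round(p, 2))
```

Because Ss who changed on both tasks or on neither drop out of the formula, the same marginal difference can yield quite different probabilities depending on how many Ss were discordant.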

That the reinforcement procedure was ef- 
fective is evidenced by the fact that on the 
directly reinforced task, the arithmetic task, 
88.8 per cent of the total group changed their 
expectancies after success or failure. The main 
concern of the present report, however, is 
whether or not expectancy changes generalize, 
and the nature of this generalization. 

The data presented in Table 1 indicate quite 
clearly that generalization has taken place in 
the experiment, that is, that the effect of suc- 
cess or failure on the arithmetic task was not 
confined to that task alone. It may be seen 
from Table 1 that, for all three of the general- 
ization tasks, some Ss changed expectancies. 
This finding provides the necessary basis for 
the analysis required to test our hypothesis. 

TABLE 1
PROPORTION OF TOTAL GROUP (N = 125) WHO CHANGED EXPECTANCIES
ON THE THREE GENERALIZATION TASKS

TASK                      PROPORTION WHO CHANGED EXPECTANCY
Vocabulary                .448
Pursuit rotor             .360
Social attractiveness     .248


TABLE 2
DIFFERENCES AMONG GENERALIZATION TASKS IN PROPORTION OF
TOTAL GROUP (N = 125) WHO CHANGED EXPECTANCIES

TASK                      VOCABULARY        PURSUIT ROTOR
Social attractiveness     .200 (<.001)*     .112 (<.02)*
Pursuit rotor             .088 (.13)*

* Values in parentheses represent probability values for the
chi-square test of significance of differences between correlated
proportions.


According to our hypothesis, generalization-
of-expectancy changes should be mediated by
the goal-relatedness of the nonreinforced tasks 
to the reinforced task. Our analysis of the goal 
qualities or characteristics of the generaliza- 
tion tasks used in this experiment indicated 
that the vocabulary task was most goal-related 
to the arithmetic task, the pursuit rotor next, 
and the social attractiveness task least. We 
should therefore expect to find that the pro- 
portion of Ss who change expectancies de- 
creases in the order vocabulary, pursuit rotor, 
and social task. Table 1 confirms our predic- 
tions, the respective proportions being .448, 
.360, and .248. To determine whether these
proportions were significantly different from 
each other, the analysis presented in Table 2 
was made. 
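The entries of Table 2 follow directly from those of Table 1. A minimal sketch, assuming only the three reported proportions and N = 125, recovers the number of Ss behind each proportion and the pairwise differences; the variable and function names are illustrative.

```python
N = 125  # total group

# Proportions who changed expectancy, as reported in Table 1.
proportions = {
    "vocabulary": 0.448,
    "pursuit rotor": 0.360,
    "social attractiveness": 0.248,
}

# Number of Ss implied by each proportion.
counts = {task: round(p * N) for task, p in proportions.items()}
print(counts)  # {'vocabulary': 56, 'pursuit rotor': 45, 'social attractiveness': 31}

def difference(task_a, task_b):
    """Difference between two Table 1 proportions, rounded to three places."""
    return round(proportions[task_a] - proportions[task_b], 3)

print(difference("vocabulary", "social attractiveness"))     # 0.2
print(difference("pursuit rotor", "social attractiveness"))  # 0.112
print(difference("vocabulary", "pursuit rotor"))             # 0.088
```

The three differences agree with the values of .200, .112, and .088 given in Table 2.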

Table 2 indicates that the differences among 
the tasks are significant in the predicted direc- 
tion. Some question might be raised concerning 
our acceptance of a p of .13 between the vo-
cabulary and the pursuit rotor tasks as sig- 
nificant. This value as it stands is not too 
discrepant from the generally accepted .05 
level to indicate a trend toward significance. 
In addition, however, a further analysis of the
data, excluding the 15 Ss making up the suc- 
cess group, yielded a difference between the 

vocabulary and the pursuit rotor tasks which
was significant at the .05 level. The significance
of the differences among the other tasks, in
this analysis, remained substantially the same
as given in Table 2. The reason for this analysis,
excluding the weak success group, was due to 
certain methodological problems which arose 
during the experiment. Success seemed to have 
less face validity for Ss than failure did for Ss 
in the failure groups. For these reasons the 
data for the success group were not considered 
as valid as the data for the failure conditions. 
The relationships found for changes in expect-
ancies are very similar, however, including or
excluding the success group.


Discussion 


The major hypothesis, that it is possible 
to predict generalization along a dimension 
of functional relatedness of behavior or learned- 
goal similarity, derives support from the 
experimental findings. This may be considered 
one of the important dimensions for which 
Tolman (12) emphasizes a need in psychology. 
At this point it becomes necessary to consider 
several points which indicate the rudimentary 
nature of our findings and the importance of 
further research. 

Although the utility of a dimension of 
similarity of goals in predicting generalization 
seems to have been demonstrated, it must be 
emphasized that we have no knowledge what- 
soever of the nature of the units along that 
dimension. The research has ordered three 
tasks along this dimension, but only relative 
to each other and to the arithmetic task. Until 
some sort of scaling procedure is developed 
whereby we can predict upon the basis of scale 
units rather than relative position, it will not 
be possible to describe in mathematical terms 
the nature of the generalization function. 

The writer is aware of the fact that the goal 
characteristics of the generalization tasks were 
not independently measured. The question 
may be raised, e.g., “how do you know the 
vocabulary task really involved the need for 
recognition for academic skill?” To some extent 
this question is not meaningful. A system of 
personality adopts certain constructs on the
basis of their utility in prediction. If the need 
or goal constructs, as defined and applied in 
this study, prove to be predictive, that seems 
the only necessary criterion for their utility. 
To ask whether these needs really were present 
is partly to lose sight of the nature of the con-
structs. The aspect of the question which would
be meaningful is whether this goal description
of each task was suitable for all Ss. On this
point, it would be necessary to agree that our 
procedure assumed that Ss have a certain
homogeneity of social learning experiences, an
assumption which, of course, would not be
strongly defended. For preliminary research
relying on group differences it was apparently
useful. Future research could well refine this
aspect, by using Ss with known social learning
backgrounds for whom the goals involved in
any situation would be known with a high de-
gree of accuracy, instead of relying upon gross
cultural uniformities.

A further point related to the above is 
whether stimulus similarity was fully controlled 
in the experiment. It is not possible to affirm 
this, and it may be that some of the tasks were 
more similar to the arithmetic task on a 
physical basis than others were. If this were 
so, it would enable an explanation of the ex- 
perimental findings on the basis of stimulus 
generalization. Such an explanation is felt to 
be untenable in light of the conditions of the 
experiment, and the writer cannot conceive 
that the predictions made could also have 
been made solely on the basis of stimulus 
generalization. In addition, a study done sub- 
sequent to the present one, but with a similar 
hypothesis, controlled the factor of stimulus 
similarity. Chance (3), in her study, used 
identical task situations but varied their goal 
character by instructions and attained results 
in agreement with those of the present experi- 
ment. 

A final limitation of the design needs to be 
noted. The order in which all Ss were asked 
to state their expectancies and minimal goals 
was not rotated among the tasks but always 
began with the arithmetic task and followed 
with the pursuit rotor, vocabulary, and social 
attractiveness tasks. If order alone, however, 
were determinative of the degree of generaliza- 
tion, then the fact that more generalization 
occurred to the vocabulary task than to the 
pursuit rotor would be inexplicable. Order, per 
se, cannot account for our findings. 

The results of the present study are related 
to two others. Besides Chance's corroborative 
study mentioned above, the research of 
Crandall (4), which preceded the present 
experiment, approached the major hypothesis 
in a different manner. Crandall utilized TAT- 
type pictures selected on the basis of their 
need-related stimulus value. The pictures 
dealt with the areas of recognition for physical 
skill, recognition for academic skill, and af- 
fection from opposite sex peers. The Ss told 
stories to the pictures before and after failure 
in a physical skills situation. A control group 
had a rest pause between sessions. His findings, 
using ratings of the stories (5), showed the 
greatest change on the physical skills pictures, 
next on the academic skills pictures, and least 
on the affection pictures. While his results were 
in the predicted direction, the differences in 
amount of change between the three picture
areas were not statistically significant. The
present study, utilizing a different methodology, 
deals with the same problem investigated by 
Crandall. 

The implications of the present problem for
theory and practice seem apparent. Where 
the construct of expectancy is central to a 
personality theory, an understanding of the 
conditions under which expectancies change 
becomes of crucial importance. The knowledge 
that expectancy changes generalize, at least 
in part, according to the goal-relatedness of 
situations would seem to be of importance to 
the clinician as well as the theorist. 


SUMMARY 


The present study was an attempt to test 
a hypothesis derived from Rotter's formula- 
tions of a social learning theory of personality. 
Within that theory, the occurrence of a re- 
sponse is considered to be in part a function 
of the expectancy that the response will be 
followed by or lead to a specific goal or rein- 
forcement. Expectancies are presumed to 
change on the basis of two factors: (a) whether 
the specific response is or is not followed by 
the reinforcement, and (b) by generalization 
from changes in the expectancies of function- 
ally related behaviors or responses. The con- 
cept of functional relatedness implies that the 
responses lead to the same or similar goals. 
Hence, stated another way, generalization of 
expectancy changes is presumed to occur along 
a dimension of learned-goal similarity.

To test this latter proposition, the level-of- 
aspiration paradigm was utilized. One hundred 
twenty-five Ss made predictions of their per- 
formance on four tasks before and after failure 
or success on one of them, the arithmetic task. 
Change in expectation statements after success 
or failure was the indirect and relative measure 
of expectancy change. The four tasks were 
selected on the basis of their presumed goal 
potential for the subculture of college males. 
In addition, instructions were used to delineate 
further the primary goal involved in each task. 
The four tasks were an arithmetic test, a 
vocabulary test, a pursuit rotor task, and a 
social interview with a member of the opposite 
sex. These tasks were presumed to involve, 
respectively, the goal of recognition for aca-
demic skill, the goal of recognition for academic
skill also, the goal of recognition for physical
skill, and the goal of love and affection from
the opposite sex. It was predicted that changes




in expectancies after success or failure on the 
arithmetic task would generalize to the other 
three tasks on the basis of their goal-related- 
ness to it. Since goal-relatedness to the arith- 
metic task varied in the order in which the 
tasks are listed above, it was predicted that 
the degree of generalization would vary in the 
same decreasing order. 

The results were dealt with in terms of pro- 
portion of Ss who changed their expectancies 
on the three generalization tasks. The data 
corroborated the hypothesis and indicated that 
the three tasks were significantly different from 
each other in relative position along the dimen- 
sion of goal similarity. The findings were dis- 
cussed in terms of limitations and areas of 
further research. 


REFERENCES 

1. BRUNSWIK, E. Organismic achievement and en-
vironmental probability. Psychol. Rev., 1943,
50, 255-272.

2. CASTANEDA, A. A method for measuring expectancy
as conceived within Rotter’s social learning
theory of personality. Unpublished master’s
thesis, Ohio State Univer., 1951.

3. CHANCE, JUNE. Generalization of expectancies as
a function of need-relatedness. Unpublished
doctor’s dissertation, Ohio State Univer., 1952.

4. CRANDALL, V. J. A preliminary investigation of the
generalization of experimentally induced frustra-
tion in fantasy production. Unpublished doctor’s
dissertation, Ohio State Univer., 1950.

5. CRANDALL, V. J. Induced frustration and punish-
ment-reward expectancy in thematic appercep-
tion stories. J. consult. Psychol., 1951, 15, 400-

6. HULL, C. L. Mind, mechanism and adaptive be-
havior. Psychol. Rev., 1937, 44, 1-32.

7. JESSOR, R. A methodological investigation of the
strength and generalization of verbal reinforce-
ment. Unpublished doctor’s dissertation, Ohio
State Univer., 1951.

8. LEWIN, K., DEMBO, TAMARA, FESTINGER, L., &
SEARS, PAULINE S. Level of aspiration. In
J. McV. Hunt (Ed.), Personality and the behavior
disorders. New York: Ronald, 1944. Pp. 333-378.

9. MCNEMAR, Q. Psychological statistics. New York:
Wiley, 1949.

10. ROTTER, J. B. Tentative formulations of basic
principles for a social learning theory of person-
ality. Unpublished manuscript, Ohio State
Univer., 1948.

11. ROTTER, J. B. Tentative formulation of some social
learning constructs. Unpublished manuscript,
Ohio State Univer., January, 1950, June-July,

12. TOLMAN, E. C. A cognition motivation model.
Psychol. Rev., 1952, 59, 389-400.

13. TOLMAN, E. C., & BRUNSWIK, E. The organism
and the causal texture of the environment.
Psychol. Rev., 1935, 42, 43-77.


Received December 15, 1952. 




EFFECTS OF DECISION MAKING BY GROUP MEMBERS ON RECALL 
OF FINISHED AND UNFINISHED TASKS¹


MURRAY HORWITZ AND FRANCIS J. LEE 
University of Illinois 


CURRENT theories of motivation have
developed mainly from studies which
have treated the individual in isolation
from a social group. In these studies, as the
goal-striving individual moves toward goals 
or away from avoidances, his environment 
serves as a fixed frame of reference against 
which changes of position are measured. It is 
possible, however, to conceive of a fixed en- 
vironment as only the limiting case of the 
more general condition in which, to a greater 
or lesser degree, positional relationships within 
the environment are changing. Such a con- 
ception of a changing or “active” environment 
seems to be necessary if we consider an indi- 
vidual’s situation as a member of a social 
group. By virtue of action undertaken by the 
group, the individual may find himself in a 
new position relative to his goals or avoidances 
even though he has himself been completely 
quiescent. Thus the individual as a group mem- 
ber may find himself moved toward or away 
from either goals or avoidances by “the ground 
moving under his feet.” If he is located in an 
environment with these escalator-like prop- 
erties, his psychological situation can be said 
to consist on the one hand of positions through 
which he can locomote, as is assumed in 
studies of isolated individuals (11), and on the
other hand of positions through which he will 
to some degree be “carried” by the group. 

¹ This study was carried out at the Bureau of Edu-
cational Research, College of Education, University of
Illinois, and supported in part by a research grant
from the Human Relations Branch, Office of Naval
Research, U. S. Navy (Task No. NR 172-201). The
authors are indebted to William M. Peterson for
assistance in the conduct and analysis of the experi-
ment.

The study reported here is the second in
which we are investigating the consequences
for motivational theory of examining the
individual in an active social environment.
The problem dealt with in this paper grows
out of findings of a previous experiment (9)
in which one of the authors investigated the
effects upon a person’s need tensions of being
“carried” toward or away from goals or
avoidances by group action. In this earlier
investigation, individuals working in a group, 
but voting independently of one another, were 
asked to indicate whether they desired to 
finish or to avoid finishing each of a series of 
puzzles which the group had begun. In some 
cases where subjects (Ss) voted to finish work 
on a puzzle, the group completed this task, 
while in other cases the work was left un- 
finished. Similarly, where an S voted to avoid 
work on the puzzle, he might find the group 
acting in line with his vote or in opposition 
to it. The existence of tension systems was 
measured by Zeigarnik's (19) method, which 
assumes that level of recall of tasks is a direct 
function of the magnitude of unreduced tension 
associated with the tasks. According to 
Zeigarnik's assumptions, tension systems are 
unreduced for unfinished tasks but are reduced 
for finished ones, and recall of unfinished tasks 
should therefore be greater than recall of 
finished tasks. In the case of puzzles which Ss 
had voted to finish, a comparison of levels of 
recall for actually finished and unfinished 
puzzles indicated that tension systems could 
be regarded as reduced when S was “carried” 
by the group into a situation in which the 
group task was completed, but that tension 
systems were not reduced when the group 
"carried" him only part way toward comple- 
tion. However, if the S had voted to avoid the 
puzzle, tension systems corresponding to 
avoidance were reduced where the group's 
action satisfied S by removing any possi- 
bility that he would be “carried” into the 
region of avoidance; tension systems were not 
reduced where the group completed the puzzle, 
“carrying” him against his desire into the 
situation he wished to avoid. 

It was apparent from these results that one 
could extend the theory of tension systems (13) 
in a fairly direct way to the active environ- 
ment provided by a social group. This study 
produced one unexpected result, however, 
which pointed to a peculiar characteristic of 
decision making when viewed in the context 
of this type of environment, and which sug-
gested moreover that decision making has
direct effects upon the operation of tension 
systems. Certain Ss found that in the course 
of voting whether or not to work on the 
puzzles their votes differed frequently from the 
announced majority decision. On the basis of 
qualitative observations, it appeared that 
under these conditions Ss lost interest in 
completion or noncompletion of puzzles, and 
instead seemed to define a new task for them- 
selves, namely, to vote so as to agree with the 
majority of the group. The data revealed that 
these Ss shifted the basis on which puzzles 
were recalled, recalling a predominance of 
puzzles for which their votes had agreed with 
the group. But this is an odd result in terms of 
tension-system theory, for, given the task of 
agreeing with the group, agreements can be 
regarded as finished tasks and disagreements 
as unfinished ones. By recalling more agree- 
ments, these Ss were apparently reversing the 
Zeigarnik effect. It was further observed that 
in attempting to agree with the anticipated 
group vote, the decision about how to vote 
had now become a difficult problem for these 
Ss. To interpret the paradoxical finding con- 
cerning the Zeigarnik reversal, hypotheses were 
suggested about possible relationships between 
decision making, tension systems, and recall. 

In studying decisions of an isolated indi- 
vidual, it can be legitimately assumed that if 
an individual pursues one of two alternative 
paths of action, he has made a decision be- 
tween them. However, the peculiar character- 
istic of decision making in an active social 
environment is that just as a person may be 
“carried” toward or away from goals or avoid- 
ances, he may be “carried” by the group past
the decision point before he has been able to 
make a choice between alternatives. In a group 
situation, that is, he may find that he has not 
been given an opportunity to “make up his 
mind” and must act while still in a state of 
indecision. To account for the Zeigarnik re- 
versal it was hypothesized that tension sys- 
tems will affect recall differently in states of 
indecision and decision: if an individual is in a 
state of indecision, then tension systems will 
tend to be expressed in a wish-fulfilling manner, 
resulting in greater relative recall of finished 
tasks; if he is in a state of decision, then tension
systems will tend to be expressed in terms of
goal-directed activity, hence in greater relative
recall of unfinished tasks. In the next section
we attempt to show how this hypothesis can be
derived within Lewin’s theory of motivation,
and we describe an experiment designed to 
test this theoretical interpretation of the 
previously obtained results. 


DERIVATION OF THE HYPOTHESIS 


For the sake of brevity, we shall indicate 
the general form of derivation of the hypoth- 
esis rather than attempt a formal derivation. 
The form of the argument, using concepts 
within Lewin’s theory of motivation, is as 
follows: 

Assumption 1. An individual will manifest 
tendencies toward wish fulfillment (opera- 
tionally: tend to recall finished tasks) in 
psychological situations of high fluidity; an 
individual will manifest tendencies toward 
action (operationally: tend to recall unfinished 
tasks) in psychological situations of low 
fluidity. 

Assumption 2. A state of indecision (“con-
siderations”) corresponds to a psychological
situation of high fluidity; a state of decision
(“readiness for action”) corresponds to a
psychological situation of low fluidity.

Hypothesis. An individual will tend to recall 
finished tasks if he is in a state of indecision; 
he will tend to recall unfinished tasks if he is in 
a state of decision. 

The concepts employed above and the ra- 
tionale for the assumptions made require some 
discussion. Within Lewin’s theory, the struc- 
ture of the psychological situation is treated 
in terms of an individual’s view of positional 
relationships between means and goals (11). 
Lewin further characterizes psychological sit-
uations in terms of their fluidity, or degree of
ease with which restructuring can occur. An
individual who is engaging in fantasy (level of
irreality) is in a situation of high fluidity so
that a relatively weak force, such as a simple
suggestion, may induce him to view himself as
having attained his goal, though objectively
the goal may be quite distant. However, while
he is engaging in everyday business (level
of reality), the degree of fluidity of the psy- 
chological situation is low. His view of the 
steps he must take to reach his goal will tend 
to correspond more closely with objective facts, 
and he will usually require stronger forces in 
the form of evidence, persuasive arguments, 
etc. to modify his views. 

In terms of this concept of degree of fluidity,
we suggest the following rationale for assump-
tion 1. The existence of a need tension within
the individual implies that tendencies will be
aroused to reduce the psychological distance 
between his present position and his goal. 
Given a highly fluid cognitive structure, these 
tendencies are assumed to induce a restructur- 
ing of cognitions in the direction of wish 
fulfillment; that is, rather than acting toward 
the goal, the person tends to alter his view of 
his situation so that he sees himself as having 
attained the goal. Since need tensions are not, 
or only slightly, reduced by such irreal be- 
havior (7), they continue to operate, and the 
person in this situation should continue to 
think about goal attainments. It follows that 
if tension systems exist related to performing a 
series of tasks, a person behaving on the level 
of irreality should recall more finished ones, 
i.e., attainments. (This process seems to cor- 
respond with Rapaport’s [17] description of the 
primary process in psychoanalytic theory, in 
which drives operate to cathect the memory of 
past gratifications.) On the level of reality, 
however, the cognitive structure is less fluid 
and resists such restructuring. In order to 
reduce his psychological distance from the 
goal, the individual is now required to loco- 
mote, i.e., engage in action to attain the goal. 
Ovsiankina (16) has shown that where a sub- 
ject is locomoting toward a goal, interruption 
of the task leaves him with tendencies to 
resume locomotion. Thus, where thinking is 
related to action, he will tend to continue to 
think about the task which is to be resumed, 
and if tension systems related to a series of 
tasks exist, a person behaving on the level of 
reality should recall more unfinished ones.
(This appears to correspond with the psycho- 
analytic secondary process in which ideas 
serve, not as cathexes of the memory of past
gratifications, but to “enhance the chances... 
of discovering the object in reality—or so 
change reality that the object becomes avail-
able” [17, p. 74].)

We propose, in assumption 2, that states of
indecision correspond to being in a fluid
psychological situation, and that states of
decision correspond to being in a nonfluid
situation. Here we follow the conceptual
treatment of decision making proposed by
Cartwright and Festinger (3) and Lewin (12).
The individual in a condition of indecision is
in a state of “considerations,” that is, he
views himself as first in one, then in another, 
of two overlapping situations. His situation is 
fluid because in a state of considerations, rela-
tively small forces can induce him to “change
his mind” about his distance from the goal; 
that is, we assume that, with respect to degree 
of fluidity, a state of considerations is closer 
to fantasy than to action. When the individual 
makes a decision, he eliminates all but one 
alternative, and is no longer fluctuating be- 
tween overlapping possibilities. He moves from 
a psychological situation of considerations to a 
situation of readiness for action. The assump- 
tion involved is that making a decision implies 
a “freezing” of the previously fluid cognitive 
structure. From the preceding assumption 
that need tensions produce wish-fulfillment 
tendencies in fluid situations and action tend- 
encies in nonfluid situations, our hypothesis 
follows, namely, that finished tasks will tend 
to be recalled while an individual is in a state 
of indecision and that unfinished tasks will 
tend to be recalled while he is in a state of 
decision. 

In the next section we describe the experi- 
ment designed to test this hypothesis. The 
hypothesis, as we noted in the introduction, 
was suggested by a proposed interpretation of 
an earlier unexpected finding that in a situa- 
tion where Ss voted for or against performing a 
task, recall of tasks by certain “deviant” Ss 
was mainly influenced by whether or not their 
votes on given tasks agreed with the majority 
vote. This finding was attributed to the occur- 
rence of uncontrolled and only qualitatively
observed variations in the experiment, namely, 
that these Ss had subjectively redefined the 
task to that of being in agreement with the 
group, and that as a result they were fre- 
quently in a state of indecision. In the experi- 
ment reported here, the task explicitly as- 
signed to the Ss is the one which was presumed 
to have been established by these “deviant” Ss.
This procedure has the advantage of possible 
gains in information by testing our present 
hypothesis while simultaneously checking the 
proposed interpretation of the earlier finding. 


METHOD²


Subjects. The Ss were female undergraduates at the
University of Illinois, recruited in five-member teams
to represent their sororities in a test which was de-
scribed to them as measuring group sensitivity. There
were eight groups, each meeting for one two-hour
session. Although a total of 40 Ss participated, the
procedure required that one member of each of the
eight groups receive a different experimental treatment
from the others. Data obtained from these eight Ss are
not reported in the present results, which are based on
the remaining 32 Ss.

² To condense this article, copies of the verbatim
instructions, the various forms used in the experiment,
and the original tabulations of data have been deposited
with the American Documentation Institute. Order
Document No. 4059 from ADI Auxiliary Publications
Project, Photoduplication Service, Library of Congress,
Washington 25, D. C., remitting in advance $2.00 for ...

Apparatus. The physical setup was designed to 
permit Ss to work together in assembling a series of 
jigsaw figures, while allowing only a minimum of com-
munication among them. The Ss were seated around a 
circular table approximately 5 ft. in diameter. The 
jigsaw figures were placed within a smaller concentric 
circle approximately 3 ft. in diameter. Vertical shields 
(plywood panels 22 in. high and 18 in. long) extended 
from this center circle to 6 in. beyond the outer edge 
of the table. These shields were placed so as to divide 
the table into six segments or “booths,” one for each 
of the Ss and one for the experimenter (E). In addition, 
window shades were mounted at the end of each booth, 
and drawn to a level where Ss, while sitting in their 
booths, had a clear view of the jigsaw puzzles placed 
in the center work space, but could not see the occu- 
pants of booths opposite their own. 

Assembling the jigsaw puzzles was entirely a manipu- 
lative task. The jigsaw puzzles were 16 simple figures 
of such objects as "telephone," "shoe," etc. Outlines
of figures were drawn on white cardboard (13¼ in.
× 13¼ in.), and each figure was divided into five sec-
tions. These outlines could be filled in with pieces of
colored poster board cut to match the five sections of 
the outline. Pieces were distributed by E around the 
outline, and Ss moved them into place using rods (30 in. 
long) mounted on swivels in each booth. Attached to 
each of these rods was a small metal tip which enabled 
Ss to lift or push the jigsaw pieces into their appropriate 
places on the outline of the figure. 

In order that Ss could register votes about whether 
or not they desired to complete a given puzzle, two-
way toggle switches were installed in the booths. These
switches were wired to a panel of lights in E's booth so 
that each S could activate a green or red light to indi- 
cate a “yes” or “no” vote, respectively. 

Procedure. The major features of the experimental 
procedure can be summarized as follows:

1. To enhance Ss’ receptivity to the experimental
goal of being in agreement with the group, two pre- 
experimental devices were used: (a) a 15-minute group 
discussion concerning the effects of sorority life on 
improving Ss’ “sensitivity to others’ feelings”; (b) a
so-called “Test on Group Membership” in which Ss 
could—and did—personally commit themselves to the 
importance of being in agreement with their group. 

2. The rules for assembling the jigsaw puzzle re- 
quired that Ss work together to place three of the five 
pieces in place within the puzzle outline, and then vote 
in private whether to stop at that point or continue 
assembling the remaining two pieces. 

3. The group was informed that their score for 
“sensitivity” would be determined by the level of 
agreement attained in the vote—a unanimous vote 
being scored 100 points; agreement among four out of 
five, 50 points; a simple majority, 30 points.

4. Each S was supplied with 20 voting tabs, half 
of which were marked “yes,” and half “no.” Before 
voting each S was required to give up a tab corre- 
sponding to her intended vote, thus insuring that both 
“yes” and “no” votes would be distributed over the 
16 puzzles. 

5. The Ss were provided a supply of “message 
forms” on which they could indicate how they intended 
to vote and their reasons for so voting. In order to 
engender additional feelings of responsibility in the 
voting, each of four Ss was privately informed that 
she had major influence over the outcome, since her 
message—and only hers—would be delivered to another 
S. Each time message forms were collected, one form 
was shown to the fifth S, thereby leading each of the 
others to believe that her own message had been 
delivered. 

6. After messages and voting tabs were turned in, 
Ss registered their votes using the toggle switches at 
their work places. The E recorded their separate votes, 
and then announced to the whole group at once the 
results of the voting. Announcements of the vote fol- 
lowed a prearranged experimental plan, and did not 
necessarily correspond with the actual vote. Votes on 
the first two puzzles were announced as "yes," since 
announcement of “no” votes so early in the series 
might have been interpreted by Ss as meaning that the 
group was rejecting the experimental situation. Subse- 
quent announcements were varied so that each S 
would find her own vote about equally often in agree- 
ment or disagreement with the group vote. 

7. After an announced "yes" vote, the group as- 
sembled the two remaining pieces of the puzzle before 
starting a new one. After a “no” announcement, the 
group immediately began to work a new puzzle. 

8. In order to avoid any suggestion that Ss were 
being given a memory test (13), E, after the above 
cycle had continued for 16 puzzles, stated that he 
wished to get Ss’ reactions to the puzzles up to that 
point, and asked, “Would you jot down the names of 
any of the figures which come to your minds, in order 
to have something specific to refer to when you give 
your reactions.” The Ss were allowed a minute and a
half for listing names of puzzles, after which the lists
were collected. A questionnaire entitled, “Your Reac-
tions to This Test,” designed to check Ss’ under-
standing and acceptance of the experimental instruc-
tions, was then distributed.
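
The scoring rule announced to the group in step 3 can be written out as a small function. This is only an illustrative sketch: the function name and the "yes"/"no" string encoding of votes are assumptions made here, not part of the original procedure.

```python
from collections import Counter

def sensitivity_score(votes):
    """Score one puzzle's vote by the rule announced to the group (step 3).

    `votes` is a list of five "yes"/"no" strings (an assumed encoding).
    """
    if len(votes) != 5 or any(v not in ("yes", "no") for v in votes):
        raise ValueError("expected five votes, each 'yes' or 'no'")
    # With five two-way votes the larger bloc always has 3, 4, or 5 members.
    largest_bloc = max(Counter(votes).values())
    return {5: 100, 4: 50, 3: 30}[largest_bloc]

print(sensitivity_score(["yes", "yes", "yes", "no", "no"]))
```

Because there are five voters and two alternatives, a majority always exists, so every vote earns at least the 30 points of a simple majority.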

Measures used. One of the items on the message
forms read:

“As far as I can figure out the basis for the group's
voting, the group decision:
(check one) a ____ Can turn out either way
            b ____ Will probably turn out to be ‘yes’
            c ____ Will probably turn out to be ‘no.’ ”

Since Ss' goal was to agree with the group, those who
checked a can be regarded as in a state of indecision
concerning their vote; those who checked b or c can
be regarded as in a state of decision. For a given S,
puzzles associated with these respective states can be
classified as either “decision” puzzles or “indecision”
puzzles.


Puzzles can be further classified as ones on which
given Ss had been in agreement or disagreement with
the announced group vote, hereafter designated as
“agreement” and “disagreement” puzzles. The tend-
ency to recall finished tasks is measured by relative
recall of agreement puzzles, and tendency to recall 
unfinished tasks is measured by relative recall of dis- 
agreement puzzles. To avoid misunderstanding, it 
should be noted that we depart here from the more 
familiar practice in which unfinished tasks are coordi- 
nated to interrupted puzzles while finished tasks are 
coordinated to completed puzzles. In terms of the goal 
defined for Ss in the present situation, we assume that 
the task on a given puzzle is finished where an S’s 
vote agrees with the announced group vote. The task 
is assumed to remain unfinished where S's vote dis- 
agrees with the announced group vote. 
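
The two-way classification just described, and the per-S recall proportions built on it, can be sketched as follows. The record layout, the field names, and the six example puzzles are hypothetical and serve only to illustrate the bookkeeping; they are not data from the experiment.

```python
from collections import defaultdict
from dataclasses import dataclass

@dataclass
class PuzzleRecord:
    check: str       # message-form answer: "a", "b", or "c"
    vote: str        # S's own vote: "yes" or "no"
    announced: str   # group vote announced by E: "yes" or "no"
    recalled: bool   # whether S later listed this puzzle

def classify(record):
    """Return (state, outcome) for one puzzle of one S."""
    state = "indecision" if record.check == "a" else "decision"
    outcome = "agreement" if record.vote == record.announced else "disagreement"
    return state, outcome

def recall_proportions(records):
    """Proportion of puzzles recalled within each (state, outcome) cell."""
    totals = defaultdict(int)
    recalled = defaultdict(int)
    for r in records:
        cell = classify(r)
        totals[cell] += 1
        recalled[cell] += int(r.recalled)
    return {cell: recalled[cell] / totals[cell] for cell in totals}

# Hypothetical records for a single S (illustration only).
records = [
    PuzzleRecord("b", "yes", "yes", False),
    PuzzleRecord("b", "yes", "no", True),
    PuzzleRecord("c", "no", "yes", True),
    PuzzleRecord("c", "no", "no", False),
    PuzzleRecord("a", "yes", "yes", True),
    PuzzleRecord("a", "no", "yes", False),
]
for cell, proportion in sorted(recall_proportions(records).items()):
    print(cell, proportion)
```

A cell that an S never entered is simply absent from the result, which mirrors the way some Ss drop out of particular comparisons in the analysis.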


RESULTS

The hypothesis being tested can be divided 
into two subhypotheses, dealing with the 
decision and indecision conditions, respec- 
tively: (a) recall of decision-disagreement 
puzzles will exceed recall of decision-agree- 
ment puzzles (Zeigarnik effect); (b) recall of 
indecision-agreement puzzles will exceed recall 
of indecision-disagreement puzzles (Zeigarnik 
reversal).

Considering the decision condition first, we 
can compare the proportions of disagreement 
and agreement puzzles recalled by each S, and 
determine how many Ss did and did not recall 
relatively more disagreement puzzles. The null 
hypothesis is equivalent to the statement 
that the occurrence of an S whose recall con- 
forms with the hypothesis is equiprobable with 
the occurrence of an S whose recall does not 
conform. This can be tested by the binomial 
expansion with p = .5, using a one-tailed test 
since direction is predicted. Thirty-one Ss can 
be compared; for one S the comparison is 
impossible since she never found herself in 
disagreement with the group while in an
indecision condition. Twenty Ss (64.5 per cent) 
recalled relatively more disagreements than 
agreements in conformity with the hypoth- 
esis; 11 Ss (35.5 per cent) did not recall 
more disagreements than agreements. On the 
assumption made in the null hypothesis, the 
probability of obtaining this distribution by 
chance is .075 (20). This is a somewhat con-
servative test of the hypothesis, because four
Ss who recalled equal numbers of agreements
and disagreements are included among the
negative cases.
If these four Ss are eliminated from con-
sideration, then 20 Ss (74.1 per cent) recalled
more disagreements than agreements and
seven Ss (25.9 per cent) recalled more agree-
ments than disagreements; the probability of
obtaining this outcome by chance is approxi-
mately .01. If the four Ss are distributed
equally between the two groups, then p = .02.
In any case the subhypothesis regarding recall
under conditions of decision can be accepted 
at close to or better than the .05 level of 
confidence. 

For states of indecision we can compare only
17 of the 32 Ss, since nine Ss never indicated 
that they were undecided, and of the remain-
ing 23, two had no agreements and four had
no disagreements in the indecision conditions. 
Here we find the predicted reversal in recall 
tendencies. Only five Ss (29.4 per cent) re- 
called more disagreements than agreements, 
but 12 Ss (70.6 per cent) recalled more agree- 
ments than disagreements. Evaluated as 
above, the subhypothesis regarding recall 
under conditions of indecision can be accepted 
at the .072 level of confidence. 
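
The one-tailed binomial probabilities quoted above can be checked with a short computation. The sketch below sums the upper tail of the binomial expansion with p = .5; the function name is an assumption made for illustration, and the counts are those given in the text.

```python
from math import comb

def one_tailed_binomial_p(successes, n, p=0.5):
    """Probability of `successes` or more conforming Ss out of n under the null."""
    return sum(
        comb(n, k) * p**k * (1 - p) ** (n - k)
        for k in range(successes, n + 1)
    )

# Decision condition: 20 of 31 Ss conform (ties counted as negative cases).
print(round(one_tailed_binomial_p(20, 31), 3))
# Decision condition with the four tied Ss dropped: 20 of 27.
print(round(one_tailed_binomial_p(20, 27), 3))
# Indecision condition: 12 of 17 Ss conform.
print(round(one_tailed_binomial_p(12, 17), 3))
```

Under these assumptions the three tails come to about .075, .010, and .072, in line with the reported values. For the case in which the four tied Ss are split equally (22 of 31), the exact tail is about .015, which appears to correspond to the rounded figure of .02 given in the text.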

The method used above for testing the two 
subhypotheses treats the proportions of agree- 
ments or disagreements recalled by each S as 
scores which represent the strength of S's 
tendency to recall either type of puzzle. It 
is clear that this method is an insensitive one, 
in that it ignores information in the scores 
about the strength of these recall tendencies. 
We next evaluate our data by means of the t
test, which uses the information about strength 
of recall, although it has the disadvantage of 
combining proportions whose variances cannot 
be regarded as homogeneous, particularly 
where, as here, they are based on samples of 
unequal size (5). 

In the decision condition, the mean of 
proportions of disagreement puzzles recalled 
by the 31 Ss is .414. The mean proportion of 
agreement puzzles recalled is .279. The value 
of t for the difference between these correlated
means is 2.46 (Table 1), which for 30 df is 
significant at the .01 level, using a one-tailed 
test. For the 17 Ss who can be compared in the 
indecision condition, the mean proportion of 
agreements recalled is .505; the mean pro- 
portion of disagreements recalled is .277. Here
t = 1.90, which for 16 df is significant at better
than the .05 level. These results are again 
consistent with the hypothesis. 
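
For readers who wish to see the form of the test for correlated means, a minimal sketch follows. The individual proportions are deposited elsewhere and are not reproduced in this article, so the four pairs of values below are hypothetical and chosen only to show the arithmetic; the function name is likewise an assumption.

```python
from math import sqrt
from statistics import mean, stdev

def paired_t(first, second):
    """t for the difference between correlated means, with its df."""
    if len(first) != len(second):
        raise ValueError("each S must contribute one value to each list")
    diffs = [a - b for a, b in zip(first, second)]
    n = len(diffs)
    # Standard error of the mean difference uses the sample SD (n - 1).
    t = mean(diffs) / (stdev(diffs) / sqrt(n))
    return t, n - 1

# Hypothetical proportions recalled by four Ss (illustration only).
disagreement = [0.50, 0.40, 0.60, 0.25]
agreement = [0.25, 0.30, 0.20, 0.25]
t, df = paired_t(disagreement, agreement)
print(round(t, 2), df)
```

The resulting t is referred to a table of t with n - 1 df, one-tailed where the direction of the difference was predicted.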

TABLE 1

RESULTS OF t TESTS COMPARING RECALL AMONG
PAIRS OF TYPES OF PUZZLES

                                    MEAN DIFFERENCE
TYPE OF PUZZLE           DECISION-       INDECISION-     INDECISION-
                         DISAGREEMENT    AGREEMENT       DISAGREEMENT
Decision-agreement       .134**          .292††          .060
Decision-disagreement                    .195†           .004
Indecision-agreement                                     .228*

* t = 1.90, 16 df, p < .05 (one-tailed test).
† t = 2.61, 20 df, p < .05 (two-tailed test).
** t = 2.46, 30 df, p < .01 (one-tailed test).
†† t = 4.13, 19 df, p < .01 (two-tailed test).

One further statistical test of these data
throws some light on whether the above results
are seriously affected by our dealing with pro-
portions based on small samples. The recall
scores for decision-disagreement and decision-
agreement puzzles were based on samples of
mean size, N = 5.1 (range N = 1 to 8) and
N = 5.4 (range N = 2 to 8), respectively.
Where proportions are given equal weight in
averaging, without regard to differences in size
of sample, the proportions based on the samples
of smaller size are more likely than those based
on larger samples to take extreme values, and
therefore may have unjustifiably strong effects
in either raising or lowering the mean of all
proportions. In Table 2, we pool all decision-
disagreement puzzles and all decision-agree-
ment puzzles, respectively, and compare the
frequencies with which these puzzles are re-
called, disregarding the fact that puzzles were
recalled by different Ss. The effect of this proce-
dure is to give greater weight in the comparison
to recalls which are based on larger samples.
The procedure appears to be statistically justi-
fied, for it assumes that the frequency with
which an individual recalls puzzles under a
given treatment can be treated as a random
sample drawn from a common population of
individuals within that treatment. The tenabil-
ity of this assumption is tested by considering
whether the variations in frequencies of indi-
vidual recall are no greater than what would
be expected in randomly sampling homogene-
ous material. The test for homogeneity applied
to the data on recall of decision-agreement
puzzles by separate Ss yields a chi square of
25.045, which for 30 df is not significant.
For recall by separate Ss of decision-disagree-
ment puzzles, chi square = 29.629, 30 df,
which is likewise not significant, indicating
that it is legitimate to pool individual data
under each class of puzzles, and to treat the
data thus combined as a single sample repre-
sentative of a common population. As shown
by Table 2, the pooled recall of disagreement
puzzles (43.4 per cent) is significantly greater
(p < .01) than recall of agreement puzzles
(29.1 per cent). It is interesting that the cor-
responding mean proportions reported above,
where the t test was applied to unpooled data,
are, in terms of percentages, 41.4 per cent
and 27.9 per cent, respectively, suggesting
that variations in sample size had negligible
effects on the results.

TABLE 2

COMPARISON OF POOLED RECALL OF AGREEMENT AND
DISAGREEMENT PUZZLES IN STATES OF DECISION

TYPE OF PUZZLE           RECALLED        NOT RECALLED    TOTAL
Agreement puzzles         46 (29.1%)     112 (70.9%)     158 (100%)
Disagreement puzzles      73 (43.4%)      95 (56.6%)     168 (100%)
Total                    119             207             326

Note: χ² = 7.22, 1 df, p < .01 (one-tailed test).

The same comparison is made in Table 3 
for those cases where Ss were in a state of 
indecision. Again the legitimacy of pooling 
data from separate Ss is supported by the 
results of chi-square tests for the homogeneity 
of individual recall (chi square = 22.013, 17 df,
for indecision-disagreement puzzles; chi square
= 14.040, 17 df, for indecision-agreement
puzzles—neither chi square being significant). 
The predicted reversal in recall is again ap- 
parent in Table 3. A higher percentage of 
agreement puzzles is recalled (50.0 per cent) 
than disagreement puzzles (32.0 per cent), 
this difference being significant at better than 
the .05 level. The corresponding mean propor- 
tions once more appear to be very close, being 
in percentage terms, 50.5 per cent and 27.7 
per cent, respectively. 
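The chi-square values in the notes to Tables 2 and 3 can be reproduced from the pooled counts. The sketch below assumes the ordinary Pearson chi square for a 2 x 2 table without a continuity correction; the source does not state the formula, but this one matches the reported figures. The function name is illustrative.

```python
def chi_square_2x2(a, b, c, d):
    """Pearson chi square (no continuity correction) for the table
           recalled  not recalled
    row 1     a          b
    row 2     c          d
    """
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


# Pooled counts from Table 2 (decision) and Table 3 (indecision):
# agreement puzzles in row 1, disagreement puzzles in row 2.
table_2 = chi_square_2x2(46, 112, 73, 95)
table_3 = chi_square_2x2(29, 29, 16, 34)
print(round(table_2, 2), round(table_3, 2))  # prints 7.22 3.58
```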

We have made no predictions comparing 
levels of recall across the decision and indecision
conditions. Theoretically, tension systems
should not increase recall for decision-agree- 
ment puzzles or indecision-disagreement puz- 
zles. The mean proportions of recall for Ss who 
can be compared on these two sets of puzzles 
are .295 and .355, respectively, the difference 
of .060 (Table 1) being nonsignificant, which 
suggests that in the absence of effects due to 
tension systems, decision and indecision puzzles 
tend to be equally recalled. The percentages 
based on pooled recall of these puzzles are 31.5
per cent and 38.8 per cent, respectively (chi
square = .686, 1 df, not significant), which are
sufficiently close to the preceding mean proportions
to reassure us that the latter have not
been unduly affected by individual proportions
which are based on small samples.

TABLE 3
COMPARISON OF POOLED RECALL OF AGREEMENT AND
DISAGREEMENT PUZZLES IN STATES OF INDECISION

TYPE OF PUZZLE       | RECALLED   | NOT RECALLED | TOTAL
Agreement puzzles    | 29 (50.0%) | 29 (50.0%)   | 58 (100%)
Disagreement puzzles | 16 (32.0%) | 34 (68.0%)   | 50 (100%)
Total                | 45         | 63           | 108

Note: χ² = 3.58, 1 df, p < .05 (one-tailed test).
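The distinction drawn here between mean proportions (each S weighted equally) and pooled percentages (each puzzle weighted equally) can be made concrete with a short sketch. The function names and the example counts are hypothetical and are not taken from the study.

```python
def mean_proportion(counts):
    """Unweighted mean of per-S proportions.

    counts: list of (recalled, total) pairs, one per S.
    An S with few puzzles counts as much as an S with many.
    """
    return sum(recalled / total for recalled, total in counts) / len(counts)


def pooled_proportion(counts):
    """Proportion recalled after pooling all puzzles across Ss."""
    return sum(r for r, _ in counts) / sum(t for _, t in counts)


# Made-up data: one S with only 2 puzzles, one S with 10.
example_counts = [(1, 2), (3, 10)]
print(round(mean_proportion(example_counts), 3))    # prints 0.4
print(round(pooled_proportion(example_counts), 3))  # prints 0.333
```

When the two figures agree closely, as the text reports for most comparisons, proportions based on small samples are unlikely to have distorted the mean.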

Theoretically, tension systems should oper- 
ate to increase recall of agreement puzzles in 
the indecision condition, but should not affect 
recall of agreement puzzles in the decision 
condition. The mean proportions for Ss who 
can be compared for these two classes of 
puzzles are .571 and .279 respectively, the 
difference (.292) being significant at better 
than the .01 level (Table 1). We used a two- 
tailed test, since differences in recall across 
the indecision and decision conditions are not 
derivable from our theory without an addi- 
tional assumption, namely, that recall between 
the two conditions will be equal in the absence 
of effects from tension systems. The corre- 
sponding percentages based on pooled recall 
are 51.8 per cent and 30.5 per cent (chi 
square = 6.337, 1 df, .02 > p > .01), which
appears to indicate again that the preceding
mean proportions were not unduly affected by
proportions based on small samples.

In a similar fashion, we can compare the 
mean proportions for decision-disagreement 
puzzles, where tension systems are theoretically
operative, with indecision-disagreement
puzzles, where they are presumably inoperative.
Here the mean proportions are very close,
.340 and .336 respectively, the difference (.004)
being nonsignificant. The corresponding pooled
values are 35.5 per cent and 35.2 per cent,
respectively (chi square = .0016, 1 df, not
significant). Thus, we find no evidence of reliably
greater recall where tension systems
are operative in situations of decision than
where tension systems are inoperative in situations
of indecision. This, it should be noted,
is in contrast to the preceding finding that the
operation of tension systems in situations of
indecision produces greater recall than the
nonoperation of tension systems in situations
of decision.

The question now arises whether tension 
systems operate with equal effect in the 
decision and indecision conditions. If we com- 
pare the mean proportions for decision-dis- 
agreement puzzles (.372) and indecision-agree- 
ment puzzles (.567), the difference (.195) is 
significant at better than the .02 level, using a 
two-tailed test (Table 1). There is some evi- 
dence that this difference was produced in 
part by proportions based on small samples. 
The corresponding pooled percentages are 
39.6 per cent and 51.6 per cent, respectively, 
their difference being reduced to 12.0 per cent 
(chi square = 2.232, 1 df, p = .14). While the 
reliability of this difference is open to question, 
the possibility is suggested that equivalent 
amounts of motivational energy will have a 
greater effect on recall in indecision states than 
in decision states. In other words, tension 
systems may affect recall more strongly when 
an individual is engaged in wish fulfillment, his 
cognitive structure being highly fluid, than 
when he is oriented toward action, and his 
cognitive structure is relatively nonfluid. 
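The probability attached to a chi square with 1 df can be checked with the standard library alone, since such a chi square is the square of a standard normal deviate. The sketch below is an illustration under that identity; the function names are assumptions, and halving the two-tailed value for a directional prediction is one common convention rather than a procedure the source spells out.

```python
import math


def p_two_tailed_1df(chi_square):
    """Upper-tail probability of a chi square with 1 df.

    Equals the two-tailed normal probability for z = sqrt(chi_square).
    """
    return math.erfc(math.sqrt(chi_square / 2.0))


def p_one_tailed_1df(chi_square):
    """Half the two-tailed value, for a difference in the predicted direction."""
    return p_two_tailed_1df(chi_square) / 2.0


print(round(p_two_tailed_1df(2.232), 3))  # close to the reported p = .14
print(round(p_two_tailed_1df(6.337), 3))  # falls between .01 and .02
```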

Our hypotheses are stated in terms of 
psychological mechanisms which should be 
exhibited by any person. One would therefore 
be obliged to explain the behavior of persons 
not conforming to the hypotheses in terms of 
their failure to adopt the set which the experi- 
mental manipulations were designed to pro- 
duce. Questionnaires distributed to Ss after 
each session attempted to ascertain how Ss 
viewed the experimental situation. To measure 
the degree to which Ss accepted the goal which 
we tried to induce, they were asked: “Have
you ever lost interest or become impatient
with the test?”; answers were indicated on a
seven-point rating scale ranging from “never”
to “frequently.” Scores were assigned to Ss'
responses, ranging from 7 for “never lost
interest” to 1 for “frequently lost interest.”
Since all Ss found themselves in disagreement 
with the group vote approximately half the 
time, any tension systems which they may 
have developed, related to the goal of attaining 
maximum agreement in voting, should remain 
to some degree unreduced at the time of recall. 
However, the initial magnitude of Ss’ tension
systems should be expected to vary with their 
level of interest in the assigned task—which 
implies systematic differences in the degree to
which different Ss will manifest the effects we
have hypothesized.

TABLE 4
COMPARISON OF MEAN INTEREST SCORES FOR Ss
WHOSE RECALL OF DECISION OR INDECISION
PUZZLES WAS CONSISTENT OR INCONSISTENT
WITH THE HYPOTHESIS

TYPE OF RECALL          | MEAN SCORE | DIFF. | t
Decision puzzles        |            |       |
  Consistent (N = 20)   | 4.95       | 1.41  | 2.58*
  Inconsistent (N = 11) | 3.54       |       |
Indecision puzzles      |            |       |
  Consistent (N = 12)   | 4.58       | 2.38  | 3.56**
  Inconsistent (N = 5)  | 2.20       |       |

* df = 29, p < .02 (two-tailed test).
** df = 15, p < .01 (two-tailed test).

Analysis of responses to the "interest" item 
shows a significant relationship between Ss’ 
expressed interest and whether their recall con- 
forms to the hypothesis (Table 4). In the deci- 
sion condition, Ss who recalled relatively more 
disagreements than agreements, i.e., who
showed the predicted Zeigarnik effect, had sig- 
nificantly higher interest scores (M = 4.95) 
than those who failed to show the effect 
(M = 3.54). The difference between these
means is significant at better than the .02 
level. In the indecision condition, Ss who 
recalled relatively more agreements than disagreements,
i.e., who showed the predicted
reversal of the Zeigarnik effect, were again 
significantly higher in interest (M = 4.58)
than those who did not perform as predicted 
(M = 2.20). This difference is significant at 
better than the .01 level. It is evident from 
these data that at least some of the variance 
in our data can be attributed to the fact that 
the attempt to induce Ss to accept the experi- 
mental goal did not meet with uniform success. 
One can conclude that the assumption that
we are dealing with a psychological mechanism
common to individuals in general is not an
untenable one.


DISCUSSION


The problem dealt with in this paper is an 
instance of the more general one of how events 
in the social environment impinge on psycho- 
logical processes in the individual. The attempt 
to ask questions involving the interaction of
individual and group variables appears to lead
to formulations with interesting consequences
for individual and group psychology considered
separately. In the following discussion we wish
to indicate implications of the study, first for
the psychodynamics of decision making, and
second for the conceptual treatment of motivational
persistence in social groups.

Psychodynamics of decision making. We alluded
earlier to Cartwright’s (2) theory of the
process involved in coming to a decision. This 
theory, which has been elaborated and quanti- 
fied by Cartwright and Festinger (3), assumes 
that the person who is making a choice between 
two courses of action views himself first in one, 
then in another, of two overlapping situations. 
When the forces operating on the person in 
one situation exceed the forces in the other
situation by a given amount, the person
will act.

In a subsequent paper, dealing with the 
so-called group decision experiments by Bave- 
las and others, Lewin (12) suggests an exten- 
sion of this theory according to which decision 
making additionally involves a reduction of 
one of the opposing sets of forces. In one of the 
Bavelas experiments Ss, in two different treat- 
ments, were presented with the alternative of 
serving or not serving certain new foods. It 
was found that in the treatment in which Ss 
engaged in an act of decision about using the 
new foods, a greater proportion served these 
foods, and persisted in doing so, than did those 
in the treatment which omitted the act of 
decision. On the strength of these results, 
Lewin argues that decision making must mean 
not merely that the forces toward one alterna- 
tive have become stronger than the forces 
toward the other alternative, but that “the
potency of one alternative has become zero 
or is so decidedly diminished that the other 
alternative and the corresponding forces 
dominate the situation....If the opposing 
forces in a conflict merely change so that the 
forces in one direction become slightly greater
than in the other direction, a state of blockage 
or extremely inhibited action results rather 
than that clear one-sided action which follows 
a real decision” (12, p. 336). 

In Lewin's system of motivation, needs or 
tension systems are the sources of energy 
which underlie psychological forces. If de- 
cision making involves reducing the potency 
of one set of psychological forces, a further 
problem can be raised within this system 
about what happens to the tension systems 


E 


4 
DECISION MAKING AND THE RECALL OF FINISHED AND UNFINISHED TASKS 


underlying these forces when decisions are 
made. The present study, which makes use of 
the fact that in a social group behavior beyond 
a choice point may occur with or without a 
prior act of decision, sheds light on this prob- 
lem. On the assumption that indecision corre- 
sponds to an individual's being located in a 
fluid behavioral field, we derived the conse- 
quence that when he is unable to make a 
decision, motivational energies should be 
channeled into wish fulfillment. The act of 
decision making, we assumed, stabilizes the 
behavioral field and makes cognitive restruc- 
turing difficult, so that if the organism is to 
reduce its need tensions, it must channel 
energy into action. From this standpoint the 
act of decision making can be viewed as in- 
volving a mechanism for controlling the dis- 
position of motivational energy into tendencies 
toward wish fulfillment versus tendencies to- 
ward action. The experiment reported here 
attempted to isolate these two modes of chan- 
neling energy, and the results appear to sub- 
stantiate Lewin's suggestion that, contrary to 
the prevalent view, action cannot be regarded 
as a direct result of motivation, but requires 
mediation by decision or some equivalent 
process. This would account for the effects of 
the presence or absence of decision making 
upon persistence or nonpersistence of action 
in the Bavelas experiments, and upon recall of 
unfinished or finished tasks in the present 
study. 

Motivational persistence in social groups. 
Theorists of social organization such as Simon 
(18) and Barnard (1) have indicated the need 
for a more adequate psychological treatment 
of decision making for dealing conceptually 
with the structure of organizational activities. 
After discussing the ubiquitousness of decision
making in social life, and vainly consulting
the psychological literature for a treatment
of the processes involved, Barnard observes
that “decisive behavior, as contrasted with
responsive behavior, seems to have received
little attention in the psychologies” (1, p. 30).

Recent work on group dynamics raises a
similar theoretical demand. Groups operating
on the basis of consensus have been studied in 
which the member is allowed to decide about 
alternative paths before he engages in activi- 
ties toward a goal. These groups have been 
contrasted with “autocratic” ones in which 
the decision-making step is more frequently
short-circuited, with decisions being made for
the member about the path he is obliged to
follow in acting toward a goal. Lippitt (14)
found that in “democratic” or decision-making
groups, individuals showed greater persistence
in their work than did members of “autocratic”
groups. The work of Coch and French
(4) and Maier (15) gives further evidence that
persistence in goal striving—one aspect of 
what we generally designate as morale—is 
associated with the opportunity of group 
members to make decisions relevant to their 
goals. 

The present study, which focuses on the 
effects of decision making on internal processes 
of the individual, appears to clarify to some 
extent why decision making in a group should 
produce the above results. The hypothesis 
investigated here implies that a significant 
dimension of a group is the degree to which it 
constitutes a psychological environment in 
which, at one extreme, members are carried 
toward or away from goals or avoidances. 
With respect to the activity of decision making, 
this characteristic of a group will have the 
following consequence: If the group is one in 
which individuals are carried through points 
of decision, members will tend to find them- 
selves in fluid psychological situations, and 
tend to channel motivational energies into 
wish fulfillment. If the group is one in which 
members can engage in the activity of decision 
making, they will find themselves in less fluid 
psychological situations and show greater 
“action orientation.”

The indecisiveness of members in the present 
experiment resulted from the fact that the 
group required them to act even though they 
had insufficient information on which to base 
a decision. In this respect the present situation 
differs from that of “autocratic” groups in 
which members are not allowed to make 
decisions. Other conditions which affect the 
degree of decisiveness of individuals have been 
described in sociological terms by Durkheim 
(6) in his discussion of anomie. An experi- 
mental attempt to reproduce anomie and 
related characteristics of groups, and to in- 
vestigate their effects from the standpoint of 
decision making and persistence in action, has 
been reported in a recent publication (10). 


SUMMARY 


In attempting to interpret results of an 
earlier experiment, the hypothesis was suggested
that as a result of decision making,
motivational energy tends to be channeled 
into action, and that in the absence of de- 
cision making, motivational energy tends to be 
channeled into wish fulfillment. To isolate 
these hypothesized effects, behavior of Ss was 
compared in a group situation under condi- 
tions where Ss were able or unable to make 
decisions. The channeling of motivation into 
action was conceptually coordinated to the 
tendency to recall unfinished tasks; the chan- 
neling of motivation into wish fulfillment, to 
the tendency to recall finished tasks. 

The goal set for Ss in the present experiment 
was that of voting so as to agree with an- 
nounced majority decisions. Tendency to recall 
finished tasks was operationally measured by 
recall of agreements, and tendency to recall 
unfinished tasks by recall of disagreements.
Three modes of statistical analysis were em-
ployed in an effort to deal with problems which 
arose because our scores were based on propor- 
tions. These tests showed that the hypothesis 
concerning the tendency to recall disagreements 
in decision states and the contrasting tendency 
to recall agreements in indecision states could 
be accepted at close to, or better than, the .05 
level. A significant relationship was found, too, 
between degree of acceptance by Ss of the 
experimentally assigned goal and whether or 
not their behavior conformed with the hypoth- 
esis, indicating that negative cases were due 
in part to failure to induce all Ss to accept the 
experimental situation. Implications of these 
findings for the psychodynamics of decision 
making, and for the explanation of group 
effects on the motivational persistence of 
members, were discussed. 


REFERENCES 


1. BARNARD, C. I. The nature of leadership. In S. D.
Hoslett (Ed.), Human factors in management.
New York: Harper, 1946. Pp. 13-38.

2. CARTWRIGHT, D. Decision-time in relation to the
differentiation of the phenomenal field. Psychol.
Rev., 1941, 48, 425-442.

3. CARTWRIGHT, D., & FESTINGER, L. A quantitative
theory of decision. Psychol. Rev., 1943, 50,
595-621.

4. COCH, L., & FRENCH, J. R. P., JR. Overcoming
resistance to change. Hum. Relat., 1948, 1,

5. COCHRAN, W. G. Analysis of variance for percentages
based on unequal numbers. J. Amer.
statist. Ass., 1943, 38, 287-301.

6. DURKHEIM, E. Suicide, a study in sociology. (Trans.
by J. A. Spaulding & G. Simpson.) Glencoe, Ill.:
Free Press, 1951.

7. ESCALONA, SYBILLE K. Play and substitute satisfaction.
In R. G. Barker, J. S. Kounin, & H. F.
Wright (Eds.), Child behavior and development.
New York: McGraw-Hill, 1943. Pp. 363-378.

8. FISHER, R. A. Statistical methods for research workers.
Edinburgh: Oliver & Boyd, 1946.

9. HORWITZ, M. The recall of interrupted group tasks:
an experimental study of individual motivation
in relation to group goals. In D. Cartwright &
A. Zander (Eds.), Group dynamics: research and
theory. Evanston, Ill.: Row, Peterson, 1953. Pp.
361-386.

10. HORWITZ, M., EXLINE, R. V., GOLDMAN, M., &
LEE, F. J. Motivational effects of alternative decision
making processes in groups (ONR Tech.
Rep.) Urbana, Ill.: Univer. of Illinois, College of
Education, Bureau of Educational Research,
1953.

11. LEWIN, K. Principles of topological psychology.
(Trans. by F. Heider & Grace Heider.) New
York: McGraw-Hill, 1936.

12. LEWIN, K. Group decision and social change. In
T. M. Newcomb & E. L. Hartley (Eds.), Readings
in social psychology. New York: Holt, 1947.
Pp. 330-344.

13. LEWIN, K. Formalization and progress in psychology.
In K. Lewin, Field theory in social science,
D. Cartwright (Ed.). New York: Harper, 1951.
Pp. 1-29.

14. LIPPITT, R. An experimental study of the effect
of democratic and authoritarian group atmospheres.
Univer. Ia. Stud. Child Welf., 1940, 16,
No. 3, 45-195.

15. MAIER, N. R. F. Principles of human relations,
applications to management. New York: Wiley,

16. OVSIANKINA, MARIA. Die Wiederaufnahme unterbrochener
Handlungen. Psychol. Forsch., 1928,
14, 302-372.

17. RAPAPORT, D. The conceptual model of psychoanalysis.
In D. Krech & G. S. Klein (Eds.),
Theoretical models in personality theory. Durham,
N. C.: Duke Univer. Press, 1952. Pp. 56-81.

18. SIMON, H. A. Administrative behavior. New York:
Macmillan, 1950.

19. ZEIGARNIK, BLUMA. Ueber das Behalten von
erledigten und unerledigten Handlungen. Psychol.
Forsch., 1927, 9, 1-85.

20. Tables of the binomial probability distribution.
Washington: U. S. Dept. of Commerce, National
Bureau of Standards, 1952.


Received December 18, 1952. 




THE INFLUENCE OF ROLE PLAYING ON OPINION CHANGE¹


IRVING L. JANIS AND BERT T. KING
Yale University 


In many everyday situations, people are
induced to play social roles in which they
express ideas that are not necessarily in
accord with their private convictions. That
certain types of role-playing experiences can
facilitate changes in personal opinions has
been suggested by various impressionistic
observations (e.g., Myers [8]). In recent
years, psychodramatic techniques which involve
role playing have been developed for
use in adult education programs, leadership
training, employee counseling and psychotherapy
(1, 5, 6, 7, 9). The usual procedure
consists of having persons in a group play
specified roles in a simulated life situation.
One of the main values of this role-playing
device, according to its proponents, is that it
has a corrective influence on various beliefs
and attitudes which underlie chronic difficulties
in human relations (cf. Maier [6]).

¹ This study was conducted at Yale University as
part of a program of research on factors influencing
changes in attitude and opinion. The research program
is supported by a grant from the Rockefeller Foundation
and is under the general direction of Professor
Carl I. Hovland, to whom the authors wish to express
appreciation for helpful suggestions and criticisms.
The authors also wish to thank Professor Fred D.
Sheffield for valuable suggestions during discussions
preparatory to designing the experiment.

As yet little is known about the conditions
under which role playing leads to actual
changes in personal opinions. The present experiment
was designed to investigate the
effects of one type of demand that is frequently
made upon a person when he is induced
to play a social role, namely, the requirement
that he overtly verbalize to others
various opinions which may not correspond
to his inner convictions.

As a preliminary step in exploring the effects
of role playing, one of the present authors interviewed
a group of collegiate debaters who,
as members of an organized team, repeatedly
were required to play a role in which they
publicly expressed views that did not correspond
to their personal opinions. Most of
the debaters reported that they frequently
ended up by accepting the conclusions which
they had been arbitrarily assigned to defend.
Myers’ (8) impressionistic account of the 
improvement in morale attitudes produced by 
participation in an Army public-speaking 
course points to the same phenomenon and 
suggests that attitude changes may occur 
even when role playing is artificially induced. 
If true, it would appear that “saying is believing”—that
overtly expressing an opinion
in conformity to social demands will influence 
the individual's private opinion. Conse- 
quently, it seemed worth while to attempt to 
investigate the effects of this type of role 
playing in a more controlled laboratory situa- 
tion where, if the alleged gain from role play- 
ing occurs, it might be possible to isolate the 
critical factors and to explore systematically 
the mediating mechanisms. 

The role-playing effects described above 
have not as yet been verified by systematic 
research. If verified, they would still remain 
open to a variety of alternative explanations. 
For instance, inducing the individual to play 
a role in which he must advocate publicly a 
given position might guarantee exposure to 
one set of arguments to the exclusion of others. 
An alternative possibility, however, is that 
even when exposed to the same persuasive 
communications, people who are required to 
verbalize the content to others will tend to be 
more influenced than those who are only 
passively exposed. In order to test this hypoth- 
esis, the present experiment was designed 
so that communication exposure would be 
held constant by comparing the opinion 
changes of active participants and passive 
controls who were exposed to the same com- 
munications. 


METHOD AND PROCEDURES 


An initial questionnaire, which was administered 
as an opinion survey in a large classroom of male 
college students, contained a series of questions con- 
cerning expectations about the future. Included in this 
“before” questionnaire were the following key opinion 
items, which dealt with the subject matter of the three 
communications to which the experimental groups 
were subsequently exposed: 

Item A: During the past year a number of movie
theaters were forced to go out of business as a result of
television competition and other recent developments. 
At the present time there are about 18,000 movie 
theaters remaining. How many commercial movie 
theaters do you think will be in business three years 
from now? 

Item B: What is your personal estimate about the 
total supply of meat that will be available for the civilian 
population of the United States during the year 1953? 
(____ per cent of what it is at present.)

Item C: How many years do you think it will be 
before a completely effective cure for the common cold 
is discovered? 

The experimental sessions were held approximately 
four weeks after the initial questionnaire had been 
filled out, and were represented as being part of a re- 
search project designed to develop a new aptitude 
test for assessing oral speaking ability. The subjects 
(Ss) were asked to give an informal talk based on an 
outline prepared by the experimenters (Es) which stated 
the conclusion and summarized the main arguments to 
be presented. The arguments were logically relevant 
but highly biased in that they played up and inter- 
preted “evidence” supporting only one side of the issue. 
Each active participant was instructed to play the role 
of a sincere advocate of the given point of view, while 
two others, who were present at the same experimental 
session, listened to his talk and read the prepared out- 
line. Each S delivered one of the communications and 
was passively exposed to the other two. In order to 
prevent selective attention effects, the active partici- 
pant was not told what the topic of his talk would be 
until his turn came to present it. He was given about
three minutes to look over the prepared outline, during
which time the others (passive controls) also were requested
to study duplicate copies of the same outline
so as to be prepared for judging the adequacy of the
speaker's performance. After the first talk was over,
another S was selected to present the second communication,
and then the remaining S presented the
third communication, the same procedures being fol- 
lowed in each case. 

Immediately after the last talk was finished, Ss
were given the “after” questionnaire, much of which
was devoted to rating the performance of each speaker.
The key opinion items were included among numerous
filler items, all of which were introduced as questions
designed to provide information about the student's
interests and opinions concerning the three topics so
as to enable the investigators to select the most appropriate
topic for future applications of the oral
speaking test.

In all three communications, the conclusion specified
an opinion estimate which was numerically lower than
that given by any of the students on the “before” test.
Thus, all active participants were required to argue in
favor of an extreme position which differed from their
initial beliefs. The influence of each communication
could readily be observed by noting the degree to
which the students in each group lowered their opinion
estimates on the “after” test.

The basic schema of the experiment is shown in
Table 1. In each row of the table, which represents
exposure to a given communication, there is one group
of active participants and two contrasting groups which,
when combined, form the group of passive controls.
In effect, the experimental treatments were repeated
with different communication contents, providing
three separate instances of active versus passive exposure,
although the same Ss were used throughout.

TABLE 1
SCHEMA OF THE EXPERIMENTAL CONDITIONS

                                | GROUP A             | GROUP B             | GROUP C
                                | (N = 31)            | (N = 29)            | (N = 30)
Communication A: movie theaters | active participants | passive controls    | passive controls
Communication B: meat supply    | passive controls    | active participants | passive controls
Communication C: cold cure      | passive controls    | passive controls    | active participants
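The rotation in Table 1 can be written out as a small sketch: each group is active for one communication and passive for the other two, so the passive controls for a communication are the other two groups combined. The group sizes come from Table 1; the function and variable names are illustrative assumptions.

```python
# Group sizes as given in Table 1. Each group delivers the
# communication bearing its own letter.
GROUP_SIZES = {"A": 31, "B": 29, "C": 30}


def roles_for(communication):
    """Map each group to its role for the given communication."""
    return {
        group: "active" if group == communication else "passive"
        for group in GROUP_SIZES
    }


def expected_passive_n(communication):
    """Passive controls before any exclusions: the two other groups combined."""
    return sum(n for group, n in GROUP_SIZES.items() if group != communication)


for communication in GROUP_SIZES:
    print(communication, roles_for(communication), expected_passive_n(communication))
```

The expected passive totals (59, 61, and 60) are upper bounds. The numbers actually analyzed may be smaller where cases with inadequate data were dropped.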

In order to obtain some information for checking 
on selective attention effects, a variation of the passive 
control condition (not represented in the table) was 
introduced into the experiment by using a small sup- 
plementary group who listened and took notes on all 
three talks. In addition, base-line data for assessing the
effectiveness of the communications were obtained 
from a comparable group of “pure” controls who were 
not exposed to any of the communications. 


RESULTS AND DISCUSSION 


Effects of active participation. Initially, on 
each of the three key items in the precom- 
munication questionnaire, the difference be- 
tween the active participation group and the 
passive control group was nonsignificant. The 
opinion changes observed after exposure to 
the three communications are shown in Table 
22 The results indicate that in the case of two 
of the three communications (A and B), the 
active participants were more influenced than 
the passive controls. For both communica- 


? The table does not include the data on the “pure” 
(unexposed) control group. The net changes for this 
group were approximately zero in the case of all three 
key items, and the corresponding net changes for the 
active participants and the passive controls (shown in 
the last rows of the table) were significantly greater 
(¢’s range from .10 down to <.01). Hence, all three 
communications had a significant effect on the opinions 
of those who were either actively or passively 
to them. r 

The probability values reported throughout this 
paper are based on one tail of the theoretical distribu- 
tion. Whenever intergroup comparisons were made 
with respect to the net percentage who changed by 2 
given amount, the reliability of the difference was 
tested by the formula presented in Hovland, 
Lumsdaine, and Sheffield (3, p. 321). 


— 


ou ——————— 


| 
| 


e 


InFivence oF Rote PLAYING ON OPINION CHANGE 


213 


TABLE 2
COMPARISON OF ACTIVE PARTICIPANTS WITH PASSIVE CONTROLS ON AMOUNT OF CHANGE IN OPINION ESTIMATES

                            COMMUNICATION A:            COMMUNICATION B:            COMMUNICATION C:
                            (MOVIE THEATERS)            (MEAT SHORTAGE)             (COLD CURE)
CHANGES IN OPINION          ACTIVE        PASSIVE       ACTIVE        PASSIVE       ACTIVE        PASSIVE
ESTIMATES†                  PARTICIPANTS  CONTROLS      PARTICIPANTS  CONTROLS      PARTICIPANTS  CONTROLS
                            (N = 31)      (N = 57)*     (N = 29)      (N = 57)      (N = 30)      (N = 53)
Sizable increase            0%            2%            0%            2%            7%            6%
Slight increase             3%            9%            7%            14%           10%           9%
No change                   23%           20%           24%           16%           13%           19%
Slight decrease             29%           46%           27½%          49%           23%           15%
Sizable decrease            45%           23%           41½%          19%           47%           51%
Total                       100%          100%          100%          100%          100%          100%
Net change (% increase
minus % decrease)
  Slight or sizable change  −71%          −58%          −62%          −52%          −53%          −51%
  Sizable change            −45%          −21%          −41½%         −17%          −40%          −45%
  p                         .01                         .01                         [illegible]

* The number of cases in each passive control group is slightly smaller than expected from the N’s shown in Table 1 because the
data from a few cases were inadequate and hence were eliminated from the analysis (e.g., the individual failed to give an answer to
the particular question).

† The “net change (slight or sizable)” is defined as the percentage changing in the direction advocated by the communication
minus the percentage changing in the opposite direction. The “net sizable change” in the case of Communication A refers to the dif-
ference in the percentages who lowered and raised their estimate by 5,000 (movie theaters) or more. For Communication B, a sizable
change was 25 (per cent) or more; for Communication C it was 5 (years) or more.
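
The arithmetic behind the net change rows can be illustrated with a minimal Python sketch. The function name `net_change` and the dictionary layout are assumptions made for illustration only; the percentages are those reported in Table 2 for Communication A. Because all three communications advocated lower estimates, a negative net change indicates movement in the advocated direction.

```python
def net_change(dist, sizable_only=False):
    """Return % increase minus % decrease for one group's distribution.

    `dist` maps the five change categories of Table 2 to percentages.
    With sizable_only=True, only the two "sizable" categories are used.
    """
    if sizable_only:
        return dist["sizable_increase"] - dist["sizable_decrease"]
    increase = dist["sizable_increase"] + dist["slight_increase"]
    decrease = dist["sizable_decrease"] + dist["slight_decrease"]
    return increase - decrease


# Communication A (movie theaters), percentages as reported in Table 2.
active_a = {
    "sizable_increase": 0, "slight_increase": 3, "no_change": 23,
    "slight_decrease": 29, "sizable_decrease": 45,
}
passive_a = {
    "sizable_increase": 2, "slight_increase": 9, "no_change": 20,
    "slight_decrease": 46, "sizable_decrease": 23,
}

print(net_change(active_a), net_change(passive_a))
print(net_change(active_a, sizable_only=True),
      net_change(passive_a, sizable_only=True))
```

The sketch covers only the descriptive percentages; the reliability test cited from Hovland, Lumsdaine, and Sheffield (3, p. 321) is not reproduced here because its formula is not given in this paper.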


tions, the differences in net sizable change are 
statistically reliable, and the differences in net 
(slight or sizable) change, although non- 
reliable, are in the expected direction. 

In the case of the third communication 
(C), the two groups showed approximately 
the same amount of opinion change. But addi- 
tional findings (based on confidence ratings 
given by each S immediately after answering 
the key opinion questions) indicate that the 
active participants who presented Communi- 
cation C, like those who presented the other 
two communications, expressed a higher 
level of confidence in their postcommunication 
estimates than did the corresponding passive 
controls. Table 3 shows the net changes in 
confidence ratings for each of the three com- 
munications in terms of a breakdown that 
takes account of the direction and magnitude 
of opinion change. The breakdown was necessary 
inasmuch as a successful communication 
would be expected to increase the confidence 
only of those who changed their opinions in 
the direction advocated by the communica- 
tion. The net change in confidence shown for 
each subgroup is based on a comparison of pre- 
and postcommunication ratings given by each 
S, and was computed by subtracting the per- 
centage who showed a decrease in confidence 
from the percentage who showed an increase 
in confidence. In general, the findings in Table 
3 reveal a consistent pattern for all three 
communications: in every instance, active par- 
ticipation tended to have at least a slight posi- 
tive effect with respect to increasing the con- 
fidence of those whose opinion estimates were 
influenced by the communication. The results 
indicate that active participation resulted in 
a significant gain in confidence, particularly 
among those students whose opinion estimates 
were markedly influenced by Communication 
C.³ This finding is especially striking in view 


³ For the entire group of active participants who 
were exposed to Communication C, there was a net 
increase in confidence of 37 per cent; the corresponding 
net increase for the entire group of passive controls 
was only 13½ per cent. This difference was due en- 
tirely to the marked gain in confidence manifested by 
those students in the active group who had changed 
their opinion estimates in the direction advocated by 
the communication. The results in the first row of the 
table indicate that, among the students whose opinion 
estimates were uninfluenced by Communication C, the 
active participants showed a small net decrease in 
confidence which was equal to that shown by the 
passive controls. The next row of Table 3 indicates 
that, among those students who decreased their opinion 
estimates by at least one-half year or more after ex- 




TABLE 3
COMPARISON OF ACTIVE PARTICIPANTS WITH PASSIVE CONTROLS ON AMOUNT OF CHANGE IN CONFIDENCE

                                  NET CHANGE IN CONFIDENCE (PER CENT INCREASE MINUS PER CENT DECREASE)
SUBGROUP BREAKDOWN                COMMUNICATION A             COMMUNICATION B             COMMUNICATION C
ACCORDING TO CHANGES IN           ACTIVE        PASSIVE       ACTIVE        PASSIVE       ACTIVE        PASSIVE
OPINION ESTIMATES                 PARTICIPANTS  CONTROLS      PARTICIPANTS  CONTROLS      PARTICIPANTS  CONTROLS

1. Uninfluenced:                  12%           −5%           0%            +6%           −11%          −11%
   opinion estimates              (N = 8)       (N = 18)      (N = 9)       (N = 18)      (N = 9)       (N = 18)
   increased or unchanged
2. Influenced:                    +9%           −10%          +20%          +5%           +57%          +26%
   opinion estimates              (N = 23)      (N = 39)      (N = 20)      (N = 39)      (N = 21)      (N = 35)
   slightly or sizably decreased
   Gain from active               +19%                        +15%                        +31%
   participation
3. Highly influenced:             −7%           −38%          +25%          0%            +64½%         +15%
   opinion estimates              (N = 14)      (N = 13)      (N = 12)      (N = 11)      (N = 14)      (N = 27)
   sizably decreased
   Gain from active               +31%                        +25%                        +49½%
   participation


of the fact that the opinion change results for 
Communication C (Table 2) failed to show any 
gain from active participation. 
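
The confidence measure and the "gain from active participation" can be sketched in a few lines of Python. This is an illustrative reading of the definitions given in the text, not the authors' own computation: the function names are assumptions, the individual ratings in the first example are hypothetical, and the subgroup figures in the second example are the Communication C values read from Table 3 and footnote 3.

```python
def net_confidence_change(pre, post):
    """Per cent of subjects whose rating rose minus per cent whose rating fell."""
    if len(pre) != len(post) or not pre:
        raise ValueError("pre and post must be non-empty and of equal length")
    up = sum(1 for a, b in zip(pre, post) if b > a)
    down = sum(1 for a, b in zip(pre, post) if b < a)
    return 100.0 * (up - down) / len(pre)


def gain_from_participation(active_net, passive_net):
    """Difference between the active and passive net changes for one subgroup."""
    return active_net - passive_net


# Hypothetical confidence ratings for four subjects:
# two rise, one falls, one is unchanged.
pre_ratings = [2, 3, 3, 4]
post_ratings = [3, 4, 2, 4]
print(net_confidence_change(pre_ratings, post_ratings))

# Communication C subgroups, net changes as read from Table 3.
print(gain_from_participation(57, 26))     # influenced subgroup
print(gain_from_participation(64.5, 15))   # highly influenced subgroup
```

The two gains computed for Communication C correspond to the differences of 31 and 49½ per cent discussed in footnote 3.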

Insofar as confidence ratings can be re- 
garded as indicators of the degree of conviction 
with which the new opinions are held, the 
positive findings based on the opinion change 
data for Communications A and B are par- 
tially confirmed by the confidence change 
data based on Communication C. Thus, the 
data based on all three communications con- 
tribute evidence that the effectiveness of the 
communications (as manifested by opinion 
changes or by confidence changes) tended to 
be augmented by active participation. 

Although Ss were not told what their topic 
would be until they were about to begin giving 
the talk, it is possible that the ego-involving 
task of presenting one of the talks may have 


posure to Communication C, the active participants 
showed a greater net increase in confidence than the 
passive controls; the difference of 31 per cent approaches 
statistical significance (p = .07). Finally, the last row 
of the table shows that an even greater difference in 
confidence changes emerges when the comparison is 
limited to those students who decreased their opinion 
estimates by five years or more. (The 49½ per cent 
difference is reliable at beyond the .05 confidence 
level.) Further analysis of the subgroup data indicated 
that the differences shown in this table could not be 
attributed to statistical artifacts arising from initial 
differences between the various subgroups. 



given rise to emotional excitement or other 
interfering reactions which could have had the 
effect of reducing the Ss' responsiveness when 
passively exposed to the other two communi- 
cations. This possibility appears extremely 
improbable, however, in the light of supple- 
mentary control observations: 

1. Some of the passive controls had been 
exposed to the communications before giving 
their own talk, while others were passively 
exposed after having given their own talk. 
Nonsignificant differences were found in the 
amount of opinion change shown under these 
two conditions. 

2. The results from the passive controls 
were "replicated" by the results from an 
independent group of 16 students who did 
not give an oral presentation, but who were 
asked to follow the prepared outline carefully 
and to note down the main arguments given 
by each of the three speakers. Despite the fact 
that their notes were fairly complete and indi- 
cated a relatively high degree of attention to 
the content of all three communications, 
these supplementary controls displayed ap- 
proximately the same amount of opinion 
change as the original group of passive con- 
trols.⁴ 


⁴ It is conceivable, of course, that the activity of 
taking notes on the talks might have interfered with 




Observations pertinent to explanatory hypoth- 
eses. Many different types of speculative 
hypotheses could be put forth to account for 
the facilitating effects of active participation, 
postulating a gain in attention and learning 
from overtly rehearsing the communication, 
or a gain in comprehension from reformulating 
the arguments in one’s own words, or a gain 
in motivation from playing the role of com- 
municator, etc. Some supplementary ob- 
servations were made for the purpose of ex- 
ploring various factors which might provide 
leads to the key mediating mechanisms. 
Although far from conclusive, the evidence 
derived from these observations provides a 
preliminary basis for selecting explanatory 
hypotheses which warrant further experimen- 
tal analysis. 

The findings based on the supplementary 
controls (who were required to take notes on 
the three talks) suggest that variation in 
attention level probably was not a crucial 
factor that could explain the participation 
effects observed in the present experiment. 
More promising clues were discovered by 
taking account of differences in the types of 
reactions evoked by the three communica- 
tions. We have seen that in the case of Com- 
munications A and B, a clear-cut gain from 
active participation was manifested by changes 
in opinion estimates; but, in the case of Com- 
munication C, opinion estimates were un- 
affected, the gain being manifested only in 
the form of increased confidence. With a 
view to discovering some differentiating factor, 
we examined the available evidence bearing 
on the question of why active participation 


might be more effective under certain stim- 
ulus conditions (represented by Communica- 
tions A and B) than under other conditions 
(represented by Communication C). 

The first step in this inquiry was to examine 
E’s notes on: (a) the active Ss’ behavior while 
giving their talks, and (b) Ss’ statements in 
the informal interviews conducted at the end 


responsiveness to the persuasive content of the com- 
munications. While this possibility cannot be excluded, 
it seems implausible inasmuch as our Ss were college 
students who had had considerable practice in taking 
notes during lectures. Educational research on the 
effects of note taking indicates that this form of ac- 
tivity generally has a beneficial rather than a detri- 
mental effect on the student's ability to absorb the 
content of an oral communication (2). 




of each experimental session. These observa- 
tions provide two suggestive leads: 

1. The active participants who presented 
Communication C seemed to engage in less 
improvisation than those who presented the 
other two communications. The Communica- 
tion C group appeared to adhere much more 
closely to the prepared outline, making little 
attempt to reformulate the main points, to 
insert illustrative examples, or to invent addi- 
tional arguments. 

2. Active participants in the Communica- 
tion C group seemed to experience much more 
difficulty than the other groups in presenting 
their talks. During their performance they 
appeared to be more hesitant and tense. 
Afterwards, they expressed many more com- 
plaints about the task, claiming that their 
topic was more difficult to present than either 
of the other two. In general, these subjects 
seemed less satisfied with their performance 
than those who presented the other two topics. 

The first observation suggests that mere 
repetition of a persuasive communication 
may have little or no effect as compared with 
an improvised restatement. This observation 
is in line with some suggestive findings from 
an opinion change study by Kelman (4) in 
which seventh-grade students were given a 
communication, and, immediately afterwards, 
were offered various incentives to write essays 
in support of the communicator’s position. 
Kelman observed that the essays written by 
the group which showed the greatest amount 
of opinion change tended to be longer, to 
include more improvisation, and to be of 
better over-all quality (as rated by several 
judges) than the essays written by the other 
experimental groups. 

Reformulating and elaborating on the com- 
munication might be a critical factor in pro- 
ducing the gain from active participation, 
perhaps because the communicator is stimu- 
lated to think of the kinds of arguments, 
illustrations, and motivating appeals that he 
regards as most convincing. The importance 
of the improvisation factor in relation to par- 
ticipation effects could not be investigated 
further with the data at hand from the present 
experiment, but is currently being studied by 
the present authors in another experiment 
that is specifically designed to compare the 
effects of different types of active participa- 
tion. 


TABLE 4
SELF-RATINGS OF ACTIVE PARTICIPANTS IN EACH EXPERIMENTAL GROUP

                                             EXPERIMENTAL GROUPS (ACTIVE PARTICIPANTS)
SELF-RATING RESPONSE                         COMMUNICATION A:   COMMUNICATION B:   COMMUNICATION C:
                                             (Movie Theaters)   (Meat Supply)      (Cold Cure)
                                             (N = 31)           (N = 29)           (N = 30)
1. Over-all performance was at least         94%                83%                63%
   “satisfactory”
2. Rarely or never spoke in a monotonous     64%                76%                53%
   tone of voice
3. Rarely or never incoherent in pre-        74%                83%                57%
   senting arguments
4. No distortions or misinterpretations of   32%                52%                13%
   arguments in the prepared outline
5. No omissions of any of the main argu-     74%                72%                70%
   ments
6. Succeeded in giving the impression of     52%                52%                43%
   being “sincere”
Combined rating on all six items:            39%                52%                18%
   five or more favorable self-ratings


With respect to the second observation, it 
should be noted that there may have been 
an objective basis for the greater dissatisfac- 
tion experienced on Communication C be- 
cause of the greater amount of unfamiliar 
technical material it contained. The “cold 
cure” outline referred to a great many tech- 
nical details concerning the cold virus, anti- 
biotics, allergic reactions, and antihistamines. 
Many of these details were probably un- 
familiar to Ss, and consequently it may have 
been difficult for them to “spell out” the im- 
plications of the arguments. In contrast, the 
outlines for the other two topics contained 
very little technical material, relying mainly 
on arguments that were likely to be quite 
familiar to college students. 

Systematic evidence relevant to Ss’ per- 
ception of the difficulty of presenting each 
communication was obtained by making use 
of the self-rating schedule which each student 
filled out after exposure to the three communi- 
cations. Table 4 shows the percentage in each 
experimental group who rated their own per- 
formance as adequate or satisfactory on each 
of six self-appraisal items. 

The most comprehensive question was the 
following: “What is your over-all rating of the 
informal talk given by this speaker—how 
good a job do you think he did in presenting 
his material? ___ Excellent; ___ Very 
Good; ___ Satisfactory; ___ Poor; ___ 
Very Poor.” 


The percentage who rated themselves as 
"satisfactory" or better (shown in the first 
row of the table) was significantly lower for 
the group who presented Communication C 
than for the groups who presented Communi- 
cations A and B (p = .002 and .04, respec- 
tively). On the remaining five items, each of 
which dealt with a specific aspect of the 
speaker’s performance, the Communication 
C group also tended to rate themselves lower 
than did the other two groups. (On the com- 
bined rating, based on all six items, the per- 
centage differences are statistically significant 
at beyond the .05 confidence level.) The find- 
ings consistently indicate that the students 
in the Communication C group felt less satis- 
fied with their oral speaking performance than 
did those in the other two groups. Since the 
group differences in self-ratings tend to paral- 
lel the group differences in amount of gain 
from active participation, the results suggest 
that satisfaction with one's own performance 
may be a critical factor that determines the 
magnitude of participation effects. 


Further evidence which supports this 
hypothesis was obtained from an analysis of 
individual opinion changes, comparing active 
participants with high and low self-ratings 
for each of the three communications. For 
example, among the active participants who 
presented Communication C, there were 18 
students whose self-ratings were compara- 
tively “high” (three to six favorable responses) 



and 12 cases whose self-ratings were predom- 
inantly “low” (zero, one, or two favorable 
responses); 55 per cent of the “highs” as 
against only 17 per cent of the “lows” showed a 
sizable net opinion change in the direction 
advocated by the communications (p = .05). 
In general, the comparisons based on all three 
communications consistently indicate that a 
greater amount of opinion change occurred 
among those active participants who rated 
their oral speaking performance as satisfac- 
tory or better. Active participants who felt 
that they performed poorly, on the other 
hand, failed to show any more opinion change 
than the passive controls, and, in the case of 
Communication C, showed markedly less 
change than the passive controls (p = .07). 

During the experimental sessions there were 
no apparent sources of external social rewards 
from the environment. Since the others pres- 
ent remained silent, the active participant had 
no opportunity to know how they were react- 
ing to his talk, except possibly by subtle signs 
from their facial expressions or from their 
bodily movements. But even in the absence 
of any external cues to social approval, it 
seems probable that anticipations concerning 
such approval would occur if the individual 
felt that he was performing well, as expressed 
in his self-ratings. Thus, expectations of 
favorable audience reactions may have oc- 
curred less frequently among Ss who were 
required to perform the relatively difficult 
task of presenting the unfamiliar technical 
material in Communication C than among 
those who were required to perform the less 
difficult task of presenting Communication A 
or B. The increase in opinion change produced 
by role playing might be mediated by the 
individual's sense of achievement or his elated 
feelings about the adequacy of his oral per- 
formance. One hypothesis that would follow 
from this assumption is that when a person 
conforms outwardly to social demands by 
playing a role which requires him to advocate 
a given opinion, he will begin to believe what 
he is saying if he is made to feel that he says 
it well. 

Although the above hypothesis is suggested 
by the supplementary correlational findings, 
it will obviously remain open to question until 
tested by more precise methods. One cannot 
be certain that the responses used to assess 
“satisfaction” represent a separate variable 
which is causally related to opinion changes. 
Acceptance of the communication might be a 
common factor which inclines those who are 
most influenced to perceive themselves as 
having performed well, in which case the self- 
ratings might merely reflect the same thing as 
the measures of opinion change. Moreover, 
even if the two variables can be varied and 
measured independently, the possibility re- 
mains that the observed relationship may be 
due to some third variable, such as amount of 
improvisation. 

As was noted earlier, the group of active 
participants who showed the least amount of 
opinion change not only expressed a low de- 
gree of satisfaction but also displayed a rela- 
tive absence of improvisation in their oral 
performances. Either the “satisfaction” factor 
or the “improvisation” factor might prove to 
be a critical mediating variable. Before draw- 
ing a definite conclusion, it is necessary to in- 
vestigate each factor experimentally—for 
instance, by giving the Ss "expert" perform- 
ance ratings which raise or lower their feelings 
of satisfaction, and by using instructions which 
increase or decrease the amount of improvisa- 
tion. These methods are currently being 
employed in our further research on the effects 
of role playing. 

There is another important problem which 
arises from the findings in the present experi- 
ment and which also requires systematic in- 
vestigation: Does social role playing facilitate 
the internalization of externally imposed 
value judgments, mores, and taboos? The 
persuasive communications used in this study 
dealt with relatively impersonal beliefs about 
the future, and the main findings show that ac- 
ceptance of opinions of this sort was markedly 
increased by experimentally induced role 
playing. It remains problematical, however, 
whether active participation also influences 
the acceptance of opinions and attitudes that 
are more directly tied up with daily life ac- 
tivities, interpersonal relationships, and emo- 
tionally charged dilemmas. 

Obviously, it is unsafe to generalize widely 
from a single exploratory study based on the 
opinion changes of college students pro- 
duced in a somewhat artificial test situation. 
Nevertheless, the present experiment pro- 
vides preliminary evidence indicating that 
verbal conformity elicited by role playing can 
significantly influence the acceptance of new 


218 


beliefs. Under certain specifiable conditions 
which await further investigation, it seems to 
be true that “saying is believing." 


SUMMARY AND CONCLUSIONS 


The experiment was designed to determine 
whether or not overt verbalization, induced by 
role playing, facilitates opinion change. Male 
college students were assigned at random to 
two main experimental groups: (a) active par- 
ticipants, who, with the aid of a prepared out- 
line, played the role of a sincere advocate of 
the given point of view, and (b) passive con- 
trols, who silently read and listened to the 
same communication. In the experimental 
sessions, three different communications were 
used, each of which argued in favor of a 
specific conclusion concerning expected future 
events and was presented by a different active 
participant. Opinion measures obtained at the 
end of the session were compared with the 
"before" measures obtained about one month 
earlier. 

In general, the active participants tended 
to be more influenced by the communications 
than were the passive controls. In the case of 
two of the communications the active par- 
ticipants showed significantly more opinion 
change than the passive controls. In the case 
of the third communication, both groups 
showed approximately the same amount of 
opinion change, but active participation, 
nevertheless, tended to increase the level of 
confidence of those whose opinion estimates 
were influenced by the communication. The 
main findings, together with various methodo- 
logical checks, support the hypothesis that 
overt verbalization induced by role playing 
tends to augment the effectiveness of a per- 
suasive communication. 

Additional observations were analyzed in 
order to explore possible mediating factors 
underlying the gain in opinion change due to 
active participation. From behavioral records 
and interviews, two suggestive leads emerged. 
In those cases where role playing produced a 
marked increase in opinion change: (a) the 
individual displayed a relatively great amount 
of improvisation in his talk, and (b) he felt 
comparatively well satisfied with his oral 
speaking performance. The first factor sug- 
gests that the gain from role playing may occur 
primarily because the active participant 
tends to be impressed by his own cogent 
arguments, clarifying illustrations, and con- 
vincing appeals which he is stimulated to 
think up in order to do a good job of “selling” 
the idea to others. The second factor suggests 
an alternative explanation in terms of the 
rewarding effects of the individual's sense of 
achievement or feelings of satisfaction with 
his performance in the role of active partici- 
pant. Additional evidence pertinent to the 
second factor, based on a self-rating question- 
naire which the Ss filled out immediately 
after giving the talk, consistently indicated 
that the greatest amount of opinion change 
occurred among those active participants who 
felt that their oral speaking performance was 
satisfactory. Both the “improvisation” factor 
and the “satisfaction” factor warrant further 
investigation. 

REFERENCES 

1. Bavelas, A. Role-playing and management train- 
ing. Sociatry, 1947, 1, 183-191. 
2. Crawford, C. E. Some experimental studies of the 
results of college note taking. J. educ. Res., 1925, 
12, 379-386. 
3. Hovland, C. I., Lumsdaine, A. A., & Sheffield, 
F. D. Experiments on mass communication. Prince- 
ton: Princeton Univer. Press, 1949. 
4. Kelman, H. C. Attitude change as a function of 
response restriction. Hum. Relat., in press. 
5. Lippitt, R. The psychodrama in leadership training. 
Sociometry, 1943, 6, 286-292. 
6. Maier, N. R. F. Principles of human relations. 
New York: Wiley, 1952. 
7. Moreno, J. L. Psychodrama. Vol. 1. New York: 
Beacon House, 1946. 
8. Myers, G. C. Control of conduct by suggestion: 
an experiment in Americanization. J. appl. 
Psychol., 1921, 5, 26-31. 
9. Zander, A., & Lippitt, R. Reality-practice as edu- 
cational method. Sociometry, 1944, 7, 129-151. 


Received December 28, 1952. 




PERSONALITY CHANGES FOLLOWING TRANSORBITAL LOBOTOMY¹ 


HARRY W. ALLISON AND SARAH G. ALLISON 
University of Oklahoma 


ALTHOUGH transorbital lobotomy has 
been used extensively since 1946, 
there has been a paucity of experi- 
mental research on the effectiveness of this 
operation. And although much interest has 
been manifest in the psychological changes 
following this operation, the findings have 
been inconclusive and obscure. This may be 
in part because of a lack of sensitivity of some 
of our measuring instruments, but another 
reason for these inconclusive results has been 
the lack of rigid experimental controls. 

In many studies psychological tests have 
been administered to a few patients pre- and 
postoperatively. However, the experimenters 
often failed to validate their results by match- 
ing the operative group with a comparable 
group (in terms of age, sex, diagnosis, etc.) 
who were treated alike in every respect except 
that they did not undergo the operation. 

Owing to the relative simplicity and brevity 
of this operation, as well as the need for some 
type of aid for those patients who suffer 
chronic and severe states of mental illness, 
many psychiatric hospitals have utilized this 
operative procedure despite the fact that there 
is little conclusive and objective knowledge 
concerning the results. 

In view of the present general acceptance of 
these surgical measures, more precise investi- 
gations need to be undertaken for three 
purposes: (a) to determine more precisely 
those psychological changes which have oc- 
curred, (b) to furnish an objective basis for 
evaluating the effectiveness of transorbital 
lobotomy, and (c) to enable selection of those 
patients for whom this operative procedure 
seems warranted. 

A review of the literature indicates that some 
of the changes which are felt to result from 
lobotomy are: a decrease in anxiety and ten- 
sion (6, p. 658), some loss of insight and the 
ability to introspect (7), some loss of creativ- 
ity (6, p. 417), a decrease of enthusiasm and 
zeal (10), but a better personality integration 
which reflects a total improved personality 
adjustment. This study attempts to validate 
these empirical findings and, in addition, 
attempts to ascertain whether or not patho- 
logical indications of brain damage will be 
evidenced as a result of the transorbital 
lobotomy. 


¹ The authors are indebted to Dr. C. G. Holland, 
H. G. Hansen, and many others on the administrative 
staff at Western State Hospital, Staunton, Va., where 
this study was conducted. Grateful acknowledgement 
also is due to Dr. O. A. Trice, Mary Baldwin College, 
for his helpful advice, and to B. Moskowitz, University 
of Oklahoma, for statistical assistance. 


PROBLEM 


This study has as its over-all purpose the 
task of attempting to denote the personality 
changes which are effected by transorbital 
lobotomy. These changes are assessed by the 
Rorschach test, which is commonly thought 
to delineate the structure of personality. 

Klopfer (9, pp. 279-281) states that m 
(inanimate movement) reflects tensions within 
the personality structure. He adds that an 
individual experiences such promptings from 
within as hostile and uncontrollable forces 
working upon him, rather than as sources of 
energy at his disposal. If the lobotomy has the 
effect of reducing tension, a diminution of this 
scoring determinant, m, would be expected 
after the operation. 

It is generally believed that the lobotomy 
has the effect of lessening anxiety, apprehen- 
sions, and fears. Klopfer states, “Every single 
K (shading as diffusion) and every k (shading 
as three dimensional expanse projected on a 
two dimensional plane) response other than 
‘clouds’ to Card VII is an expression of some 
anxiety" (9, pp. 241-242). Therefore, it is 
hypothesized that fewer K and k determinants 
will be elicited postoperatively. 

It is hypothesized that the lobotomized pa- 
tient loses much of his ability to introspect; 
that is, there is a lessening of his critical self- 
appraisal and insight. Thus, a decrease in the 
measure of introspection and self-awareness, 
FK (shading as three dimensional expanse in 
vista or perspective), would be expected in the 
Rorschach protocols after the operation. 




If creative imagination is reduced in lobot- 
omized patients, it would be expected that 
this Rorschach determinant, M (human move- 
ment), should decrease. 

Since transorbital lobotomy has the effect 
of destroying cortical tissue, it would be 
expected that this cortical damage would be 
revealed as organic pathology on the Ror- 
schach test. The extent of such pathology is 
evaluated in this study by the use of a check 
list proposed by Dorken and Kral (3). 

If the lobotomized patient has lost his ar- 
dent enthusiasm and active interest to some 
degree, this should be reflected by prolonged 
reaction times to the ten Rorschach cards. 

Finally, if transorbital lobotomy is to effect 
a better level of adjustment, the Rorschach 
test should manifest this improvement. There- 
fore, it would be expected that two quantita- 
tive measures of general adjustment, the 
Munroe check list (12) and the Harrower- 
Erickson check list (8), would reflect this 
improved adjustment. 


PROCEDURE 


Subjects. All the subjects (Ss) in this experiment 
were patients at Western State Hospital, Staunton, 
Virginia. The eight psychotic patients comprising the 
operative group were selected for transorbital lobotomy 
on the basis of two criteria: (a) They had failed to 
respond to all types of therapy employed at this hos- 
pital. (b) The relatives of these patients had signed 
operative permits. The control group was derived from 
the parent hospital population of approximately 2,500. 
Eight control Ss were selected who matched the ex- 
perimental group on seven criteria (sex, age, education, 
diagnosis, treatment, duration of hospitalization, and 
duration of illness). 

Table 1 shows a comparison of the two groups for 
five of the seven matching criteria. In regard to diag- 
nosis, each group contained six schizophrenics, one 
involutional-paranoid, and one manic-depressive, 
depressed type. All patients in both groups had re- 
ceived extensive electroshock therapy with the excep- 
tion of one matched pair who had received no therapy 
of this type prior to the experiment. 

One month prior to the operation, all Ss of both 
groups were given the Rorschach test. In an attempt to 
control all variables, from this time until one month 
after the operation the “buddy system” was rigidly 
enforced; that is, each patient of the experimental 
group was paired with a patient of the control group, 
and each pair was housed on the same ward, attended 
occupational therapy together daily, participated in 
the same recreational activities, ate at the same time, 
etc. One month from the date of testing, the experi- 
mental group received the operation. For this opera- 
tion it is customary to use electroconvulsive shock to 
induce anesthesia. In similar fashion, the control 
group was given the same number and intensity of 
electroshock treatments at this time but did not receive 
the operation. One month following the date of opera- 
tion the Rorschach test was readministered to all Ss 
in both groups. The 32 Rorschach protocols were 
scored by the senior author, employing the Klopfer 
and Kelley (9) system of scoring. The identity of each 
protocol was masked and all protocols were scored in 
random order. 


TABLE 1 

COMPARISON OF CONTROL AND EXPERIMENTAL GROUPS 
IN TERMS OF MATCHING CRITERIA 

                                  MEANS
GROUP*          AGE   EDUCATION        DURATION OF        DURATION OF
                                       HOSPITALIZATION    ILLNESS
Experimental    48    9 yrs. 9 mos.    6 yrs.             12 yrs. 3 mos.
Control         45    10 yrs. 7 mos.   5 yrs. 1 mo.       12 yrs.

* Each group was composed of one male and seven female 
patients. 


RESULTS 


Cronbach (2) criticizes the use of many of 
the commonly employed statistical techniques 
in Rorschach research because they do not 
control for the inequality of the scale intervals 
and the skewness of the distribution of the 
data. To meet these requirements made 
explicit by Cronbach concerning the inequal- 
ity of points along the continua and the non- 
normal character of our data, a nonparametric 
or distribution-free method of analysis was 
utilized. Moses (11) has recently reviewed and 
evaluated this type of statistic. 

The usual manner of obtaining a distribution-free
test of significance is to place the data in rank
order independently on the two variables concerned
and then to obtain the probability with which the
smaller sum of rankings may appear. However, since
the patients in this experiment were matched, a
paired-score difference technique was used and the
positive and negative differences ranked
independently. The probability of the lowest sum of
rankings, either positive or negative, was then
determined. Wilcoxon (15) discusses this particular
nonparametric method under the title, “Paired
Replicates,” and provides an abbreviated table of
significant values (15, p. 14).
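
The paired-replicates procedure described above can be sketched as follows. This is a minimal illustration in Python, assuming the usual Wilcoxon convention of ranking the absolute paired differences jointly and giving tied values their mean rank. The function names and the eight difference scores are hypothetical and are not data from this study.

```python
def signed_rank_totals(differences):
    """Return (positive_total, negative_total) of ranks for paired differences."""
    # Zero differences carry no sign and are dropped before ranking.
    ordered = sorted((d for d in differences if d != 0), key=abs)
    positive_total = 0.0
    negative_total = 0.0
    i = 0
    while i < len(ordered):
        # Find the run of tied absolute values starting at position i.
        j = i
        while j < len(ordered) and abs(ordered[j]) == abs(ordered[i]):
            j += 1
        mean_rank = (i + 1 + j) / 2  # mean of ranks i+1 .. j
        for d in ordered[i:j]:
            if d > 0:
                positive_total += mean_rank
            else:
                negative_total += mean_rank
        i = j
    return positive_total, negative_total


def lower_rank_total(differences):
    """Return (sign, smaller rank total); sign marks the predominant direction."""
    positive_total, negative_total = signed_rank_totals(differences)
    if negative_total < positive_total:
        return "+", negative_total
    if positive_total < negative_total:
        return "-", positive_total
    return "±", positive_total


# Hypothetical difference scores for eight matched pairs
# (experimental change minus control change); not study data.
example_differences = [-3, -1, -4, 2, -6, -5, -8, -7]
print(lower_rank_total(example_differences))
```

The smaller total would then be referred to a table of critical values such as the one Wilcoxon provides (15, p. 14).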
From an inspection of Table 2 it can be
noted that four of the ten Rorschach factors
significantly differentiate between the control


PERSONALITY CHANGES FOLLOWING TRANSORBITAL LOBOTOMY 


TABLE 2
NONPARAMETRIC COMPARISONS OF CONTROL AND
EXPERIMENTAL GROUPS ON TEN RORSCHACH
FACTORS AND THREE RORSCHACH CHECK LISTS


RORSCHACH   LOWER RANK    RORSCHACH             LOWER RANK
FACTORS     TOTALS        FACTORS               TOTALS
K% + k%     -1.00         Rejections            -6.50
FK%         -4.50*        Responses             -11.00
W%          +4.50*        Reaction time         +3.00*
M%          ±7.50         Check lists
m%          -1.00**         Harrower-Erickson   ±13.00
P%          ±16.00          Munroe              13.50
FC%         -9.00           Dörken & Kral       -6.00


* Significant at the .05 level.
** Significant at the .01 level.


and experimental groups.² The positive and
negative signs preceding the lower rank total 
indicate an increase or decrease respectively 
of the measured factor for the operative or 
experimental group as contrasted with the 
control group. A plus-minus (±) lower rank
total denotes a lack of differentiation between
the groups. Thus, it can be seen that
there is a significant decrease of m% and FK%,
and a significant increase in W% and Reaction 
Time in the experimental group after the 
operation. All other factors studied were not 
significant, although there are obvious trends 
in regard to organicity (as measured by the 
Dörken and Kral check list), number of re- 
jections, and K% + k%. 


Discussion 


From these data it can be inferred that 
transorbital lobotomy produces or results in a 
lessening of inner tension, a lessening of 
introspective self-awareness and insight, and a 
loss of ardent enthusiasm and active interest or 
zeal. The significant increase in W% is diffi- 
cult to interpret except as some change in 
apperception, although it may indicate a 
better organizational ability. 


Current literature contains abundant qualitative
information concerning the functions
of the frontal lobes. According to Freeman


2 To save printing costs, two pages of Rorschach raw 
score data for both the experimental group and the 
control group have been deposited with the ADI. 
Order Document No. 4058 from the ADI Auxiliary 
Publications Project, Photoduplication Service, Library 
of Congress, Washington 25, D. C., remitting in
advance $1.25 for 35 mm. microfilm or $1.25 for 6 by 8
in. photocopies. Make checks payable to Chief,
Photoduplication Service, Library of Congress.




and Watts (5), this operation upon the fron- 
tal lobes succeeds because it divorces psychot- 
ic ideas from accompanying emotional com- 
ponents. They state that the psychotic ideas 
usually persist for a while following the opera- 
tion but gradually fade away upon being re- 
leased from their affective charge. They feel 
that this is due to the sectioning of the an- 
terior thalamocortical projections through 
which the affective and emotional charges 
surge to the prefrontal areas to become in- 
tegrated with the intellectual processes of 
foresight, imagination, and consciousness of 
self. 

Freeman and Watts have emphasized the 
point that “. . . emotional tension is the prime 
requisite for success in Prefrontal Lobotomy” 
(6, p. 658). It appears that as yet there is no 
psychiatric diagnosis that causes one to think 
immediately of psychosurgery. Rather, there 
is a constellation of symptoms described by 
Arnot as, “a fixed state of tortured self-concern”
(1, p. 267). Inner emotional tension
is implied in this description, and Robinson
states the following in this regard: “Psychosurgery,
then, not only relieves emotional
tension; it prevents the development of future 
tensions by reducing the individual's aware- 
ness of his own self-continuity” (14, p. 422). 
It was observed by the authors that following 
the operation, some patients, particularly 
the paranoid schizophrenics, continued to 
manifest delusional ideation. However, these
formerly painful thoughts and beliefs seemed 
to become less severe in that these patients 
evidenced less tension and emotional concern 
about this ideation. For example, prior to the 
operation one paranoid patient was convinced 
that he was being followed by a “big, black 
nigger” who was intent on killing him. Follow- 
ing the operation, he continued to feel that 
he was being followed; the delusion remained, 
but he no longer appeared to have the tre- 
mendous emotional concern about it. The 
most significant finding in the present study, 
a pronounced decrease of m%, tends to con- 
firm these empirical observations concerning 
a lessening of tension as a result of this opera- 
tive procedure upon the frontal lobes. 

Freeman and Watts have also advanced the 
hypothesis that the frontal lobes are espe- 
cially concerned with foresight and insight 
and that the emotional component associated 




with these functions is supplied by the thala- 
mus. They feel that when the thalamic 
connections are severed, the functions of 
foresight and insight suffer temporary obliteration
“. . . and even in the later course of
recovery are never as completely endowed with 
feeling tone as they were before” (7, pp. 3-4). 
The finding in the present study of a signifi- 
cant decrease of FK% in the experimental 
group lends support to this hypothesis in 
regard to a lessening of introspective self- 
awareness and insight as a result of trans- 
orbital lobotomy. 

The significantly prolonged reaction times 
seen in this study as a result of the operation 
may well be related to what has been postu- 
lated by Landis. He believes that the loboto- 
mized patient lacks vigilance, “. .. acting as 
though he were sleepy, often making such 
remarks as, ‘I’m too tired to do this’” (10, 
p. 413). Landis further feels that the loboto- 
mized patient has lost his zealousness, his 
ardent enthusiasm, and active interest. 

There is obscure and conflicting evidence 
concerning signs of intracranial pathology 
due to lobotomy as shown by psychological 
tests. In this regard, Freeman states that 
“the failure of mental tests to reveal either 
positive or negative symptoms after removal 
of a considerable portion of both frontal 
lobes means either that the tests heretofore 
developed are not sensitive to the defects, or 
that the powers of compensation on the part 
of the remaining portions of the frontal lobes 
are so great that no measurable defect re- 
mains” (4, p. 102). Although the authors of 
this study utilized a brain damage check 
list which was thought to be more sensitive 
than that proposed by Piotrowski (13), no 
significant increase of organic signs was demon- 
strated in the experimental group as a result 
of the operation. In the same light, the failure 
of both the Munroe and Harrower-Erickson 
check lists to demonstrate any quantifiable
change in total adjustment in the operative
group may also be due to the fact that these
check lists are too insensitive for this par- 
ticular discrimination. 

It is to be noted that, although nonsignifi- 
cant, three of the factors which were studied 
did approach the defined level of significance. 
In the experimental group as contrasted with 
the control group there was an obvious decrease
in the number of rejections and in K%
+ k%, along with an increase in signs of 
intracranial pathology. With a larger control 
and experimental population it is possible 
that significant evidence of brain damage and 
a significant decrease of emotional blocking 
and anxiety will be more clearly noted. 

In 1946, Freeman stated that “statistics 
are a poor medium by means of which to 
convey the changes that occur in patients 
following Prefrontal Lobotomy” (6, p. 657). 
He felt that the empirical evidence of change 
in some of these patients and their resulting 
transformation into placid, quiet, uncom- 
plaining individuals who showed little con- 
cern about their troubles was justification 
enough for the more drastic operative proce- 
dures. However, the authors of this study are 
of the opinion that only by precise investiga- 
tions can the psychological changes be deter- 
mined. And it can only be from an accurate 
knowledge of these changes that an objective 
basis for the evaluation of the effectiveness 
of the transorbital lobotomy operation can be 
established. Furthermore, we will be better 
able to select those patients for whom this 
operative procedure seems warranted only if 
we have precise and objective knowledge of 
those personality characteristics which are 
most amenable to change due to this type of 
psychosurgery.


SUMMARY 


This study was designed to investigate the 
personality changes effected by a particular 
type of psychosurgery, transorbital lobotomy. 
The experimental or operative group was com- 
posed of eight hospitalized psychotic patients. 
From the parent hospital population, eight 
control subjects were selected on the basis of 
their close resemblance to the individual mem- 
bers of the experimental group in regard to 
seven criteria by which they were matched. 
The Rorschach test was administered to both 
groups one month prior to, and one month 
following, the date on which the experimental 
group received the operation. During the interim
between pre- and posttesting, every
effort was made to control environmental
variables; members of each matched pair
were treated alike, except that the controls
did not receive the operation. However, the
controls did receive electroshock treatments
comparable in number to those given the
experimental Ss on the day of operation.

Four Rorschach factors significantly dif- 
ferentiated between the control and experi- 
mental groups. There was a significant de- 
crease of m% and FK% and a significant 
increase in W% and Reaction Time in the 
experimental group following the operation. 
From these data it can be inferred that transorbital
lobotomy results in a lessening of inner
tension, a lessening of introspective self-awareness
and insight, and a loss of ardent enthusiasm
and active interest. The significant increase
in W% is difficult to interpret except as
some change in apperception. These findings
and their implications should be considered if 
psychosurgery of this type is planned. 


REFERENCES 


1. ARNOT, R. E. Clinical indications for pre-frontal
lobotomy. J. nerv. ment. Dis., 1949, 109, 267-269.

2. CRONBACH, L. J. Statistical methods applied to
Rorschach scores. Psychol. Bull., 1949, 46,
383-429.

3. DÖRKEN, H., JR., & KRAL, V. A. The psychological
differentiation of organic brain lesions by means
of the Rorschach. Amer. J. Psychiat., 1952, 108,
764-770.

4. FREEMAN, W. Personality and the frontal lobes.
Univer. west. Ontario med. J., 1951, 21, 101-117.

5. FREEMAN, W., & WATTS, J. W. Interpretation of
functions of frontal lobe based upon observations
in 48 cases of prefrontal lobotomy. Yale J.
Biol. Med., 1939, 11, 527-539.

6. FREEMAN, W., & WATTS, J. W. Progress in neurology
and psychiatry. New York: Grune & Stratton,
1946.

7. FREEMAN, W., & WATTS, J. W. Psychosurgery
during 1936-1946. Arch. Neurol. Psychiat.,
1947, 58, 417-425.

8. HARROWER-ERICKSON, MOLLY R. The value and
limitations of the so-called “neurotic signs.”
Rorschach Res. Exch., 1942, 6, 109-114.

9. KLOPFER, B., & KELLEY, D. M. The Rorschach
technique. Yonkers, N. Y.: World Book Co.,
1946.

10. LANDIS, C. Psychological observations on
psychosurgery patients. Psychiat. Quart., 1951, 26,
409-417.

11. MOSES, L. E. Non-parametric statistics for
psychological research. Psychol. Bull., 1952, 49, 122-143.

12. MUNROE, R. L. Prediction of the adjustment and
academic performance of college students by a
modification of the Rorschach method. Appl.
Psychol. Monogr., 1945, No. 7.

13. PIOTROWSKI, Z. Positive and negative Rorschach
organic reactions. Rorschach Res. Exch., 1940,
4, 147-151.

14. ROBINSON, M. F. Psychological effects of
psychosurgery. Dig. Neurol. Psychiat., 1949, 17, 422.

15. WILCOXON, F. Some rapid approximate statistical
procedures. New York: American Cyanamid
Co., 1949.


Received December 19, 1952. 




SOME DETERMINANTS OF BEHAVIORAL RIGIDITY¹


DEE G. APPLEZWEIG 
Connecticut College 


“PERSONALITY-CENTERED” psychology has
recently been receiving much attention
(4). According to one of the most
extensive researches in this area (1), we can,
for example, type a person as an “authoritarian
personality,” and should be able to
predict how he will behave in nonsocial as
well as social situations. One of the distinguishing
characteristics of such a person, we find,
is rigidity. According to Rokeach, “the rigidity
inherent in an ethnocentric person's solution of
social problems is not an isolated phenomenon
within the personality, but is rather an aspect
of a general rigidity factor which will also
manifest itself in the solution of any problem,
be it social or non-social” (18, p. 259).

Further analysis of the concept of rigidity, 
as it is understood by other investigators, 
reveals that there exists little agreement as to 
the specificity or generality of rigidity of an 
individual (1, 7, 10, 17). The Kounin-Werner 
controversy (14, 21) and, more recently, the 
Luchins-Rokeach controversy (17, 19) em- 
phasize the differences of conceptualization. 
One of the striking deficiencies in this area of 
study has been the lack of agreed upon meas- 
urement techniques; investigators have usually 
developed their tests independently of each 
other. The present investigation was an
attempt to synthesize some of the antecedent
studies.²

The study was designed to sample the so- 
called measures of rigidity employed in a 
number of other investigations, singly or in 
various combinations (1, 2, 7, 8, 10, 16, 18), 
to discover empirically what relationships 
might exist among them when all are ad- 
ministered to the same subjects. One of the 
hypotheses being tested, then, was related to 
the existence of a general factor of rigidity,
since the conclusions of the experimenters

1 This report is based upon a dissertation submitted 
to the University of Michigan in partial fulfillment of 
the requirements for the degree of Doctor of Philosophy. 
The writer wishes to express her appreciation to Dr. 
E. Lowell Kelly for his guidance and criticism in the 
direction of the dissertation and for reading and criticiz- 
ing this manuscript. 

2 Not all of the data collected will be presented here.
Additional information can be found elsewhere (3).


noted above include such variations as the 
assumption of a general factor of rigidity (18), 
the proposal of an ego level and a peripheral 
level of rigidity (10), and an emphasis on the 
importance of social field conditions (16). 
This investigator was further interested in 
the relationship between age and rigidity in 
normal subjects, with emphasis on possible
changes in this relationship in old age as a
function of insecurity rather than of age itself.
Therefore, the present study was conducted 
under three varying conditions of security. A 
second purpose of this study, then, was to 
investigate empirically how scores on each 
measure, and the relationships among these 
measures, might change under stress, i.e., to 
test for differences under varying conditions 
of security. 

We may, therefore, state our major hypothe- 
ses as follows: (a) There is no general factor 
among a number of so-called measures of
rigidity under varying conditions of security.
(b) On any particular measure of rigidity
employed, there are no significant differences
between mean scores of equivalent or matched 
groups under varying conditions of security. 

It was expected that the first hypothesis 
would be confirmed, but that the second 
hypothesis might be rejected. 


METHOD 


The subjects (Ss) were 79 candidates for Submarine
School at the U.S. Naval Submarine Base, New London,
Connecticut.³ The stress condition was a real-life threat.
All Submarine School candidates at the Base must pass,
in addition to the physical and psychological
requirements, a pressure-chamber and water-tank-escape test.
Thus, each candidate must demonstrate the ability to
withstand 50 pounds of atmospheric pressure. If he fails
this test, he is ineligible for further Submarine School
training. If he passes, another test involves an “escape”
from a 100-foot tank of water. Candidates are checked
medically before the pressure test, and there is no
physical reason why any of them should not be able to
withstand 50 pounds of pressure. Yet candidates do


3 The author wishes to express her thanks to Lt.
Comdr. Dean Farnsworth, Comdr. Harry J. Alvis, and
Captain T. L. Willmon for their cooperation in
providing Ss and facilities at the U. S. Naval Submarine
Base, New London, and to the personnel at the base
who served as Ss.




TABLE 1
MEANS, SD's, AND F's FOR AGE, EDUCATION, GCT SCORES, AND LENGTH OF SERVICE


                   AGE (in yrs.)    EDUC. (in yrs.)   GCT              SERV. (in mos.)
GROUP        N     Mean    SD       Mean     SD       Mean     SD      Mean    SD
Day Before   24    19.17   1.61     11.83*   1.32     59.50    5.76    13.79   16.84
Day After    26    20.31   3.10     11.32*    .88     56.48*   6.13    29.15   34.93
Week After   29    19.52   2.84     11.50*   2.64     58.93    6.65    17.76   21.14
F†                 1.19             1.40              1.60             1.77


* Information not available for one S in this group.


fail at this point. Although no one has ever "failed" 
the tank-escape test, there have been candidates who 
refused to take the test. Therefore, it was believed 
that this total situation represented a threat to the 
candidate which might be considered a threat to his 
security. In support of this assumption, Cook and
Wherry (9) have demonstrated that undergoing routine 
training at the escape training tank is sufficiently 
stressful to candidates to be reflected in a significant 
increase of 17-ketosteroid output over basal samples, 
the highest increase being during the prestress period. 

In our study, one group was tested the day before 
the stress situation, one group was tested one day 
after, and a third group was tested one week after the 
stress situation. (The N’s were 24, 26, and 29,
respectively.) Each group comprised the entire group of
available candidates during a particular month for 
three successive months, and may be considered a 
random sample of candidates for the purposes of this 
experiment, since no selection or elimination was em- 
ployed by the experimenter. A comparison of these 
groups for age, education, length of service, and Navy 
General Classification Test scores yields no significant 
differences (see Table 1). 

The battery of tests included six measures of rigidity: 
the group Rorschach (scored for rigidity according to 
Fisher [10]), the California Ethnocentrism Scale in its 
final 20-item questionnaire form (1), the Angyal Per- 
ceptual Test (2), the Luchins arithmetic Einstellung 
problems (16), the Luchins Hidden Words Test (16), 
and a Hidden Objects test (7, 15). Since all of these 
measures have been described in the literature, they
will not be described further here. All were group ad- 
ministered in a single day with one rest period. The 
exact forms for all of these measures, and instructions 
for their administration and scoring, as well as further 
information on the complete battery utilized, can be 
found elsewhere (3). 


RESULTS AND DISCUSSION

Kendall’s (12) tau correlation coefficients 
were computed for the scores of every test 
with every other test for each of the three
groups; the resulting matrices are shown in 
Table 2. An examination of this table reveals 
no evidence of a generalized rigidity. Of the 
45 correlations among the six rigidity measures, 
22 are found to be negative, 21 are positive, 
and two are zero. Furthermore, only three of 
these 45 correlations reach the .05 level of 


† None of the differences was significant.


significance (we could have expected two by 
chance), and two of these three turn out to 
be negative correlations. There is, with rare 
exception, no consistency between any two 
measures under all three conditions. For 
example, we find the correlations between 
Ethnocentrism and Hidden Objects to be 
-.47, +.15, and +.01.
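
The counts quoted above follow from simple arithmetic, and the statistic itself is easy to state. The sketch below assumes the uncorrected form of Kendall's tau (no adjustment for ties). Whether the author applied a tie correction is not stated, so this is an illustration and not a reconstruction of her computation. The function name and the sample scores are hypothetical.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """Kendall's tau without tie correction for two equal-length score lists."""
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(list(zip(x, y)), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            concordant += 1
        elif product < 0:
            discordant += 1
    n_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / n_pairs


# Counts taken from the text: six rigidity measures, three groups.
n_measures = 6
n_groups = 3
pairs_per_group = n_measures * (n_measures - 1) // 2
total_correlations = pairs_per_group * n_groups
# Number of correlations expected to reach the .05 level by chance alone.
expected_significant_by_chance = total_correlations * 0.05

print(pairs_per_group, total_correlations, expected_significant_by_chance)

# Hypothetical scores for five subjects on two measures (not study data).
print(kendall_tau_a([1, 2, 3, 4, 5], [2, 1, 4, 3, 5]))
```

Fifteen pairs of measures in each of three groups give the 45 coefficients, and 5 per cent of 45 is about two, which is the chance expectation the text refers to.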
In other words, we seem to be dealing with 
a chance distribution of correlation coefficients 
in spite of the fact that we chose measures on 
the basis of their having been used in previous 
investigations as measures of rigidity. There 
are two possible explanations for such findings: 
(a) the measures are not reliable, and un- 
fortunately reliability figures are not reported 
for any of the rigidity measures except the 
Ethnocentrism Scale (estimated at .90) and 
Hidden Objects Test (.55); and (b) the tests
are all measuring different things. The ab- 
sence of a general factor of rigidity is further 
borne out by the coefficients of concordance 
(W) among these measures (W = .073, .151, 
and .153 for the Week After, Day After, and 
Day Before groups respectively). None of 
the three approaches statistical significance. 
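
For readers unfamiliar with the coefficient of concordance, a minimal sketch follows. It assumes the standard Kendall formula for m measures that each rank the same n subjects, with no correction for tied ranks. The function name and the small rank tables are hypothetical and are not data from this study.

```python
def kendalls_w(rank_table):
    """Coefficient of concordance W.

    rank_table[i][j] is the rank (1..n) of subject j on measure i.
    Assumes complete rankings without ties.
    """
    m = len(rank_table)          # number of measures
    n = len(rank_table[0])       # number of subjects
    rank_sums = [sum(row[j] for row in rank_table) for j in range(n)]
    mean_rank_sum = m * (n + 1) / 2
    # Sum of squared deviations of the subjects' rank sums from their mean.
    s = sum((total - mean_rank_sum) ** 2 for total in rank_sums)
    return 12 * s / (m ** 2 * (n ** 3 - n))


# Hypothetical example: three measures ranking four subjects identically.
perfect_agreement = [[1, 2, 3, 4], [1, 2, 3, 4], [1, 2, 3, 4]]
# Hypothetical example: two measures ranking four subjects in opposite order.
opposed = [[1, 2, 3, 4], [4, 3, 2, 1]]

print(kendalls_w(perfect_agreement), kendalls_w(opposed))
```

W runs from 0 (no agreement among the measures) to 1 (identical rankings), so values such as .073 to .153 indicate very little common ordering of subjects across the six measures.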
Space does not permit a detailed discussion 
of these results or a comparison of how these 
results support or contradict those of other 
experimenters. However, because of the wide 
audience which has been reached by the work 
of Adorno, Frenkel-Brunswik, Levinson, and 
Sanford (1), and the so-called confirming 
evidence of Rokeach (18), let us look at the
results for the Ethnocentrism Scale a little 
more closely. According to the investigators 
just named, ethnocentric people can be differ- 
entiated from nonethnocentric people. As 
Bruner has stated it, “. . . the major emphasis is
upon the representation of certain generalized
personality processes in different specific
spheres of mental functioning” (6, p. 122)
(e.g., rigidity in so-called nonsocial
problem-solving situations). Our results do not support
such a simplified typology.


TABLE 2
MATRICES OF CORRELATION COEFFICIENTS (TAU) FOR
A BATTERY OF SIX RIGIDITY MEASURES AND THE
NAVY GENERAL CLASSIFICATION TEST†


TASK     ROR.   ETH.    AR.     WDS.    OBJ.      ANG.    GCT
                +.16    +.04    +.05    +.07      -.05    -.14
Ror.     x      -.02    -.09    .00     -.15      +.04    -.20
                -.14    -.17    +.08    +.08      -.05    -.12

                        +.15    +.23    -.47      +.02    -.25
Eth.            x       -.3*    +.31*   +.01      -.05    -.40**
                        +.21    +.05    +.15      -.10    -.32*

                                +.20    -.39*     -.05    +.06
Arith.                  x       -.26    -.18      .00     +.33
                                +.03    +.16      +.16    -.29

                                        -.11      -.07    -.03
Wds.                            x       -.08      +.09    +.03
                                        [illeg.]  -.25    +.34*

                                                  -.19    +.01
Obj.                                    x         +.28    -.03
                                                  -.03    -.27

                                                          [illeg.]
Ang.                                              x       +.05
                                                          +.07

GCT                                                       x


* Significant beyond the .05 level of confidence.
** Significant beyond the .01 level of confidence.
† Top row = week after stress; middle row = day after;
bottom row = day before.


The only consistent positive correlations
under all three conditions are between the
Ethnocentrism Scale and the Hidden Words
Test, and even these become negative,
incidentally, if we interpret the Hidden Words
Test in the manner suggested by Cattell and
Tiner (7). Even if we divide our groups into
“Highs” and “Lows” on the Ethnocentrism
Scale, and compare their respective solutions
on the Luchins arithmetic Einstellung
problems for rigidity of solution, in a manner
similar to that employed by Rokeach, we find
contradictory results (see Table 3).

The results for the Day Before group
support the Rokeach study in that we find the
Highs solve fewer problems by the short
method than the Lows, and this difference is
statistically significant (p = .02). In the
Week After group, we find that both Highs
and Lows solve about the same number of
problems in the nonrigid way. The really
contradictory results, however, are those
produced by the Day After group, where the Highs
produce significantly more nonrigid solutions
than the Lows, a direct reversal of Rokeach’s
findings (p = .02).


TABLE 3
A COMPARISON OF MEAN NUMBER OF NONRIGID
SOLUTIONS ON ARITHMETIC EINSTELLUNG PROBLEMS
OF HIGHS AND LOWS ON ETHNOCENTRISM


                                                    STANDARD
                   MEANS                            DEVIATIONS
GROUP         N    Highs   N    Lows   t      p     Highs   Lows
Day Before    12   .50     12   2.3    2.74   .02   .87     1.82
Week After    14   .50     15   .67    .50    NS    .90     .87
Day After     13   1.08    13   .15    2.66   .02   1.?4    .36


Brown (5) also found that the correlations
between the F scale and several variations of
the arithmetic Einstellung problems varied
according to the conditions of administration,
and were apparently a function of the degree
of ego involvement. It is suggested that
Rokeach's results were obtained under
ego-involving rather than neutral conditions.

From an analysis of the findings so far
reported, it would appear that we are left
essentially with the rather disturbing picture of
low correlations among our measures, which
appear to behave erratically under different
conditions of motivation. We conclude,
therefore, that there is no general factor among a
number of so-called measures of rigidity under
varying conditions of security. Let us now
turn to a consideration of this “erratic”
behavior.

As was pointed out above, we would seem
well advised to be cautious about accepting a
simple personality-centered theory which
seems to leave little room for variations in
field conditions. Harris (11) reports that
creating a “test” situation increased rigidity.
Luchins (17) has stressed the importance of
field conditions in creating or overcoming the
Einstellung effect. Although he attempted to
exclude insecurity from his study, Kounin
(13) does mention it, and other motivational
factors, as causes of rigidity.


We find that the intergroup comparisons in
this study can be quite meaningfully organized
within the framework of some speculations
about the different motivations within each
condition. Briefly, as can be seen in Table 4,
there are some statistically significant mean
differences among the groups, although the
direction of these differences does not appear
to be consistent. Let us first compare the more
extreme conditions: the Day Before and the
Week After groups.
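
The legible F values in Table 4 appear to be consistent with a simple ratio of the larger to the smaller group variance. The author does not state how F was obtained, so this reading is an assumption inferred from the tabled figures. The sketch below checks it against the standard deviations printed for the Rorschach and Ethnocentrism rows; the function name is hypothetical.

```python
def variance_ratio(sd_a, sd_b):
    """Ratio of the larger variance to the smaller, given two SDs."""
    larger, smaller = max(sd_a, sd_b), min(sd_a, sd_b)
    return (larger / smaller) ** 2


# Standard deviations as printed in Table 4
# (WA = week after, DA = day after, DB = day before).
ror_sd = {"WA": 10.28, "DA": 15.78, "DB": 16.65}
eth_sd = {"WA": 18.56, "DA": 19.27, "DB": 20.52}

# Assumed reading: F for each comparison is the variance ratio of the two groups.
ror_f = {
    "DB-WA": round(variance_ratio(ror_sd["DB"], ror_sd["WA"]), 2),
    "WA-DA": round(variance_ratio(ror_sd["WA"], ror_sd["DA"]), 2),
}
eth_f = {
    "DB-WA": round(variance_ratio(eth_sd["DB"], eth_sd["WA"]), 2),
    "DB-DA": round(variance_ratio(eth_sd["DB"], eth_sd["DA"]), 2),
    "WA-DA": round(variance_ratio(eth_sd["WA"], eth_sd["DA"]), 2),
}

print(ror_f)
print(eth_f)
```

Under this assumption the computed ratios match the tabled entries 2.62, 2.36, 1.22, 1.13, and 1.08, which suggests that the F column tests the homogeneity of the group variances while the t column tests the mean differences.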

TABLE 4
MEANS, SD's, AND TESTS OF SIGNIFICANCE OF DIFFERENCES
FOR SIX RIGIDITY MEASURES


                                            GROUPS
TASK      GROUP†   N    MEAN       SD         COMPARED   t           F
          WA       29   22.79      10.28      DB-WA      -2.00*      2.62**
Ror.      DA       26   33.06      15.78      DB-DA      -.5?        1.11
          DB       24   30.58      16.65      WA-DA      -2.84**     2.36*

          WA       29   86.48      18.56      DB-WA      -2.23*      1.22
Eth.      DA       26   85.92      19.27      DB-DA      -2.09       1.13
          DB       24   74.25      20.52      WA-DA      +.11        1.08

          WA       29   [illeg.]   [illeg.]   DB-WA      [illeg.]*   3.4?*
Arith.    DA       26   [illeg.]   [illeg.]   DB-DA      [illeg.]    [illeg.]
          DB       24   [illeg.]   [illeg.]   WA-DA      [illeg.]    [illeg.]

          WA       29   [illeg.]   [illeg.]   DB-WA      [illeg.]    [illeg.]
Wds.      DA       26   [illeg.]   [illeg.]   DB-DA      [illeg.]    [illeg.]
          DB       24   [illeg.]   1.98       WA-DA      -1.23       2.19*

          WA       29   9.69       2.80       DB-WA      +1.43       2.19*
Obj.††    DA       26   10.23      2.72       DB-DA      +.65        2.0?
          DB       24   10.67      1.89       WA-DA      -.71        1.06

          WA       29   [illeg.]   [illeg.]   DB-WA      [illeg.]    [illeg.]
Ang.      DA       26   [illeg.]   [illeg.]   DB-DA      [illeg.]    [illeg.]
          DB       24   5.13       2.72       WA-DA      -1.86       1.02


* Significant beyond the .05 level of confidence.
** Significant beyond the .01 level of confidence.
† WA = week after stress; DA = day after; DB = day
before.
†† Low score = high rigidity on these tests; on all others,
high score = high rigidity.


The security-stress condition utilized in this
study was an acute threat, physical in nature
but with psychological implications. One may
hypothesize that the candidate is attempting
to “prove himself.” He is self-oriented, since
“pass” or “fail” in the pressure chamber and
tank are not related to anyone’s ability but
his own. It is not a competitive situation.
Therefore, those tasks in this battery which
are viewed as tasks in which he might
inadvertently reveal something of himself should
be most threatening to a subject under the
security-stress conditions of the Day Before
group.

There are no “reality checks” on the answers
he gives to the Rorschach or the Angyal tests.
Therefore, those tests should be perceived as
most threatening, and should be expected to
reveal greater rigidity in the Day Before group.
We would assume that this group is more wary,
is looking for hidden meanings. In that event,
on both of these tasks, where they have no
way of knowing if they are “right,” they would
be most guarded in their approach. We find
that this is true; i.e., the Day Before group
is more rigid on these two measures.

It can be seen, then, that on some of the
other tasks, this same kind of approach might
lead to behavior which could make the Day
Before group appear more flexible than the
Week After group. Thus, on those tasks where
hidden meanings can be found, we can expect
the Day Before group to find more than the
Week After group. This means that we might
expect them to score less rigid on Hidden
Words, Hidden Objects, and arithmetic
Einstellung problems. This turns out to be
the case, although several of these differences
do not achieve statistical significance.

What about the scores on the Ethnocentrism
Scale? The Day Before group does not yet
know if it is an “in-group” or an “out-group”
in terms of Submarine School. Furthermore, it
has been shown by Stouffer et al. (20) that in a
“threat to life” situation prejudice decreases.
We would therefore expect the Day Before
group to be less ethnocentric than the Week
After group. The mean difference is in the
expected direction and is statistically
significant.

An analysis of the results of the Week After
group can be explained similarly. These
candidates know they have been accepted for school,
but training has not yet started. There is no 
immediate threat for them. We can expect 
that they can return to the cultural norm and 
view the experimental situation in a competi- 
tive frame of reference. Therefore, they would 
be most threatened by those tasks which do 
have a reality check, i.e., those on which they 
know if they are “right” or “wrong” when they 
write down an answer. For these people, then, 
the Hidden Words, Hidden Objects, and arith- 
metic Einstellung problems are most threaten- 
ing. It is even conceivable that the last, which 
is most like a school-type problem, is the most 
threatening. Thus it is not surprising that the 
Week After group scores significantly more 
rigid on the arithmetic problems, and more 
rigid (although not significantly) on Hidden 
Words and Hidden Objects. 

The Day After group are the victims of 
conflicting motivating states. They have taken 
the tank and pressure tests, but do not yet 
know if they have been admitted to Submarine 
School, since candidacy can be refused even if 
these tests are passed. The results similarly 
indicate this conflict. We find that this group 
tends to score high on all of the rigidity meas- 
ures, but the coefficient of concordance is not 
higher than those for the other two groups. 
Thus it would appear that there are different 
kinds of people within the group who are 
affected differentially by the conflict of mo- 
tives. In this situation, it is conceivable that 
the most important determinant of the rigidity
scores will be the kind of individual, i.e., one
who assumes he is “in,” assumes he is “out,” 
or vacillates in his belief. 

The above findings would seem to suggest
that there are at least three major components
which contribute to variations in the manifest
behavior which we label “rigid.” These are
the individual, the nature of the task, and the
general conditions under which the tasks are
administered. Rather than asking “is there a
general factor of rigidity?” we might better
ask “under what conditions are we likely to
find generalized rigidity?”


SUMMARY


The tests of rigidity included in this study 
have all been used in at least one other investi- 
gation as measures of rigidity. Three equivalent 
groups of Ss were tested the day before, one 
day after, and one week after a real-life stress 
situation which was presumed to arouse feel- 
ings of insecurity. Tau correlation coefficients 
were computed for the scores of every test 
with every other test, and appeared to yield 
essentially a chance distribution. Furthermore, 
coefficients of concordance among these meas- 
ures did not approach statistical significance. 
Therefore, it was concluded that there is no 
general factor among a number of so-called 
measures of rigidity under varying conditions 
of security. 
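
The coefficient of concordance referred to above can be illustrated with a minimal sketch. The scores are hypothetical and are not the data of this study; the function names `ranks` and `kendalls_w` are illustrative choices. The sketch assumes no tied scores and uses the uncorrected form W = 12S / (m^2 (n^3 - n)) for m measures ranking n subjects.

```python
def ranks(scores):
    """Rank scores from 1 (lowest) upward; assumes there are no ties."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    result = [0] * len(scores)
    for rank, idx in enumerate(order, start=1):
        result[idx] = rank
    return result


def kendalls_w(score_rows):
    """Kendall's coefficient of concordance, without a tie correction.

    score_rows holds one row per measure, each row giving that
    measure's score for every subject in the same subject order.
    """
    m = len(score_rows)          # number of measures
    n = len(score_rows[0])       # number of subjects
    rank_rows = [ranks(row) for row in score_rows]
    rank_sums = [sum(row[j] for row in rank_rows) for j in range(n)]
    mean_sum = m * (n + 1) / 2
    s = sum((total - mean_sum) ** 2 for total in rank_sums)
    return 12 * s / (m ** 2 * (n ** 3 - n))


# Hypothetical rigidity scores: three measures, five subjects.
hypothetical = [
    [12, 7, 9, 15, 4],
    [3, 8, 5, 1, 9],
    [20, 11, 25, 14, 18],
]
print(round(kendalls_w(hypothetical), 3))
```

A value near 1 would mean the measures order the subjects alike; a value near 0, as the summary reports for these rigidity measures, means they do not.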

A comparison of mean differences among the 
groups on the various measures of rigidity 
revealed that there were some statistically 
significant differences, but the direction of 
these differences was not consistent. The 
hypothesis that, on any particular measure of 
rigidity employed, there are no significant 
differences between mean scores of equivalent 
groups under varying conditions of security 
could not be conclusively rejected. An ex-
planation of these results was offered, and
evidence in support of this explanation pre- 
sented. 

The following conclusions appear to be 
justified: (a) There is no general factor of
rigidity among a number of so-called measures
of rigidity; the interrelationships of these
measures appear to vary with the nature of
the tests employed and the conditions of test
administration, as well as behavioral de-
terminants within S(s). (b) Scores obtained
by an individual on any so-called measure of
rigidity appear to be a function not only of
the individual, but also of the nature of the
test and the conditions of test administration.


REFERENCES 


1. Adorno, T. W., Frenkel-Brunswik, Else,
Levinson, D. J., & Sanford, R. N. The au-
thoritarian personality. New York: Harper, 1950.

2. Angyal, Alice F. The diagnosis of neurotic traits
by means of a new perceptual test. J. Psychol.,
1948, 25, 105-135.

3. Applezweig, Dee G. An investigation of the inter-
relationships of several measures of rigidity
under varying conditions of security. Unpub-
lished Ph.D. thesis, Univer. of Michigan, 1951.

4. Blake, R. R., & Ramsey, G. V. (Eds.). Percep-
tion—an approach to personality. New York:
Ronald, 1951.

5. Brown, R. W. Some determinants of the relation-
ship between rigidity and authoritarianism. Un-
published Ph.D. thesis, Univer. of Michigan, 1951.

6. Bruner, J. S. Personality dynamics and the process
of perceiving. In R. R. Blake & G. V. Ramsey
(Eds.), Perception—an approach to personality.
New York: Ronald, 1951. Pp. 121-147.

7. Cattell, R. B., & Tiner, L. Grose. The varieties
of structural rigidity. J. Pers., 1949, 17, 321-341.

8. Christie, J. R. The effects of frustration upon
rigidity in problem solution. Unpublished doc-
tor’s thesis, Univer. of California, 1949.

9. Cook, E. B., & Wherry, R. J. The urinary 17-
ketosteroid output of naval submarine enlisted
candidates during two stressful situations. Hum.
Biol., 1950, 22, 104-124.

10. Fisher, S. Patterns of personality rigidity and
some of their determinants. Psychol. Monogr.,
1950, 64, No. 1 (Whole No. 307).

11. Harris, R. A. The effects of stress on rigidity of
mental set in problem solution. Unpublished
doctor's thesis, Harvard Univer., 1950.

12. Kendall, M. G. Rank correlation methods. London:
Charles Griffin, 1948.

13. Kounin, J. S. Experimental studies of rigidity as
a function of age and feeblemindedness. Un-
published doctor's thesis, Univer. of Iowa, 1939.

14. Kounin, J. S. The meaning of rigidity: a reply to
Heinz Werner. Psychol. Rev., 1948, 55, 157-166.

15. Kries, H. Puzzles. Child Life, 1940, 45, 19.

16. Luchins, A. S. Mechanization in problem solving:
the effect of Einstellung. Psychol. Monogr., 1942,
54, No. 6 (Whole No. 248).

17. Luchins, A. S. Rigidity and ethnocentrism: a
critique. J. Pers., 1949, 17, 449-466.

18. Rokeach, M. Generalized mental rigidity as a
factor in ethnocentrism. J. abnorm. soc. Psychol.,
1948, 43, 259-278.

19. Rokeach, M. Rigidity and ethnocentrism: a re-
joinder. J. Pers., 1949, 17, 467-474.

20. Stouffer, S. A., Suchman, E. A., DeVinney,
L. C., Star, Shirley A., & Williams, R. M.,
Jr. The American soldier: adjustment during army
life. Vol. I. Princeton: Princeton Univer. Press,
1949.

21. Werner, H. The concept of rigidity: a critical
evaluation. Psychol. Rev., 1946, 53, 43-52.

Received January 2, 1953. 




PHYSIOLOGICAL NEED, VERBAL FREQUENCY, AND WORD
ASSOCIATION¹


LAUREN G. WISPÉ 
The Ohio State University 


PREVIOUS experiments in which the in-
structions have been to “associate”
while under the influence of various
periods of food and water abstinence have 
employed such diverse methods and depriva- 
tion periods as to make comparison of the find- 
ings exceedingly difficult. Nevertheless, es- 
pecially since it is not available elsewhere, a 
perusal of the literature may prove valuable 
to see whether, despite differences in methods, 
certain conclusions can be synthesized to 
formulate hypotheses for further investigation. 


PROBLEMS 
1. The question of the exact strength of the
physiologically induced deprivation has clearly
been ignored by earlier research. Deprivation
periods, both in the experimental and the con- 
trol groups, have been unsystematically varied 
from one to 24 hours. In other needs this would 
be less crucial than in hunger, which in our 
culture is highly cyclical. For example, in 
Sanford's work (9, 10) the 27 experimental 
subjects (Ss) were given a battery of tests 
(including words for association, pictures for 
interpretation and completion, and various
subjective rating scales) after a 24-hour fast,
while the 35 control Ss were tested at various
times during the normal eating cycle. In the
Chein, Levine, and Murphy experiment (3)
Ss were told to verbalize an association to 40
colored and 40 achromatic cards (containing
ambiguous drawings and drawings of mis-
cellaneous household articles); the five ex-
perimental Ss were tested once a week with
one, three, six, and nine hours of abstinence
from food, while the five control Ss were tested
from 45 minutes to two and one-half hours
after eating. The results of these studies will
be analyzed later. Our present object is to
suggest that these experiments carried the
deprivation periods neither far enough nor
systematically enough. This applies as well
to the McClelland and Atkinson project (1, 6)
to be discussed below. Therefore, the first
purpose of this research is to induce a pro-
tracted period of total abstinence from food
and water, and to test during the course of the
deprivation the relationship between the
physiologically induced need and the word
associations to need-relevant and neutral
stimuli.

¹ This research was supported by a grant from The
Ohio State University Research Foundation. The
author wishes to thank Ferdinand van der Veen and
[...]vin Lerner for assistance in collecting and proc-
essing the data. Dr. Julian Rotter graciously read
the paper, and made many helpful suggestions.

2. McClelland and Atkinson (1, 6) tested 
their Ss 1, 4, and 16 hours after eating. In 
one experiment, 12 blanks were flashed upon 
the screen with various hints given: naming 
food objects, places related to eating, pro- 
jected feeling responses; and three were shown 
with no comment. In another aspect of the 
same study 40 Ss were asked to make size and 
number comparisons of food-related and non- 
food-related objects during three deprivation 
periods. In a separate experiment, using the 
same Ss and procedures, protocols were col- 
lected on eight TAT-type pictures, represent- 
ing all aspects of the food-getting process. 
Responses were categorized to cover the whole 
gamut of food acquisition, as “goal objects”
(food), “instrumental responses” (knife, fork),
“food imagery,” etc. The results showed that 
although the total number of food responses 
increased reliably in both experiments with 
hours of deprivation, this was due primarily 
to reliable increases in the instrumental re- 
sponses alone. The size and number estimates 
of food-related objects also increased signifi- 
cantly. The object responses, however, did 
not increase reliably with increased depriva-
tion, nor did the food-related imagery in the
TAT-type test. One of the general conclusions
following from McClelland and Atkinson’s 
results is that the responses made under in- 
duced needs have an importance for the study 
of the cognitive processes beyond their sheer 
number. Their analyses have shown that the 
gross number of food responses is concealing 
important information. The second purpose
of this research, therefore, is to extend and
confirm the response categorization of Mc-
Clelland and Atkinson, especially to sharpen
the definitions of the categories, and to include 
classifications for the affective responses that 
characterize the verbal expressions of intense 
needs. 

3. The work of Solomon and Howes (4, 12) 
with recognition times has demonstrated the 
importance of the commonness of the stimulus 
material. Their findings suggest that in study- 
ing the associative process, cognizance must 
be taken of the strength—in Thorndikian 
terms—of the stimulus material. The final 
purpose of this research is, thus, to determine 
in what way the commonness of the stimulus 
word will affect the number and kinds of
association responses. 


METHOD 


The stimulus material. Forty-eight stimulus words 
were used, of which half were need-relevant and half 
were neutral (supposed to have no connotations of 
food and/or water deprivation). Of the 24 “need” 
words, 12 were related to food deprivation and 12 to 
water deprivation. Following the work of McClelland 
and his colleagues (1, 6), the need words were divided 
into “act” words (those denoting the taking of food 
and/or water into the body) and "object" words (the 
names of satisfiers, etc.). Thus there were four sub- 
classes of need words: (a) food-act words, (b) food- 
object words, (c) water-act words, and (d) water-object 
words; and a class of neutral words which were matched 
with the need words for commonness. The stimulus 
words were randomized in two lists of 24 words each, 
each list containing equal representation from the four 
subdivisions of the need words, and the same number 
of neutral words. 


The commonness of the stimulus words was assessed 
as follows: Correlations were computed between the 
Thorndike-Lorge (13) frequency ratings for the stimulus 
words and the estimates of two independent random
samples (N’s of 100 and 150) of college undergraduates
who were asked to rate the stimulus words as “com-
mon,” “mid-common,” or “uncommon,” depending
upon their frequency of usage in the daily vocabulary
of the judges. Since these correlations ranged between
.70 and .97, the undergraduate judgments were used
in the final commonness ratings. Table 1 gives the
stimulus words, their order of presentation, ratings,
and classifications.

Subjects. The Ss used in the word association study,
men and women students at The Ohio State University,
were 50 of the 60 Ss who had just served in the recog-
nition-time experiment described elsewhere (14). None
had completed an introductory course in psychology.
Although the same words were used in both the word
association and the recognition-time study, upon inter-
rogation none of the subjects reported awareness of
this fact.

Deprivation conditions. On the testing days Ss were
not permitted to eat or drink, and they were requested
to limit their activities to classes and study. Smoking
was permitted with restraint. Three levels of depriva-
tion were induced: 0-2 hours (the control group),
10 hours, and 24 hours. The 0-hour group ate a normal
but late lunch; the 10-hour group ate breakfast at
7:30 A.M.; and the 24-hour group had their last meal
at dinner time on the night preceding their testing
period. All testing started at 5:30 P.M. The Ss were
randomly assigned to one of the three deprivation
periods, and one of the two lists.
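
The arithmetic behind the deprivation schedule can be checked with a short sketch. Only the 5:30 P.M. starting time and the 10- and 24-hour periods come from the text; the calendar date and the name `last_meal_time` are arbitrary, illustrative assumptions.

```python
from datetime import datetime, timedelta

# The date is arbitrary; only the 5:30 P.M. clock time is taken from the text.
TEST_START = datetime(1953, 1, 5, 17, 30)


def last_meal_time(hours_deprived):
    """Latest moment a subject could have eaten for a given deprivation period."""
    return TEST_START - timedelta(hours=hours_deprived)


for hours in (10, 24):
    meal = last_meal_time(hours)
    print(f"{hours:>2} hours of deprivation: last meal at {meal:%I:%M %p, %b %d}")
```

Counting back ten hours from the start of testing gives the 7:30 A.M. breakfast, and counting back twenty-four hours gives the dinner hour of the preceding evening, in agreement with the procedure described.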

Procedure. The study was presented as one of dep-
rivation and verbal fluency. The Ss were instructed
“to give the first words which came to mind,” and to
continue associating “until the experimenter said
‘Stop.’” The experimenter (E) then read the stimulus
words to S. Seated at a table about six feet behind and
to the left of S, E recorded the first 19 responses. Some-
where between the nineteenth and twenty-fifth re-
sponse, E stopped S, and proceeded to the next word.


TABLE 1
PRESENTATION ORDER AND CLASSIFICATIONS OF THE TWO LISTS OF STIMULUS WORDS

              List I                                 List II

munch (AUF)*       drink (ACW)          nibble (AUF)       sip (ACW)
godson (NU-)       coke (OCW)           devout (NU-)       water (OCW)
need (NC-)         dine (ACF)           beat (NC-)         drain (ACW)
mean (NC-)         waddle (NU-)         speak (NC-)        idler (NU-)
serenade (NU-)     waffle (OUF)         solar (NU-)        cider (OUW)
feed (ACF)         chuckle (NU-)        eat (ACF)          ram (NU-)
lemonade (OUW)     mine (NC-)           soda (OUW)         lake (NC-)
think (NC-)        hunch (NU-)          rip (NC-)          quibble (NU-)
comb (NC-)         chocolate (OUF)      weather (NC-)      ham (OUF)
imbibe (AUW)       guzzle (AUW)         gulp (AUW)         devour (AUF)
milk (OCW)         dive (NC-)           cake (OCF)         dread (NC-)
imbed (NU-)        meat (OCF)           gust (NU-)         steak (OCF)

* The classification of the words is: A, act; O, object; N, neutral; C, common; U, uncommon; F, food; W, water.
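
The three-letter classification codes of Table 1 lend themselves to a small decoding sketch, which also checks the statement that each list contains equal representation from the four subclasses of need words. The function names `decode` and `subclass_counts` are illustrative assumptions. Two entries are read as "devout" and "water" where the scanned table is garbled; both readings are assumptions.

```python
from collections import Counter

ROLE = {"A": "act", "O": "object", "N": "neutral"}
COMMONNESS = {"C": "common", "U": "uncommon"}
NEED = {"F": "food", "W": "water", "-": None}

LIST_I = {
    "munch": "AUF", "drink": "ACW", "godson": "NU-", "coke": "OCW",
    "need": "NC-", "dine": "ACF", "mean": "NC-", "waddle": "NU-",
    "serenade": "NU-", "waffle": "OUF", "feed": "ACF", "chuckle": "NU-",
    "lemonade": "OUW", "mine": "NC-", "think": "NC-", "hunch": "NU-",
    "comb": "NC-", "chocolate": "OUF", "imbibe": "AUW", "guzzle": "AUW",
    "milk": "OCW", "dive": "NC-", "imbed": "NU-", "meat": "OCF",
}

LIST_II = {
    "nibble": "AUF", "sip": "ACW",
    "devout": "NU-",   # garbled in the scan; assumed reading
    "water": "OCW",    # garbled in the scan; assumed reading
    "beat": "NC-", "drain": "ACW", "speak": "NC-", "idler": "NU-",
    "solar": "NU-", "cider": "OUW", "eat": "ACF", "ram": "NU-",
    "soda": "OUW", "lake": "NC-", "rip": "NC-", "quibble": "NU-",
    "weather": "NC-", "ham": "OUF", "gulp": "AUW", "devour": "AUF",
    "cake": "OCF", "dread": "NC-", "gust": "NU-", "steak": "OCF",
}


def decode(code):
    """Split a code such as 'AUF' into (role, commonness, need)."""
    role, commonness, need = code
    return ROLE[role], COMMONNESS[commonness], NEED[need]


def subclass_counts(word_list):
    """Count the words of one list in each need subclass and in the neutral class."""
    counts = Counter()
    for code in word_list.values():
        role, _, need = decode(code)
        label = "neutral" if role == "neutral" else f"{need}-{role}"
        counts[label] += 1
    return counts


for name, words in (("List I", LIST_I), ("List II", LIST_II)):
    print(name, dict(subclass_counts(words)))
```

Each list yields three food-act, three food-object, three water-act, and three water-object words, with twelve neutral words, as the description of the stimulus material requires.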


RESULTS 


Our first concern is the effect of the stim-
ulus words upon the word association re-
sponses. The responses may be categorized as
“food-related” (meat, bread, cake, etc.),
“water-related” (water, pitcher, beer, etc.),
and “neutral responses” (those responses
supposedly unrelated to food and water dep-
rivation). The chi-square analysis of the
data presented in Fig. 1 is significant beyond
the 1% level. The food-related stimuli beget
more food-related responses, the water-
related words elicit more water association,
and the neutral stimuli get more neutral
word responses. Thus there is a congruity be-
tween the nature of the stimulus words and
the kinds of responses made to them.

The design of the experiment permitted the
investigation of the hypotheses, derived from
related research (1, 6, 12), that the “act-
object” and “common-uncommon” classifica-
tions of the stimulus words would be related
to the number and kinds of responses made.
The act-object stimuli prove to be nearly
significantly related to the number of food,
water, and neutral responses (chi square =
5.70; a chi square of 5.99 is significant at the
5% level); the object words received more
food responses and the act words elicited more
water associations. There was also reason to
believe that more responses would be made
to the need-related common word stimuli,
but the chi-square analysis fails to confirm
this. We can summarize our findings thus far
by noting that although neither the act-object
analysis of the stimulus words nor the com-
mon-uncommon one is significant, the food-
water-neutral classification proves to be an
important factor. The implications of these
findings for an understanding of the asso-
ciative process are clear; even in the presence of
strong deprivation, certain categories of the
stimulus words are significant determiners of
the associations made.


FIG. 1. NUMBER OF FOOD, WATER, AND NEUTRAL
ASSOCIATIONS TO FOOD, WATER, AND
NEUTRAL STIMULUS WORDS

Our second major interest is the relation- 
ship between the need-related responses and 
the deprivation periods. The food- and water- 
related responses were categorized for each 
of the deprivation periods. This chi square is 
significant beyond the 1% level, indicating 
that there is a significant relationship between 
the number of food and water word associa- 
tions and the hours of deprivation. More 
food- and water-related responses are given 
at 10 hours of deprivation than at zero hours 
(the control period). However, at the 24-hour 
deprivation period there is a slight decrease 
in the number of food and water responses. 
Our results suggest that the relationship of 
need-relevant associations and a physiological 
need is of the general curvilinear type. 

Our third interest is with the nature and 
the number of the word association responses 
and the periods of deprivation. The responses 
can be further refined into any one of the five 
categories listed below: 

1. Act responses—verbs and verb forms (gerunds) 
directly denoting taking food and/or liquids into the 
body, as eat, eating, masticate, drain, etc. 

2. Object responses—names of hunger and thirst 
satisfiers, as the names of foods in their edible state 
and the names of meals. 

3. Instrumental responses—names of objects and 
processes instrumental to need satisfaction which, 
however, are not themselves satisfiers, as (a) the names 
of specific establishments for obtaining food and/or 
drink (restaurant, bar, etc.); (b) the names of all utensils 
necessary for preparing, containing, and consuming 
food and/or beverages in our culture (pots, plates, 
glass, etc.); (c) the names of foods which in our culture
are not edible in their raw form (grain, hogs, etc.); (d) 
the names of all processes for the preparation of food 
(cooking, baking, etc.). 


4. Affective responses—all those adjectival responses
referring more or less exclusively to hunger and thirst
satisfiers, as sweet, sour, delicious, etc.

5. Neutral responses—all those responses which could
not be coded into one of the above categories, and
which were taken therefore as generally unrelated to
the hunger-thirst process.


TABLE 2
THE NUMBER OF ACT, OBJECT, INSTRUMENTAL,
AFFECTIVE, AND NEUTRAL WORD ASSOCIATION
RESPONSES MADE AT EACH DEPRIVATION PERIOD

Deprivation
Periods        Act   Object   Instrumental   Affective   Neutral    Total

 0             132      748            297         112     5,551    6,840
10             186    1,075            192         197     5,190    6,840
24             143      696            299          81     5,621    6,840

Total          461    2,519            788         390    16,362   20,520


Table 2 gives the number of responses in 
each of the five categories for each of the 
deprivation periods. This information is 
presented graphically in Fig. 2. In their raw 
form these data failed to satisfy the assump- 
tion of homogeneity for the analysis of vari- 
ance, so a logarithmic transformation was 
necessary (11). The results of the analysis of 
variance of the transformed data are given 
in Table 3. The principal hypothesis, that the 
deprivation will selectively influence the 
number and kinds of word association re- 
sponses, is tested by the interaction, which 
is significant beyond the 1% level. Inspection 
of Fig. 2 shows that at the 10-hour deprivation 
period there is an increase in the number of 
act, object, and affective responses, but that 
these responses decrease by the 24-hour period. 
The instrumental associations, on the other 
hand, decrease at the 10-hour period, but 
increase by the 24-hour deprivation period. 
Tests for the significance of the differences (t)
indicate that the differences among the mean
number of instrumental responses at 0, 10,
and 24 hours are significant beyond the 5%
level. This is also true for the affective re-
sponses, but not for the act or object classi-
fications. Furthermore, the mean number of
instrumental responses is significantly dif-
ferent (beyond the 5% level) from the mean
number of act, object, and affective associa-
tions at both the 0- and the 24-hour depriva-
tion periods.
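
As a rough companion to the analysis described above, the need-related columns of Table 2 can be treated as a 3 x 4 contingency table and a Pearson chi square computed from them. This is not the analysis reported here, which is an analysis of variance of log-transformed scores; it is only a descriptive sketch that treats every response as an independent observation, which the design does not strictly justify. Three cell values that are garbled in the scan (112, 5,621, and a row total of 6,840) were recovered from the printed marginal totals. The name `chi_square` is an illustrative choice.

```python
# Need-related response counts from Table 2 (neutral responses omitted).
TABLE_2 = {
    0:  {"act": 132, "object": 748,  "instrumental": 297, "affective": 112},
    10: {"act": 186, "object": 1075, "instrumental": 192, "affective": 197},
    24: {"act": 143, "object": 696,  "instrumental": 299, "affective": 81},
}


def chi_square(table):
    """Pearson chi square for a table given as {row: {column: count}}."""
    rows = list(table)
    cols = list(next(iter(table.values())))
    row_totals = {r: sum(table[r].values()) for r in rows}
    col_totals = {c: sum(table[r][c] for r in rows) for c in cols}
    grand_total = sum(row_totals.values())
    statistic = 0.0
    for r in rows:
        for c in cols:
            expected = row_totals[r] * col_totals[c] / grand_total
            statistic += (table[r][c] - expected) ** 2 / expected
    df = (len(rows) - 1) * (len(cols) - 1)
    return statistic, df, col_totals


statistic, df, col_totals = chi_square(TABLE_2)
print(f"chi square = {statistic:.2f} with {df} degrees of freedom")
print("column totals:", col_totals)
```

The column totals printed by the sketch can be compared against the Total row of Table 2 as a check on the transcription of the counts.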

In the analysis of variance (Table 3) the
differences among the response-category means
are significant beyond the 1% level. Perusal
of Fig. 2 shows that there are more object
associations made at all hours. Both these
results are probably due to the fact that our
language has more words for hunger and thirst
satisfiers than for acts instrumental to the
satisfaction of hunger and thirst.

Our findings indicate that the classification
of instrumental associations operates dif-
ferently with increasing deprivation from
the act, object, or affective response cate-
gories, and that object associations decrease
and instrumental responses increase with in-
creased deprivation periods.


FIG. 2. NUMBER OF WORD ASSOCIATION RESPONSES IN
EACH OF THE CATEGORIES AT EACH
DEPRIVATION PERIOD
(Legend: A, act; O, object; I, instrumental; F, affective.)


TABLE 3
ANALYSIS OF VARIANCE OF THE LOGARITHMIC
TRANSFORMATION OF THE WORD
ASSOCIATION DATA

Source                         df      Variance          F        p

Response categories             3    6.63401463      19.0608    <.01
  (act, object, instrumen-
  tal, and affective)
Deprivation periods             2     .11001195       1.28
Interaction                     6     .347879866      4.046     <.01
Within cells                  168     .0862234285

Total                         179     .2055695955
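
The variance ratios of Table 3 can be recomputed from the printed variance estimates, as in the sketch below. It assumes that the Variance column holds mean squares and that the digits survive the scan correctly; the table does not state which error term was used for each F, so the sketch prints each effect against both the within-cells and the interaction variance for comparison. The names `MEAN_SQUARES`, `DEGREES_OF_FREEDOM`, and `f_ratio` are illustrative.

```python
# Variance estimates (assumed to be mean squares) as printed in Table 3.
MEAN_SQUARES = {
    "response categories": 6.63401463,
    "deprivation periods": 0.11001195,
    "interaction": 0.347879866,
    "within cells": 0.0862234285,
}

DEGREES_OF_FREEDOM = {
    "response categories": 3,
    "deprivation periods": 2,
    "interaction": 6,
    "within cells": 168,
}


def f_ratio(effect, error="within cells"):
    """Ratio of an effect's variance estimate to the chosen error variance."""
    return MEAN_SQUARES[effect] / MEAN_SQUARES[error]


for effect in ("response categories", "deprivation periods", "interaction"):
    line = f"{effect:<20} vs within cells: {f_ratio(effect):8.3f}"
    if effect != "interaction":
        line += f"   vs interaction: {f_ratio(effect, 'interaction'):8.3f}"
    print(line)

print("total df:", sum(DEGREES_OF_FREEDOM.values()))
```

Comparing the printed ratios with the F column shows which denominator reproduces each tabled value most closely; small discrepancies may reflect rounding or damage to the scanned digits rather than the original computation.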


DISCUSSION


What general conclusions can be reached
about the effects of abstinence from food and
water upon the associative process? Despite
the difficulties inherent in generalizing from 
the results of different experiments, three 
aspects of the relationship between physio- 
logically induced deprivation and word asso- 
ciation should be indicated. 

1. In the first place, the number of food and 
water responses, generically conceived, does 
not increase linearly with hours of deprivation. 
In general, those studies which have apposited 
physiological deprivation and tests of projec- 
tion or word association have found a signifi- 
cant increase in the number of need-related 
responses up to a certain point, but beyond 
this point the need responses decrease. 

For a precise description of the relationship 
between the associative process and depriva- 
tion it is necessary to know the exact need 
strength at which the curve for need responses 
begins to decelerate. This information can be 
obtained only by the construction of a family 
of curves derived from systematically spaced 
testing of the deprivation periods using com- 
parable material throughout. Although only 
in this way will we be able to discuss the 
function of association on deprivation, some 
hypotheses may be derived by considering the 
findings of previous research in this area. In 
the Sanford study (10) the results of the con- 
trols, who were tested from one to four hours 
after eating, showed that the number of food 
responses continued to increase up to four 
hours but that thereafter they leveled off. 
In the McClelland and Atkinson research (6) 
the mean number of instrumental responses 
increased markedly from the one-hour to the
four-hour deprivation periods, but between
the four-hour and the 16-hour deprivation
periods the curves were negatively accelerated
and the differences were not significant. Chein,
Levine, and Murphy (3) found that there was
a decrease in the number of food responses
occurring between the six- and nine-hour
deprivation periods, and this occurred some- 
time after ten hours in the present study. 
Juxtaposing these findings, we can derive the 
following relationship between deprivation 
and the gross number of need-related associa- 
tions: The number of need-word associations 
increases as a function of deprivation up to 
about four hours, although this may be largely 
the result of a “food habit” rather than real 
tissue need. The need responses maintain 
themselves at this increased level until about
the ten-hour deprivation period. Sometime
after nine or ten hours of deprivation the 
number of need-related responses decreases. 
Thus our hypothesized function of need- 
related association on physiological deprivation 
is one in which the curve for the need re- 
sponses accelerates rapidly at the outset, 
becomes asymptotic, and finally decelerates 
with the protracted periods of experimentally 
induced deprivation. Probably the relation- 
ship between physiologically induced depriva- 
tion states and need-related responses, when 
it is completely written, will be curvilinear, as 
in the other studies of motivation and per- 
formance. 

2. The more recent experiments have shown 
that important information may be obtained 
from a finer analysis of the need-relevant 
responses. For example, McClelland and At- 
kinson's categorization into instrumental and 
goal responses proved helpful in reconciling 
some of their findings with those of Sanford 
(10), for Sanford was recording only goal 
responses, which in both studies failed to 
increase significantly (6, p. 220). Although it 
is difficult to compare the results of the present 
experiment with those of McClelland and 
Atkinson (6) (especially since they tested 
in a group situation, which tends to depress 
egocentric responses) both studies show that 
as deprivation increases, Ss make more re- 
sponses which are concerned with acts instru- 
mental to need-satisfaction, while the number 
of names of need satisfiers decreases.²

3. In order to account for the rise and fall 
of the need-related associations, especially 
goal responses, all the writers in this area have 
had to conceptualize two antagonistic proc- 
esses; the one, call it “wish fulfillment,” or 
“autism” (8), or “vivification” (2), accounts 
for initial rise in the curve, while the other, 
call it “Drang nach Realität” (6) or “realism” 
(8), attempts to explain the subsequent de-
cline in the number of need responses. Thus,
regardless of one’s terminological preferences, 
the empirical finding remains that some place 
between the 10-hour and the 24-hour depriva- 
tion periods both the number and the kinds of 
responses take an important shift. Behind
each of the various ways of conceptualizing
this fact is the suggestion that this shift is
contingent upon the ungratifying nature of
the responses to the subjects.

² MacLeod’s excellent phenomenologically oriented
discussion of the motivation and perception problem
(7) suggests an interesting interpretation of this em-
pirical result.


SUMMARY 


Fifty college men and women were deprived 
of food and water for 0, 10, and 24 hours, and 
were presented with a word association list of 
24 words which had been matched for com- 
monness and need-relevance. Each S was 
tested only once. The results show that (a) 
more food, water, and neutral word associa- 
tion responses were made to food, water, and 
neutral stimulus words, respectively; (b) 
there was an increase in the number of food 
and water responses up to the tenth hour, but 
a decrease thereafter; and (c) with protracted 
periods of deprivation the number of responses 
pertaining to acts instrumental to need satis- 
faction increased while the number of re- 
sponses involving the names of need satisfiers 
decreased. The implications of these findings 
for a generalized statement of the function 
of association on physiological deprivation 
are discussed, with the hypothesis that the 
relationship is a curvilinear one. 


REFERENCES 


1. Atkinson, J. W., & McClelland, D. C. The pro-
jective expression of needs: II. The effect of
differential intensities of the hunger drive on
thematic apperception. J. exp. Psychol., 1948,
38, 643-655.

2. Bruner, J. S., & Goodman, C. C. Value and need
as organizing factors in perception. J. abnorm.
soc. Psychol., 1947, 42, 33-44.

3. Chein, I., Levine, R., & Murphy, G. The rela-
tions of the intensity of a need to the amount
of perceptual distortion. J. Psychol., 1942, 13,
283-293.

4. Howes, D. H., & Solomon, R. L. Visual duration
thresholds as a function of word probability.
J. exp. Psychol., 1951, 41, 401-410.

5. Knapp, R. H. Experiments in serial reproduction
and related aspects of the psychology of rumor.
Unpublished Ph.D. thesis, Harvard Univer.,
1948.

6. McClelland, D., & Atkinson, J. The projective
expression of needs: I. The effect of different
intensities of the hunger drive on perception.
J. Psychol., 1948, 25, 205-222.

7. MacLeod, R. B. Perceptual constancy and the
problem of motivation. Canad. J. Psychol., 1949,
3, 57-66.

8. Murphy, G. Personality. New York: Harper,
1947.

9. Sanford, R. N. The effects of the abstinence from
food upon the imaginal processes. A preliminary
study. J. Psychol., 1936, 2, 129-136.

10. Sanford, R. N. The effects of the abstinence from
food upon the imaginal processes. A further
study. J. soc. Psychol., 1937, 8, 335-343.

11. Snedecor, G. W. Statistical methods. Ames, Iowa:
Iowa State Coll. Press, 1946.

12. Solomon, R. L., & Howes, D. H. Word frequency,
personal values and visual duration thresholds.
Psychol. Rev., 1951, 58, 256-270.

13. Thorndike, E., & Lorge, I. The teacher's word
book. New York: Teachers Coll., Columbia
Univer., 1944.

14. Wispé, L., & Drambarean, N. Physiological need,
verbal frequency, and visual duration thresholds.
J. exp. Psychol., 1953, 46, 25-31.


Received January 10, 1953.


THE RELATIONSHIPS BETWEEN OVERT AND FANTASY 
AGGRESSION 


PAUL H. MUSSEN AND H. KELLY NAYLOR
The Ohio State University 


INVESTIGATIONS of the relationship be-
tween fantasy and overt behavior have
yielded varied results (5, 7, 8). Sanford
et al. (7) found that ratings of some fantasy
needs derived from the TAT were positively 
correlated with ratings of overt behavior mani- 
festing these needs, while in other cases the 
strength of the needs expressed in fantasy 
was negatively correlated with the degree of 
overt expression. Correlations between TAT 
fantasy needs and final staff ratings on overt 
behavior ranged from +.41 to —.44, with an 
average of +.11. Murray (5) also found dif- 
ferences among the variables in the extent to 
which overt and fantasy expressions cor-
responded. In a group of college men, the two
forms of expression were positively correlated 
on variables such as abasement, creation, 
dominance, exposition, nurturance, passivity, 
rejection, and dejection; but there was a nega- 
tive correlation between fantasy and overt 
sex needs. 

From these studies it may be concluded that 
some needs reflected in the TAT are also
apparent in overt expression, while other 
needs which are revealed strongly in the TAT 
are seldom demonstrated in overt behavior. 
As Lindzey has pointed out, one of the major 
problems involved in interpreting the TAT is 
the “determination of the conditions under 
which inferences based on the projective 
material directly relate to overt behavior and 
the conditions for the reverse" (4, p. 18). 
With respect to aggression specifically, Mur-
ray (5) found no correlation between the in-
tensity of the need in fantasy and its overt
expression. On the other hand, Sanford et al.
(7) found that aggressive needs were among
those which occurred frequently in the TAT
stories of adolescent subjects, but, according
to teachers’ reports, were infrequently ex-
pressed in the overt behavior of this group.
The correlation between ratings of the sub-
jects’ TAT aggressive needs and final staff
ratings of the degree of their manifestation
of this need was +.15. 


In explaining their findings, Sanford et al. 
(7) suggest that certain antisocial needs such 
as aggression may appear in the TAT stories 
but not overtly because cultural prohibition 
or internal conflict prevents the overt gratifi- 
cation of these needs and thereby increases 
their intensity in the individual's fantasies. 
According to these writers, needs which were
frequently present in both fantasy and overt
behavior were those which the culture en-
courages but which, generally speaking, the
individual does not have sufficient oppor-
tunity to satisfy.

In the middle class from which Sanford's 
and Murray’s subject populations were 
drawn, there are strong punishments for the 
expression of aggression. However, in lower- 
class culture, aggressive behavior is not 
punished but is encouraged (1). Although the 
manifestation of aggressive needs is accept- 
able, different individuals within the group 
have, as a result of their particular back- 
grounds and experiences, different strengths 
of these needs. If Sanford's hypotheses are 
correct, it can be predicted that, in a lower- 
class population, those who have intense 
fantasy aggression needs will express these 
needs in their overt behavior. 

The first hypothesis of the present study 
is based on Sanford's suggestions but is 
phrased in more quantitative terms. Specifi- 
cally it states that, in a lower-class group, 
individuals who give evidence of a great deal 
of fantasy aggression will also manifest more 
overt aggression than those who show little 
aggression in their fantasies. 

Sanford's statements about the withholding
of the overt expression of aggressive impulses
are consistent with the theories of Dollard
et al. (2), the Yale frustration-aggression 
theorists, who maintain that “the strength 
of inhibition of any act of aggression varies 
positively with the amount of punishment 
anticipated to be a consequence of that act" 
(2, p. 33). Although, generally speaking, the 
inhibition of aggressive expression is not an
important part of lower-class mores, many in-
dividuals of this class have experienced pun- 
ishment following the expression of aggression 
and consequently have learned to withhold 
this expression. In short, although there is no 
cultural prohibition against the expression of 
aggression, some lower-class people have 
“internal conflict” about it, i.e., they antici-
pate punishment for aggressive behavior.

The second hypothesis of the present study, 
derived from the frustration-aggression hy- 
pothesis, states that individuals who have 
strong fears of punishment relative to their 
aggressive impulses will manifest less overt 
aggression than individuals whose fear of 
punishment, relative to their aggressive needs, 
is small. In testing this hypothesis, the amount 
of punishment anticipated and the strength 
of aggressive needs were expressed as a 
fraction referred to as the punishment- 
aggression (P/A) ratio. The second hypothe- 
sis, phrased in terms of this ratio, states that 
individuals with high P/A ratios will show 
less overt aggression than those having low 
P/A ratios. 

If the first two hypotheses are supported 
or even partially supported, they may be 
combined into a third hypothesis which could 
also be systematically checked. If lower-class 
individuals with great fantasy aggressive 
needs express more overt aggression than those 
who have fewer fantasy aggressions (Hy- 
pothesis 1), and if individuals with low P/A 
ratios are more overtly aggressive than those 
with high P/A ratios (Hypothesis 2), then it 
follows that: among lower-class individuals, 
those who have high aggressive needs together 
with a low P/A ratio will manifest more 
overt aggressive behavior than those who have 
few fantasy aggressive needs together with a 
high P/A ratio (Hypothesis 3). 


METHOD 
Twenty lower-class white boys and nine lower-class
Negro boys at the Bureau of Juvenile Research in
Columbus, Ohio, served as subjects (Ss) in this study.
Their ages ranged from 9-0 to 15-8, and almost all of 
them had been referred to the Bureau for behaviors 
which brought them into conflict with school and court 
authorities, i.e., truancy, stealing, disorderly behavior 
in school, running away, etc. 

In order to check the three hypotheses, measures of 
fantasy aggression, fear of punishment, and aggressive 
behavior were required. The amounts of each S's 
fantasy aggression and anticipation of punishment were 
determined by analyzing his responses to TAT cards 1,
3BM, 4, 6BM, 7BM, 8BM, 12M, 13B, 14, and 18GF.
These were administered in the standard way within
two days after S arrived at the Bureau and before he 
joined the cottage group to which he was assigned for 
the remainder of his stay. 

In the analysis of the stories, any act or thought of a 
hero of a TAT story which, implicitly or explicitly, had 
as its “goal response...injury to an organism (or 
organism surrogate)" (2, p. 11) was assumed to be a
reflection of an aggressive need on the part of S. A
fantasy aggression (FA) score was derived for each S by
simply counting the number of times aggressive acts
appeared in his 10 stories. The following were considered
acts of aggression: fighting, killing, criminally assault-
ing; getting angry, hating, quarreling, cursing; criticiz-
ing, blaming, ridiculing; breaking and smashing objects;
escaping restraint, running away, resisting coercion;
being negativistic, resisting authority, lying, cheating,
stealing, gambling; forcing someone to change his be-
havior or ideas; domineering or restraining someone; re-
jecting, scorning, or repudiating someone; suicide, self- 
injury, self-depreciation. The occurrence of death, 
illness, or accident of the parents in a story was regarded 
as an indirect expression of hostility and hence scored 
as a fantasy aggression. 

A measure of S’s fear of punishment was derived in 
an analogous way. A punishment (P) score was ob- 
tained by cumulating the number of times the heroes 
of the stories were subjected to punishment press. As in
the case of aggression, punishment was very broadly
defined and any of the following was considered an
instance of this press when it was directed toward the
hero: punishment, assault, injury, killing; hate, threat,
quarreling; deprivation of some privilege, object, or
comfort; force, domination, restraint; physical handi-
cap such as blindness, etc.; rejection, scorn, repudiation.

Suicide, self-depreciation, and death, illness, or
accident of parents or other loved objects were scored
as both fantasy aggressions and punishment press. The
first two are obviously self-punitive; the last is
included because a broad definition of punishment
includes “injury to a loved object” (2, p. 34).

Observation of the overt aggressive behavior of S
began the day he entered his cottage group and con-
tinued for two weeks. Five attendants and a handicraft
teacher served as observers (Os) and recorded on two
forms, a weekly rating scale and a daily behavior report
that were designed to facilitate and objectify their
observations.

The daily behavior report, the first measure of
aggressive behavior, consisted of a check list of twelve
kinds of aggressive behavior: physical attack, bragging,
threatening, teasing, saucy-impertinent, insulting name-
calling, ridiculing, bullying, verbal castigation, mali-
cious gossip, destructiveness, and temper tantrums.
Each O filled out one of these reports each day for each
of the Ss under his care, checking the appropriate space
if that type of behavior occurred that day.

The total number of incidents of aggressive behavior
as indicated by the total number of checks on the forms
was tabulated for each child. Unfortunately, despite
the original plans, not all Ss were observed for the same
number of days or by the same number of Os; conse-
quently, the number of reports submitted was not the
same for all Ss. In order to get comparable measures of
aggressive behavior, the total number of checks re-
corded for each S was divided by the number of reports 
submitted for him multiplied by 12, the number of 
aggressive behaviors listed on each report. This ratio, 
henceforth designated DBR, may be regarded as the 
proportion of the aggression observed to the total 
amount which theoretically could have been observed. 
These ratios ranged from .000 (least aggression) to .398 
(most aggression), with a mean of .088 for the 29 Ss. 
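
The DBR computation described above can be sketched as follows. This is a minimal illustration only: the function name and the sample figures are assumptions introduced here and are not taken from the study's records.

```python
# Illustrative sketch of the DBR ratio; names and sample values are hypothetical.
BEHAVIORS_PER_REPORT = 12  # kinds of aggressive behavior listed on each daily report


def dbr_ratio(total_checks: int, reports_submitted: int) -> float:
    """Proportion of aggression observed to the total that could have been checked."""
    if reports_submitted <= 0:
        raise ValueError("at least one report is needed")
    possible_checks = reports_submitted * BEHAVIORS_PER_REPORT
    if not 0 <= total_checks <= possible_checks:
        raise ValueError("checks cannot exceed the number of available spaces")
    return total_checks / possible_checks


# Hypothetical S: 30 reports submitted, 43 checks recorded in all.
print(round(dbr_ratio(43, 30), 3))
```

Dividing by the number of reports times 12 is what makes Ss observed on different numbers of days, or by different numbers of Os, comparable.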

The second measure of aggression, the weekly 
rating scales, consisted of five separate scales involving 
different aspects of aggression: uncooperative-coopera-
tive, amiable-quarrelsome, aggressive-submissive,
docile-rebellious, and antagonistic-friendly. These
scales, which were designed to obtain an over-all 
measure of aggressive behavior, were completed by each 
O for each S under his care at the end of the first week 
of observation and again at the end of the second. 
Each of the five aggression scales was divided into 11 
equal units and scored according to the unit in which
the rating check fell, 1 representing the minimum
amount of aggression and 11 the maximum. The weekly 
rating-scale score for each S was the sum of the scores 
for the five scales. Since the number of reports sub- 
mitted varied among Ss, average weekly rating (WRS) 
scores were used. For the 29 Ss, these scores ranged 
from 11.7 (lowest aggression) to 47.0 (highest aggres- 
sion), with a mean of 29.38. The rank-order correlation
between these two behavioral measures of aggression
was .86.
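
A minimal sketch of the WRS scoring may make the arithmetic concrete. It assumes that each of the five ratings has already been oriented so that 1 is the least and 11 the most aggressive unit, whatever the printed polarity of the scale. The function names and sample ratings are hypothetical.

```python
# Illustrative sketch of the WRS score; names and sample ratings are hypothetical.
from statistics import mean

SCALES = 5                   # rating scales completed on each weekly report
MIN_UNIT, MAX_UNIT = 1, 11   # 1 = minimum aggression, 11 = maximum


def report_score(ratings):
    """Sum of the five scale ratings on one weekly report (possible range 5 to 55)."""
    if len(ratings) != SCALES:
        raise ValueError("each report carries exactly five scale ratings")
    if any(not MIN_UNIT <= r <= MAX_UNIT for r in ratings):
        raise ValueError("each rating must fall in units 1 through 11")
    return sum(ratings)


def wrs_score(reports):
    """Average report score over all weekly reports submitted for one S."""
    return mean(report_score(r) for r in reports)


# Hypothetical S rated by two Os at the end of each of two weeks (four reports).
reports = [
    [6, 7, 5, 6, 6],
    [7, 7, 6, 6, 7],
    [5, 6, 5, 5, 6],
    [6, 6, 6, 7, 6],
]
print(wrs_score(reports))
```

Averaging over reports, rather than summing, keeps the score independent of how many reports happened to be submitted for a given S.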


RESULTS 


The two types of data obtained—scores for 
fantasy aggression and fear of punishment, 
and the objective aggressive behavior scores 
—were used in testing all three hypotheses. 
For this purpose, all distributions were dichot- 
omized into high (median and above) and 
low (below median) groups. 

If the first hypothesis is valid, there will be 
a positive relationship between FA score, de- 
rived from the TAT, and aggressive behavior 
scores on the weekly rating scales and the
daily behavior reports. Table 1 shows the
number of individuals with high and low FA
scores who received high and low DBR ratios.
The probability of obtaining this set of cell
frequencies or all other possible sets which
would be even more extreme (i.e., more
favorable to our hypothesis) was calculated
directly by the method suggested by Fisher
(3, p. 101). The value obtained was p = .046.


TABLE 1
DISTRIBUTION OF AGGRESSIVE BEHAVIOR SCORES
AMONG SUBJECTS HIGH AND LOW IN FANTASY
AGGRESSION (FA)

AGGRESSIVE BE-        FA SCORES
HAVIOR SCORES       HIGH     LOW
High                 12        3
Low                   6        8
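
The direct calculation can be sketched as a one-tailed exact test on a 2 x 2 table with fixed margins. The function name and argument layout below are assumptions made for illustration; only the cell frequencies (12, 3, 6, 8) come from Table 1.

```python
# Illustrative sketch: one-tailed exact probability for a 2 x 2 table with fixed margins.
from math import comb


def exact_one_tailed_p(a: int, b: int, c: int, d: int) -> float:
    """Probability of the observed table or any more extreme one (larger top-left cell).

    Table layout:   a  b
                    c  d
    """
    row1, col1, col2 = a + b, a + c, b + d
    n = a + b + c + d
    total = comb(n, row1)
    # Sum hypergeometric terms from the observed top-left count up to its maximum.
    favorable = sum(
        comb(col1, k) * comb(col2, row1 - k)
        for k in range(a, min(row1, col1) + 1)
    )
    return favorable / total


# Table 1: high-aggression row (12, 3), low-aggression row (6, 8).
print(round(exact_one_tailed_p(12, 3, 6, 8), 3))
```

With the Table 1 frequencies this sketch gives a value that rounds to .046, in agreement with the figure reported above.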

The distribution of high and low FA scores 
was exactly the same in the case of the high- 
low dichotomy based on weekly behavior re- 
ports as it was in the case of the daily behavior 
reports; hence the same probability value was 
derived in testing the relationship between FA 
and the second measure of behavioral aggres- 
sion. 

As a further test of Hypothesis 1, the DBR 
ratios and WRS scores of the 10 Ss with the 
highest FA scores were compared with those 
of the 12 Ss with the lowest FA scores. Eight 
of the 10 highest FA scorers also received high 
(median or above) WRS scores and high DBR 
ratios, while 9 of the 12 lowest FA scorers also 
received low (below median) scores on the 
two behavioral measures of aggression. The
direct calculation of the probability of ob- 
taining this set of cell frequencies, or all other 
possible more extreme sets, yielded a p of .015. 
This finding also indicates a strong positive
relationship between covert and overt aggres- 
sion. 

It may be concluded from these findings 
that the first hypothesis is strongly supported 
and that, among a group of lower-class children, 
those with a greater amount of fantasy aggres- 
sion manifest more overt aggression than those 
with a smaller amount of fantasy aggression. 

Hypothesis 2 was tested by making a specific 
prediction concerning the relationship between 
a measure of TAT punishment press relative 
to TAT aggressive needs (P scores divided by 
FA scores to yield a P/A ratio) and the amount 
of overt aggression expressed as measured in 
this study. It may be predicted from the 
hypothesis that Ss having high P/A ratios 
will have low overt aggression scores; Ss having 
low P/A ratios will have high overt aggression
scores.
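
As a minimal sketch of how the P/A ratio and the high-low dichotomy might be computed: the scores below are invented for illustration, and the handling of an S with no fantasy aggression (FA = 0) is an assumption, since the text does not say how such a case was treated.

```python
# Illustrative sketch of the P/A ratio and median split; all scores are hypothetical.
from statistics import median


def pa_ratio(p_score: int, fa_score: int) -> float:
    """Punishment press (P) relative to fantasy aggression (FA)."""
    if fa_score == 0:
        # Assumption: the ratio is undefined when FA is zero; the source is silent here.
        raise ValueError("P/A ratio is undefined when the FA score is zero")
    return p_score / fa_score


def median_split(values):
    """Label each value 'high' (median and above) or 'low' (below median)."""
    cut = median(values)
    return ["high" if v >= cut else "low" for v in values]


# Hypothetical P and FA scores for five Ss.
p_scores = [4, 9, 2, 6, 5]
fa_scores = [8, 6, 10, 4, 5]
ratios = [pa_ratio(p, fa) for p, fa in zip(p_scores, fa_scores)]
print(ratios)
print(median_split(ratios))
```

Because the high group includes the median itself, the two groups need not be equal in size, as the marginal totals of the tables show.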

The number of individuals with high 
(median and above) P/A ratios and low (below 
median) P/A ratios in the high (median and 
above) and low (below median) categories of 
DBR and WRS is shown in Table 2. The dis- 
tributions for the two measures of overt ag- 
gression were identical. 

TABLE 2
DISTRIBUTION OF AGGRESSIVE BEHAVIOR SCORES
AMONG SUBJECTS WITH HIGH AND LOW PUNISHMENT-
AGGRESSION (P/A) RATIOS

AGGRESSIVE BE-        P/A RATIO
HAVIOR SCORES       HIGH     LOW
Low                  10        4
High                  7        8


The probability of obtaining this set of cell
frequencies or all other possible sets more
favorable to the hypothesis was p = .165.
Although there is not a statistically significant 
difference between the proportion of Ss in 
the high and low P/A groups having high or 
low overt aggression scores, the difference is in 
the predicted direction and may therefore be 
considered a trend which is mildly supportive 
of the hypothesis. 

Of the 10 Ss scoring highest in the P/A, 
seven received low DBR and WRS ratings, 
while of the 10 Ss having the lowest P/A 
ratios, only four received low overt aggression 
scores while six received high scores. The 
probability of obtaining this set of cell fre- 
quencies (or all other possible sets more 
favorable to the hypothesis) was calculated 
directly and the value yielded was p = .153. 
Again the finding may be regarded as favorable 
to the hypothesis since the direction of dif- 
ferences between the high and low P/A groups 
is as predicted. 

It will be recalled that the third hypothesis 
was a synthesis of the first two. Translated 
into the measures used in this investigation, it 
states that individuals who have high FA 
scores together with low P/A ratios will have 
high scores on DBR and WRS, while individ- 
uals who have low FA scores and high P/A
ratios will have low overt aggression scores.
Among the 29 Ss, there were seven who had
high (median and above) aggression needs
together with low (below median) P/A scores
and nine who had both low FA scores and high
P/A scores. The number of individuals in
each of these two groups in the high and low
DBR categories is shown in Table 3, while the
corresponding distribution in the high and low
categories of WRS scores is shown in Table 4.
Direct calculation of the probabilities of ob-
taining these sets of cell frequencies yielded
the highly significant values of p = .003 in
the case of Table 3, involving the DBR, and
p = .020 for Table 4, involving WRS scores.
These findings are clearly supportive of Hy-
pothesis 3. Among those who have high fantasy
aggression but relatively little fear of punish-
ment, there tends to be a great deal of overt
aggressive expression; among those who have
few fantasy aggressive needs and relatively
great fear of punishment, there tends to be
little overt aggression.


TABLE 3
DISTRIBUTION OF DAILY BEHAVIOR REPORT (DBR)
RATIOS AMONG SUBJECTS HIGH IN FANTASY
AGGRESSION (FA) BUT LOW IN PUNISHMENT-
AGGRESSION (P/A) RATIOS AND SUBJECTS LOW IN
FANTASY AGGRESSION (FA) BUT HIGH IN PUNISH-
MENT-AGGRESSION (P/A) RATIOS

                          FA-P/A
DBR RATIOS     HIGH FA-LOW P/A   LOW FA-HIGH P/A
High                  7                 2
Low                   0                 7


TABLE 4
DISTRIBUTION OF WEEKLY RATING SCALE (WRS)
SCORES AMONG SUBJECTS HIGH IN FANTASY
AGGRESSION (FA) BUT LOW IN PUNISHMENT-
AGGRESSION (P/A) RATIOS AND SUBJECTS LOW IN
FANTASY AGGRESSION (FA) BUT HIGH IN PUNISH-
MENT-AGGRESSION (P/A) RATIOS

                          FA-P/A
WRS SCORES     HIGH FA-LOW P/A   LOW FA-HIGH P/A
High                  6                 2
Low                   1                 7
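
The two probabilities for Hypothesis 3 can be checked with the same kind of one-tailed exact calculation, here kept as exact fractions so that the counts behind each value remain visible. The helper name is an assumption introduced for illustration; the cell frequencies are those of Tables 3 and 4.

```python
# Illustrative sketch: exact one-tailed probabilities for Tables 3 and 4.
from fractions import Fraction
from math import comb


def exact_one_tailed_fraction(a: int, b: int, c: int, d: int) -> Fraction:
    """Exact probability of the observed 2 x 2 table or a more extreme one.

    Table layout (margins fixed):   a  b
                                    c  d
    'More extreme' means a larger count in the top-left cell.
    """
    row1, col1, col2 = a + b, a + c, b + d
    n = a + b + c + d
    favorable = sum(
        comb(col1, k) * comb(col2, row1 - k)
        for k in range(a, min(row1, col1) + 1)
    )
    return Fraction(favorable, comb(n, row1))


p_table3 = exact_one_tailed_fraction(7, 2, 0, 7)  # DBR: High (7, 2), Low (0, 7)
p_table4 = exact_one_tailed_fraction(6, 2, 1, 7)  # WRS: High (6, 2), Low (1, 7)

print(p_table3, round(float(p_table3), 3))
print(p_table4, round(float(p_table4), 3))
```

Under these assumptions the sketch gives 9/2860 for Table 3 and 29/1430 for Table 4, which round to the reported .003 and .020.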


DISCUSSION


The results of this study support the hy- 
pothesis that those lower-class Ss who have a
relatively high number of aggressive needs on
the TAT show more overt aggressive behavior
than Ss having a relatively low number of
fantasy aggressive needs. It was also found
that Ss who scored high on punishment press
relative to aggressive needs in their TAT
stories tended to express less overt aggression
than Ss who scored low on punishment press
relative to their aggressive needs. However,
this second relationship was less marked than
that between TAT aggression and overt ag-
gression. When a high number of aggressive
needs on the TAT is accompanied by a low
ratio of anticipation of punishment to ag- 
gressive needs, there is a very strong likelihood 
that there will be a relatively great amount of 
overt aggressive expression; where a low num- 
ber of aggressive needs on the TAT is found 
with a high ratio of anticipation of punish- 
ment to aggressive needs, there is little 
probability that a great amount of aggression 
will be expressed overtly. 

These findings help delineate the conditions 
under which fantasy aggression may be used 
to predict overt aggressive expression, and 
consequently may be very useful in the in- 
terpretation of projective materials. For lower- 
class boys, at least, the amount of aggressive 
need shown by an individual in his TAT 
stories is some indication of the amount of ag- 
gression he will show in behavior. However, 
before we make predictions of overt behavior 
on the basis of the amount of aggressive need 
appearing in TAT stories, some attention 
should also be paid to the amount of punish- 
ment press relative to the aggressive need 
present. These conclusions should not be over- 
generalized since, as has already been pointed 
out, earlier studies have shown that in other 
social classes and age groups the relationship
between covert and overt aggression may be 
negligible or even negative. These findings 
may not be at all applicable to members of a 
social class which has rather rigid taboos 
against the expression of aggression, or to 
older people whose internal controls of ag- 
gressive expression may be more firmly estab- 
lished. 

It is probably impossible to make any over-
all statements about the validity of a projec-
tive test such as the TAT, which may be used
to assess many different aspects of personality.
As Tomkins (9) has pointed out, attention
must be focused on whether or not specific
inferences based on TAT protocols are valid.
If effectiveness in predicting behavior in social
situations is accepted as a criterion of validity,
the findings of this study may be regarded as
evidence for the validity of TAT inferences
about broadly defined aggressive needs.
Further research is necessary to elucidate the
conditions under which other kinds of in-
ferences from the TAT are valid.

Some of these data may be interpreted in
terms of support of that aspect of the frustra-
tion-aggression hypothesis which states that
inhibition of the expression of overt aggres- 
sion will vary with the degree of fear of punish- 
ment. The second and third hypotheses of this 
investigation both concern the relationship of 
fear of punishment to the expression of ag- 
gression. The partial support for the second 
hypothesis, together with the strong support 
for the third, may be taken as evidence in 
favor of the validity of the hypothesis that 
the expression of overt aggression is inversely 
related to the amount of punishment expected. 


SUMMARY 


The interrelationships among aggressive 
needs, anticipation of punishment, and overt 
aggressive behavior in 29 lower-class boys at 
the Bureau of Juvenile Research in Columbus, 
Ohio, were investigated in the present study. 
Analyses of Ss' TAT protocols yielded meas- 
ures of strength of aggressive needs and fear 
of punishment, while ratings and behavior 
reports submitted by attendants provided 
indices of the amount of overt expression of 
aggression. 

There was strong support for the first 
hypothesis of the study, which stated that 
among lower-class boys, those having a rela- 
tively great amount of fantasy aggressive 
needs indulge in more overt aggressive be- 
havior than those who have relatively few 
fantasy aggressive needs. 

As had been predicted from the second 
hypothesis, Ss whose TAT stories included a 
great deal of punishment press (i.e., fear of
punishment) relative to the number of their 
aggressive needs demonstrated less overt ag- 
gression than Ss whose ratios of punishment 
press to aggressive needs were low. The rela- 
tionship between the punishment press—ag- 
gressive needs (P/A) ratio and the behavioral 
aggression was not marked, but the findings 
may be regarded as mildly supportive of the
hypothesis.


The third hypothesis, a synthesis of the 
first two, was strongly supported by the data. 
The hypothesis stated that Ss who have a 
great deal of fantasy aggression accompanied 
by a small degree of fear of punishment rela- 
tive to their fantasy needs (low P/A ratio) 
show more aggression in their behavior than 
those who have a small amount of fantasy 
aggression accompanied by a high degree of
fear of punishment relative to aggressive
needs (high P/A ratio). 


REFERENCES 


1. Davis, A. Socialization and adolescent personality. 
Adolescence, Forty-Third Yearbook, Part I. 
National Society for Study of Education. Chi- 
cago: Univer. of Chicago Press, 1944. 

2. Dollard, J., Doob, L. W., Miller, N. E.,
Mowrer, O. H., & Sears, R. R. Frustration and
aggression. New Haven: Yale Univer. Press, 1939.

3. Fisher, R. A. Statistical methods for research
workers. (7th Ed.) Edinburgh: Oliver & Boyd,
1938.

4. Lindzey, G. Thematic Apperception Test; inter-
pretive assumptions and related empirical evi-
dence. Psychol. Bull., 1952, 49, 1-25.

5. Murray, H. A. Thematic Apperception Test manual.
Cambridge: Harvard Univer. Press, 1943.

6. Murray, H. A., & Stein, M. Note on the selection
of combat officers. Psychosom. Med., 1943, 6,
386-391.

7. Sanford, R. N., Adkins, M. M., Miller, R. B.,
Cobb, E. A., & others. Physique, personality
and scholarship: a cooperative study of school
children. Monogr. Soc. Res. Child Develpm., 1943,
8, No. 1.

8. Symonds, P. M. Adolescent fantasy; an investigation
of the picture story method of personality study.
New York: Columbia Univer. Press, 1949.

9. Tomkins, S. S. The Thematic Apperception Test.
New York: Grune & Stratton, 1947.

Received January 21, 1953.


FIXATION AND INHIBITION 


ALBERT EGLASH 
Mayor's Rehabilitation Committee, Detroit 


FAILING to leap to an open window con-
taining food, a fixated animal on the
Lashley apparatus will jump at a locked
window instead (11, p. 43). Two aspects of this 
behavior may be distinguished: (a) The rat 
always jumps toward the same side of the 
apparatus. This aspect of the behavior has 
been described as the fixating of a response. 
(b) The rat never jumps toward the alternative 
side of the apparatus. This aspect of the be- 
havior will be described here as the inhibiting 
of a response. 

Either aspect may be considered as causing 
the other. Depending upon which is seen as 
cause and which as effect, the problem may
be described either in terms of fixation or in 
terms of inhibition. 


THE PROBLEM AS A FIXATION 


One view of the phenomenon emphasizes 
the animal's customary response of repeatedly 
jumping to one side of the apparatus. This 
view, describing the behavior in terms of 
fixation, implies that a consistent response 
occurs. It places no limit upon the number 
of responses which fail to occur. In the open- 
window situation, there may be dozens of 
possible responses, none of which appears.
The required consistency of behavior is in
terms of the response which occurs.

This view of the behavior describes the rela-
tionship between fixation and inhibition by 
Suggesting that the ongoing response is so 
strong that, persisting, it inhibits the appear- 
ance of any alternative response: a strong 
habitual response → an inhibition of alterna-
tive responses.

This view of the behavior is suggested both
by Maier's frustration theory (11, pp. 35, 39)
and by Mowrer's anxiety theory (14, pp. 355-


THE PROBLEM AS AN INHIBITION 
I should like to discuss an alternative formu-
lation of the problem. This formulation empha-
sizes the animal's repeated failure to jump to
the alternative side of the apparatus. This
view, describing the behavior in terms of in-
hibition, implies that some response consis- 
tently fails to appear. It places no limit upon 
the number of responses which do appear. In 
the open-window situation, there may be 
dozens of responses, all of which occur. The 
required consistency of behavior is in terms of
the response which fails to occur.

This view of the behavior describes the rela- 
tionship between fixation and inhibition by 
suggesting that the inhibiting of the max- 
imally adaptive response forces the animal to 
choose substitutes: inhibition of one response
→ appearance of substitute responses.


INHIBITION OR FIXATION? 


Can either of these formulations be invali- 
dated? With respect to the variability and con- 
sistency which each implies, these two views 
of the phenomenon differ. Where fixation re- 
quires consistency of behavior, an inhibition 
view permits variability. 

Variability. Whereas fixation suggests rigid- 
ity or perseveration, the fixated animal remains 
as flexible as the normal. This flexibility, some- 
times taking the form of a surprising ingenuity, 
has been described by Hilgard (8, pp. 304-305), 
Maier (10, p. 20; 11, pp. 28, 42-43), and Wil- 
coxon (17) in their discussions of “abortive” 
behavior. 

In addition to displaying a variety of re- 
sponses, as described by Maier, the fixated 
animal may dive directly into the net, or leap 
so lightly that it fails to reach the window. It 
may assume a passive role: the air blast de- 
signed to force the jump blows the animal off 
the stand. 

The rat may jump at 45 or 90 or almost 180 
degrees to the habitual window. Feldman, 
Ellen, and Barrett's film (5) shows an animal 
leaping onto the roof of the apparatus. The 
animal may leap at the nosepiece which sep- 
arates the two windows, or at the locked 
window's ledge, and then crawl through the 
open window. 

While no one animal will use all of the above- 
described abortive responses, any given fixated
animal may use several of them. Even after
developing a habitual method of responding to 
the negative card, the animal continues to 
vary its behavior according to the demands of 
the situation. It leaps to the habitual window 
whenever the positive card is there, then uses 
one or more avoidance responses whenever the 
negative card appears. 

In other words, while the normal animal 
avoids the negative card by leaping to the 
alternative window, the fixated animal ac- 
complishes the same end by jumping abor- 
tively. This adaptability suggests substitute, 
rather than “fixated,” behavior. 

Consistency. Although in responding to the 
negative card, and in differentially responding 
to positive and negative cards, the fixated 
animal varies its behavior, it consistently fails 
to make the one response which is maximally 
adaptive: It does not leap to the open window. 
This suggests an inhibition of that response. 


OTHER INTERPRETATIONS OF THE BEHAVIOR 

While failure to display a maximally adap- 
tive response may indicate an inhibition of that 
response, alternative explanations are possible. 
Behavior may be considered a function of 
ability and motivation; unless the animal has 
learned to jump to the open window, and is 
motivated to do so, postulating an inhibition 
is superfluous.


Ability

Has the animal learned that, in this situa- 
tion, jumping to the alternative window is an 
appropriate response? All animals jump to 
that window during the initial training period; 
during the first few days of the insoluble 
problem, most animals try the window several 
times; in the soluble problem and in the open- 
window situation, the nonfixated animals re- 
turn to that response. Apparently it lies within 
the animals’ repertory. There is evidence, in 
other words, that the animals have acquired 
the ability to jump to the alternative window. 


Motivation 


Here we shall seek evidence that the animal 
is motivated to go to the open window, and is 
not motivated to avoid that window. 

Preference. In place of using an abortive 
response, does the fixated animal prefer to 
jump to the open window?

a. One source of data is the animal’s pre-
fixation behavior. During the initial training 
period, an animal can leap either to an open 
window or into the net. Invariably, it jumps 
through a window to the feeding platform. 

b. When the negative card is in the habitual 
window, the window is locked, and the animal 
consequently falls into the net. When the posi- 
tive card is there, the window is unlocked, and 
the animal lands on the feeding platform. 
Resistance to jumping increases whenever the 
fixated animal is confronted with the negative 
card (11, p. 41). 

c. Finally, a direct test of preference may 
be made. When offered a choice between the 
open window on the alternative side and the 
net, the animal dives into the net. It is possible 
to offer the same choice on its habitual side. 
This is done by placing the negative card in 
the alternate window, leaving the habitual 
window open. Rather than dive into the net, 
the animal now jumps through the open win- 
dow. 

Avoidance. Even though the animal prefers
to land on the feeding platform rather than in
the net, an accurate description of the be-
havior might still be given in terms of the
consistent avoiding of the alternate window. 
Rather than inhibiting a response, the animal 
may be avoiding a location. 

Feldman (5; 11, p. 48) has provided a test 
of this hypothesis. After demonstrating that 
the animal would not leap to the open window, 
Feldman built a bridge or runway from the 
jumping platform to that window. The animal,
in place of jumping, could now get to the feed-
ing platform by walking.

Offered this bridge, the fixated animal
walked to the window. When the bridge was
removed, the animal returned to its abortive
jumping.

Two aspects of these results seem relevant
to the hypothesis that the behavior is an
avoidant response:

a. An animal which, up to that time, had 
failed to jump to that window now walked 
there, indicating that it was not motivated to
avoid the window.

b. The animal’s relearning that food was
obtainable in that window had no effect upon
its behavior. Mowrer (14, p. 513) has shown
that an avoidance response, when no longer
appropriate, ends. The relevant variable in
fixation is evidently a response rather than a
location (11, p. 48). 

In short, there is evidence that the animal 
is both motivated and able to jump to the 
alternate window. The one consistent aspect of 
its behavior is its failure to do so. 


INHIBITION AS AN ANSWER 


The hypothesis that we are dealing with the 
inhibiting of a response seems consistent with 
observations that have been made both of 
human and of animal behavior. 

1. Even after showing differential resistance 
to positive and negative cards, the normal as 
well as the fixating animal persists in leaping 
toward its habitual side, and each may use 
abortive responses (17, pp. 329-331); the 
normal animal, however, soon drops this re- 
sponse and leaps to the alternate window. Con- 
sequently, only in what it fails to do does the 
fixated animal differ from the normal. 

Maier (11, pp. 147-148) has observed that 
some neurotic animals differ from normals only 
in the inhibiting of certain functions, while 
Freud (6, p. 11) has made the same observa- 
tion about human neurotics. 

2. The animal's differential resistance to 
positive and negative cards, and its use of 
abortive avoidance responses, suggests that 
with the same goals as the normal animal (to 
avoid punishment and gain rewards), the 
fixated animal takes substitute, ineffective 
paths (6, p. 17). If “behavior without a goal" 
(11, p. i; 16, pp. 25-28) means the behavior of 
an organism without a goal, this designation of 
fixated behavior is a misnomer. 

Like a wrong turn in a maze, or a miss in a 
dart game, fixated behavior might be described 
as misdirected or deflected. The behavior also 
suggests an analogue of reaction formation (6; 
9, pp. 193-194; 14, p. 386; 15, pp. 460-461): 
motivated to jump toward the food in the 
open window, the fixated animal jumps away
from it.

Animal fixation, like human neurosis (2, pp.
13-14), is a failure of an organism effectively to
reach its goals; at best, fixations, like neurotic
symptoms (6, p. 26), are “compromise”
solutions (15, p. 423). If “behavior without a goal”
means that the fixated response fails to get the 
animal to its goal, or is relatively ineffective, 
the description seems accurate. 




3. That a state of frustration or anxiety is 
responsible for the animal's behavior is not 
supported by the experimental findings: 

a. Specificity of the behavior. Were the ani- 
mal’s emotional state the cause of the behavior, 
this behavior would be general rather than 
specific. The fixated animal, generalizing its 
anxiety even to its home cage (11, p. 151), still 
shows normal adaptability in other problem 
situations (11, pp. 47-48). 

b. Effectiveness of guidance. When first of- 
fered guidance, the fixated animal, like the 
neurotic human (7, p. 253), shows strong
resistance (11, p. 53 and Fig. 7). Were the
animal’s behavior frustration-instigated or
anxiety-induced, the blocking of one response and the
forcing of another, which increase frustration 
and anxiety, would be ineffective in ending the 
behavior. Instead, forcing the animal’s re- 
sponse ends (11, p. 53) and prevents (12) ab- 
normal behavior. 

This raises the question of why guidance is 
effective. If the primary problem is the fixating 
of a response, then the importance of guidance 
is that it “prevents the expression of the fix- 
ated response” (11, p. 79). If the primary 
problem is the inhibiting of a response, then 
guidance is effective because it “leads the 
animal through an alternate response” (11, p. 
79). In applying guidance to alcoholism and 
kleptomania, Maier emphasizes the need to 
help the individual perform an act he has been 
unable to perform, rather than to prevent him 
from performing an act he feels he must per- 
form: 


If one applied the guidance method to a persistent 
type of behavior such as dipsomania, it would seem that 
the procedure would entail giving the alcoholic the 
experience of declining drinks. The method of attempting
to keep the alcoholic away from drinks avoids the
problem. . . .

Applied to the illustration of stealing in frustrated 
cases, it seems that the procedure would be a matter of 
taking the child to the store a few times and having 
him make actual purchases (11, pp. 162, 175). 


c. Ineffectiveness of reward. Rewarding ex- 
periences, which reduce frustration and anx- 
iety, should end frustrated, anxious behavior, 
as in Farber’s experiment (4). Instead, allowing 
the animal to rest in its cage for months (11, 
p. 44), to solve other problems successfully 
(11, p. 47), to walk from jumping stand to 
feeding platform (5; 11, p. 48), or to jump with 


244 


the habitual window unlocked,! does not end 
fixation. Maier (p. 175) notes that, at the 
human level, fixated behavior may persist 
after the frustration is alleviated. 

The view that the behavior is a function of 
anxiety or frustration is not consistent with its 
specificity, with the effectiveness of guidance, 
or with the ineffectiveness of reward. An al- 
ternative view is that the animal’s frustration 
is a function of its inhibition. Without this 
inhibition, the animal in the open-window 
situation would experience no frustration. A 
frustrating situation is one in which a barrier
prevents an organism from reaching its goal.
In the open-window situation, where there is
no external barrier, the barrier must be some
aspect of the organism itself. Maier (11, p. 151)
makes a similar suggestion in connection with
human neurotics. The organism may become
frustrated as a result of a conflict between its
“excitatory tendency and an opposing inhibitory
tendency” (1, p. 481).² Rather than frustration
→ fixation, the relationship may be inhibition
→ frustration.

Similarly, in the open-window situation 
there is no external basis for fear. However, 
when the fixated rat inhibits a learned response, 
it apparently regresses to an initial or unlearned 
reaction (8, pp. 83, 94; 11, p. 40): it resembles, 
in its behavior, an untrained animal (11, pp. 
144—145); in its anxiety, a naive animal (9; 11, 
p. 131). Rather than anxiety → fixation, the
relationship may be inhibition → frustration
→ regression → anxiety (6, p. 40).


¹ An unpublished study by the writer, carried out
at the University of Michigan with the cooperation of 
Drs. N. R. F. Maier and Paul Ellen. It was this experi- 
ment which first indicated that some modification must 
be made in the present formulation of frustration the- 
ory. 
² The fixated animal acts as if it had internalized a
barrier to its goal. May (13, pp. 337-344) has made a 
similar observation about human neurotics. He found 
that neurotic anxiety occurred in those who, denying 
the objective reality of a parental rejection, internalized 
the parent-child conflict; this became the internal con- 
flict giving rise to anxiety. 

Maier (11, pp. 132-133) describes an internal con- 
flict between a frustration-instigated compulsive im- 
pulse and a fear-motivated inhibition of the impulse. 
One day a youngster at the beach said that he would 
like to swim under water, but was afraid. Here was the 
opposite kind of conflict from that described by Maier: 
the impulse was felt as motivated, the inhibition as 
compelling. It was this incident which first suggested 
the possibility of reinterpreting Maier’s data in terms 
of inhibition. 




INHIBITION AS A PROBLEM 

The substitution of “inhibition” for “fixation”
does not explain the behavior, but only
describes it. Rather than solving the problem
of behavior which “is at one and the same time
self-defeating and yet self-perpetuating” (14, p.
351), this inhibition constitutes the problem.

While a cognitive theory can explain much 
of Maier’s data, it leaves this central problem,
the failure of the animal to leap to the open 
window, unsolved (3). Reinforcement (anxiety- 
reduction) theory will also account for much 
of the data, but Mowrer points out that this 
too leaves the essential problem unsolved: 


The only question which Maier’s results leave unanswered
is why Maier’s subjects show less behavior
variation when fear-dominated than when hunger-dominated.
. . . Several investigations . . . have indicated
that living organisms behave less freely, less
flexibly when being subjected to punishment than when
solely under the influence of reward (14, p. 357).


In one sense it is not true that the fixated 
animal behaves less flexibly. It has a repertoire 
of responses larger than that shown by the 
normal animal. In a more important sense, 
however, the animal is indeed less free, for the
maximally adaptive response is not displayed. 

This is the same dilemma which, at the 
human level, Freud described as unsolved (6,
pp. 89-92). To explain this inhibition is to
solve the problem of animal fixation, and the 
explanation may, by analogy, help solve the 
riddle of human neurosis. 


SUMMARY 


To the current view that animal fixation 
occurs when a persistent response inhibits the 
expression of more adaptive behavior, an alternative
view is offered: An underlying inhibition
leads to substitute behavior. This view
seems consistent both with Maier’s experimental
findings with animals and with his
application of the derived principles to human
guidance.

REFERENCES

1. Brown, J. S., & Farber, I. E. Emotions conceptualized
as intervening variables. Psychol. Bull.,
1951, 48, 465-495.

2. Dollard, J., & Miller, N. E. Personality and
psychotherapy. New York: McGraw-Hill, 1950.

3. Eglash, A. Perception, association, and reasoning
in animal fixations. Psychol. Rev., 1951, 58.

4. Farber, I. E. Response fixation under anxiety and
nonanxiety conditions. J. exp. Psychol., 1948, 38,
111-131.

5. Feldman, R. S., Ellen, P., & Barrett, R. H.
Frustration and fixation. (Film.) State Coll., Pa.:
Psychol. Cinema Reg., 1951.

6. Freud, S. The problem of anxiety. New York:
Norton, 1936.

7. Freud, S. A general introduction to psychoanalysis.
New York: Garden City, 1943.

8. Hilgard, E. R. Theories of learning. New York:
Appleton-Century-Crofts, 1948.

9. Liddell, H. S. The role of vigilance in the development
of animal neurosis. In P. H. Hoch & J.
Zubin (Eds.), Anxiety. New York: Grune &
Stratton, 1950. Pp. 183-196.

10. Maier, N. R. F. Studies of abnormal behavior in the
rat. New York: Harper, 1939.

11. Maier, N. R. F. Frustration. New York: McGraw-Hill,
1949.

12. Maier, N. R. F., & Klee, J. B. Studies of abnormal
behavior in the rat: XII. The pattern of
punishment and its relation to abnormal fixations.
J. exp. Psychol., 1943, 32, 377-398.

13. May, R. The meaning of anxiety. New York:
Ronald, 1950.

14. Mowrer, O. H. Learning theory and personality
dynamics. New York: Ronald, 1950.

15. Munn, N. L. Handbook of psychological research
on the rat. New York: Houghton Mifflin, 1950.

16. Segel, D. Frustration in adolescent youth; its
development and implications for the school program.
U.S. Office Educ. Bull., 1951, No. 1.

17. Wilcoxon, H. C. “Abnormal fixations” and learning.
J. exp. Psychol., 1952, 44, 324-333.


Received January 21, 1953. 


ETHNOCENTRISM AND MISANTHROPY¹


PATRICK L. SULLIVAN
VA Mental Hygiene Clinic, Oakland, California²

AND JOSEPH ADELSON
Bennington College


IT HAS been clear for some time that
persons who are prejudiced against a
particular ethnic minority do not ordinarily
limit their antipathies to the group in
question. Those who express an animus against,
say, Negroes will tend also to express hostile
attitudes toward a diversity of other groups,
including some which are fictitious (8). The
ethnocentric person, despite his willingness to
adduce specific reasons for rejecting designated
groups, nevertheless appears to be oriented
toward a general rejection of others.

Recent researches into prejudice, particularly
those of the University of California
Public Opinion Study (hereafter, UC-POS)
(1), have given further documentation to this
observation. Perhaps the most important
discovery stemming from these investigations,
however, has been the finding that ethnic
prejudice is somehow rooted in, or at the
very least associated with, personality
organization. The persons designated as prejudiced
are found to have undergone particular
processes of socialization (1, 6), are shown to
utilize peculiar vocabularies of motives (1),
are seen to differ from the unprejudiced with
respect to such ostensibly peripheral functions
as memory (4), cognition (13), social perception
(14), and so on. This insight into the
nature of prejudice (that it is functional to
personality) enables us to understand the
ubiquitous antipathies of the ethnocentric.

Studies linking authoritarianism to ethnocentrism
have yielded still other suggestions:
the early interpersonal experiences of these
persons have been such that they are given to
cynicism regarding the motives of others (5),
a perception of others as hostile and threatening
(1, 5, 14), a general suspiciousness (1, 7);
in short, they are marked by an apparent
misanthropy.

¹ The authors are indebted to Drs. H. G. Gough
and M. B. Freedman for their aid in the preparation
of this paper.

² From the Veterans Administration Regional Office,
San Francisco, California.

Turning now to the measuring instruments 
used in the investigations of ethnic attitudes— 
usually, direct questionnaires—it is seen that 
the usual item can be separated into at least 
two elements: (a) a reference to a particular
ethnic minority and (b) a statement, usually
derogatory although often couched in
pseudodemocratic terminology, pertaining to some
presumed attribute of the group in question,
e.g., sensuality, greed, clannishness, and so
on. It is assumed, if only implicitly, that there 
is a necessary linkage in the imagery of the 
respondent between the minority group and 
the invidious imputation. If, however, we
formulate ethnocentrism as misanthropy, the
possibility arises that assent to statements
derogatory toward a specific group may
reflect or express hostility toward people in
general; or, that the designation of the
particular outgroup is not as important in
securing assent as is the imagery of hate and
exclusion.³ The study to be reported is an
attempt to test this hypothesis.


³ The existence of such an underlying misanthropy
is suggested by Gough (6). He points to the existence
of sets and attitudes which obscure the measurement
of sought-after variables, citing the K factor of the
Minnesota Multiphasic Personality Inventory as an
example (10). Gough considers misanthropy to be a
variable underlying and confounding the measurement
of ethnocentrism.


METHOD


A 29-item questionnaire was developed in the following
manner: Preliminary Ethnocentrism scale items
used in the UC-POS were revised so that references to
specific minorities were omitted; the words “people,”
“most people,” or “human(s)” were substituted. It
was found necessary to eliminate from consideration
a few items because they related too directly to stereotypes
about particular groups, e.g., the Jews’ alleged
crucifixion of Christ. As may be seen from the examples
given below, elimination of these items did not
constitute a serious departure from the plan of retaining
the derogatory imputations. Aside from the substitutions,
the original wording was retained except in
instances where sensibleness necessitated slight revisions.
Typical of the 29 items are the following, listed
with their original counterparts:


Original: Jews seem to prefer the most luxurious, extravagant,
and sensual way of living.
Revised: People seem to prefer the most luxurious, extravagant,
and sensual way of living.

Original: Jews tend to lower the general standard of living by
their willingness to do the most menial work and to live under
standards that are far below average.
Revised: Most people tend to lower the general standard of
living by their willingness to do the most menial work and to
live under standards that are far below average.

Original: Jews should be more concerned with their personal
appearance and not be so dirty and smelly and unkempt.
Revised: People should be more concerned with their personal
appearance and not be so dirty and smelly and unkempt.

Original: The Negroes would solve many of their social problems
by not being so irresponsible, lazy, and ignorant.
Revised: People would solve many of their social problems
by not being so irresponsible, lazy, and ignorant.


The scale thus devised (hereafter denoted as M, for 
misanthropy) was administered to 221 students of 
elementary psychology at a midwestern university. 
The subjects (Ss) at the same session completed a 
UC-POS 20-item Ethnocentrism (E) scale (1). It should 
be made clear that the E scale statements did not dup- 
licate in phrasing any of the M scale items; that is, 
the M scale items used in the study were not among 
those included in the final, 20-item E scale but were 
drawn from the larger sample of preliminary items used 
in evolving the E scale. Two of the four classes tested 
received an M-E order of presentation; the order was 
reversed for the other two. It may be noted that the
mean E and M scale scores for the two sequences of
administration did not differ significantly from each
other: the p values for the critical ratios were .23 and
.62 for E and M, respectively.

The Ss were permitted six categories of response for
both scales, ranging from complete agreement (+3) to
complete disagreement (-3), a middle category being
excluded. The responses were translated into a seven-point
scoring scale, with maximum disagreement receiving
one, maximum agreement seven, and omissions
four points. Low scores, then, indicated relative absence
of prejudice and misanthropy.
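
The scoring rule just described can be sketched in a few lines of Python. This is an illustration only, not the authors' procedure: the text fixes just three points (maximum disagreement = 1, maximum agreement = 7, omission = 4), so the intermediate values (-2 → 2, -1 → 3, +1 → 5, +2 → 6) are an assumed even spacing, and the names `RESPONSE_TO_SCORE`, `score_item`, and `score_scale` are hypothetical.

```python
# Assumed mapping: only -3 -> 1, +3 -> 7 and omission -> 4 are stated in
# the text; the intermediate values are an evenly spaced assumption.
RESPONSE_TO_SCORE = {-3: 1, -2: 2, -1: 3, 1: 5, 2: 6, 3: 7}
OMISSION_SCORE = 4  # an omitted item receives the midpoint


def score_item(response):
    """Convert one response (+3..-3, or None if omitted) to a 1-7 score."""
    if response is None:
        return OMISSION_SCORE
    if response not in RESPONSE_TO_SCORE:
        raise ValueError(f"response must be one of -3..-1, 1..3 or None: {response!r}")
    return RESPONSE_TO_SCORE[response]


def score_scale(responses):
    """Total scale score: the sum of item scores (low = little prejudice)."""
    return sum(score_item(r) for r in responses)
```

Under these assumptions a 29-item M protocol can total anywhere from 29 to 203, and a wholly omitted protocol would total 116.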


⁴ A more detailed form of this scale has been deposited
with the ADI. Order Document 4034 from the
ADI Auxiliary Publications Project, Photoduplication
Service, Library of Congress, Washington 25, D. C.,
remitting in advance $1.25 for photoprints or $1.25
for microfilm. Make checks or money orders
payable to Chief, Photoduplication Service, Library
of Congress.


RESULTS 


The test of the major hypothesis is in the 
degree of association between the two scales; 
the product-moment correlation is .43. The 
corrected odd-even reliabilities for E and M 
are .84 and .79, respectively. Correcting for 
attenuation, we obtain an r of .53. Both 
correlation coefficients are significant at the 
.001 level of confidence. 
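
The arithmetic behind these figures can be checked with a short Python sketch. It assumes the standard correction for attenuation (the observed r divided by the square root of the product of the two reliabilities) and, for the significance statement, the usual t transformation of r with N - 2 degrees of freedom, taking N = 221 from the Method section. That the "corrected" reliabilities were stepped up by the Spearman-Brown formula is likewise an assumption; the function names are illustrative.

```python
import math


def correct_for_attenuation(r_xy, rel_x, rel_y):
    """Estimated correlation between true scores, given two reliabilities."""
    return r_xy / math.sqrt(rel_x * rel_y)


def spearman_brown(r_half):
    """Step a split-half (e.g., odd-even) correlation up to full length.

    Assumed to be the correction behind the reported reliabilities;
    the half-test correlations themselves are not given in the text.
    """
    return 2 * r_half / (1 + r_half)


def t_for_r(r, n):
    """t statistic for testing r against zero, with n - 2 degrees of freedom."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r ** 2)


r_corrected = correct_for_attenuation(0.43, 0.84, 0.79)
print(round(r_corrected, 2))  # 0.53, matching the reported value

t_observed = t_for_r(0.43, 221)  # roughly 7, with 219 degrees of freedom
```

A t of about 7 on 219 degrees of freedom lies far beyond the conventional .001 critical value, which is consistent with the significance level reported for the uncorrected coefficient.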


DISCUSSION


The meaning of the results may be illumi- 
nated if we imagine polar opinions concerning 
the nature of prejudice. One extreme would 
insist that prejudice is primarily, if not entirely, 
determined by personality dynamics, that if 
minority groups did not exist inner needs 
would invent them. An opposing view would 
hold that prejudice is unrelated to needs and 
motives, that it is attributable to the specific 
social and historical conditions which place 
certain groups at a disadvantage. To the 
extent that the present data are relevant to 
this dialectic, they suggest a middle position: 
On the one hand, misanthropy—which would 
conceivably be one of the key central variables 
to which proponents of the first view would 
reduce all prejudice—is associated with ethnic 
attitudes, as witness our correlation. On the 
other hand, the correlation is certainly not 
large enough to demonstrate that prejudice is 
isomorphic with an underlying misanthropy 
or that the designation of particular minorities 
as objects of hate is adventitious, i.e., free from 
social press. 

That our major hypothesis should have been 
substantiated by the results is not surprising 
when we remember that previous studies have 
indicated the prevalence of hostile, suspicious, 
cynical feelings in the authoritarian toward 
others. What is worthy of note here is the 
generality of reference of these attitudes. It
appears, at least on the surface, that for many
of the antidemocratic there may be no ingroup
other than the self.⁵ Whereas previous
conceptualizations of ethnocentrism have stressed
ingroup-outgroup dichotomization, it may
ultimately be shown that this dichotomy does
not exist for many of the high-scoring persons
and that they are totally exclusionistic. The
outstanding example of such a person is the
paranoid schizophrenic, who, it appears from
clinical observation, is very strongly characterized
by the tendencies and predispositions
we have been discussing and who, we know, is
almost completely estranged from others.

⁵ Indeed, antipathies may even be directed toward
the self or aspects of the self. Erikson (3, p. 215) sees
the rejection of minorities as a manifestation of
intolerance of tendencies within the self which arise
genetically from unresolved nuclear conflicts. Again,
Rogers (12) and his associates have stressed the salience
of negative attitudes toward the self concept in
maladjustment.

If we direct our attention again to the com- 
ponents of the usual questionnaire used in 
antiminority researches, the fact that there is 
reason to question the existence of ingroups for 
some prejudiced persons may be more clear. 
What we find, to repeat, are (a) designations 
of specific groups with (b) certain charac- 
teristics attributed to them: The outgroups 
are concretely specified and defined by the 
items and by the pre-existing stereotypy ap- 
pealed to. For the ostensible ingroup, no such 
concreteness of specification and description is 
provided. Rather, the assumption is made that 
"opposite" groups with "opposite" attitudes 
directed toward them must exist on the basis 
of the information gathered concerning out- 
group attitudes. This is not to say that there 
may not be ingroups with which many ethno- 
centric persons are identified, but that evidence 
derived from direct questionnaires does not 
provide an adequate basis for this conclusion. 

An exception to the practice of limiting 
scale items to the two components noted above 
is offered by the Ethnocentrism scale, which 
was designed with an eye toward representing 
ingroups in the items. For example, some of 
the items concerning Negroes indicate whites 
as the ingroup, Jews are opposed to Christians, 
etc., so that what emerges is an ingroup com- 
prised of "native, white, Christian Ameri- 
cans" (1, p. 109). Adding these ingroup 
references leads to the necessity of making
assumptions concerning the characteristics of
the ingroup. These characteristics may not
typify the ingroups of all the ethnocentrics. 
Rather than "defending" an ingroup with 
which they are identified, these persons may 
only be availing themselves of the opportunity 
to express exclusionistic feelings, toward the 
specified groups or generally. It is suggested, 
therefore, that the introduction of references
to ingroups into antiminority group
questionnaires, based as they are on a priori
determinations, results in the confusion of
variables and should be avoided.

Underlying the foregoing is the important 
conceptual question of whether investigations 
into attitudes toward groups are best guided 
by a psychological or by a sociological frame 
of reference. Consider, for example, the grounds 
on which one can make a priori determinations 
of ingroups—and, for that matter, outgroups. 
In the designation, native, white, Christian 
Americans, an essentially sociological defini- 
tion has been resorted to. Consider also 
the UC-POS definition of ingroup and out- 
group: “‘Ingroup’ and ‘outgroup’ are socio- 
psychological rather than purely sociological 
concepts, since they refer to identification and, 
so to speak, contra-identification, rather than 
to formal membership in the group" (see 1, 
pp. 146-147). In this and in related text,
reference is made to a primarily psychological 
point of view. Yet, when the major outgroups 
are enumerated, it is apparent that a socio- 
logical orientation is used. We do not take 
issue with the right or the necessity of doing 
this, at least insofar as outgroups are con- 
cerned, but must cite the fact of a sociological 
view in order to make a further point. It
might parenthetically be noted that raising 
such considerations should help us clearly to 
define the nature of the investigations under 
discussion: They have been focused on the 
extremely important problem of measuring 
tolerance and intolerance of socially and 
economically deprived and oppressed minorities.

We would, however, separate sharply from 
this sort of interest that having to do with 
group identification and contraidentification 
and would contend that investigations into the 
latter phenomena must necessarily utilize 
different methods and different concepts.
Identification and contraidentification are
differentiated here from ethnic tolerance and
intolerance, for, although there may
undoubtedly be overlap, the two cannot be
equated. Identification as a concept in customary
usage denotes a highly idiographic, dynamic
process, in which idiosyncratic percepts,
needs, and feelings are of crucial importance
and defy externally imposed definitions. Here,
then, a psychological approach must be used.

The need for the separation of the tolerance 
and identification dimensions is suggested by 
Hartley’s study (8) in which a completely 
fictitious group was presented for rejection or 
acceptance. That many persons manifested 
intolerance of this nonexistent group reveals 
their prejudiced attitudes toward the strange 
and the alien. It does not, however, demon- 
strate contraidentification, for this term 
denotes, among other things, the association 
with the group in question of a set of attributes, 
learned either through experience or the 
propagation of stereotypes. 

Our arguments, then, lead us to conclude 
that further research in this area is necessary. 
It is apparent that inferences concerning 
ingroup attitudes from data concerned with 
the outgroup are untenable, yet it is also clear 
that our own information does not provide 
sufficient support for a supposition of an utter 
absence of ingroup feeling among the mis- 
anthropic. The topics which need investigating 
are identity and identification. We need in- 
formation concerning both the existential 
functioning of the affiliative and self-regarding 
sentiments and the processes by which these 
dispositions become existent. There is con- 
siderable evidence that writers in every area 
of social science are similarly persuaded. In 
Toward a General Theory of Action (11) such 
diverse theorists as Parsons, Shils, Tolman, and
several others join in pointing to the salience 
and significance of the identification concept 
in contemporary social science research. And 
it is significant, we feel, that Erikson (3) has 
made the notion of identity so central to 
his conception of ego development and
functioning, and that Burke (2) has used the term
“identification” as a crucial one in his analysis
of language.

M AS A SCALE


Since the purpose of this study was the
testing of a hypothesis rather than the
construction of a reliable instrument for the
measurement of the misanthropy variable, no
attempt at further refinement of the M scale
was made. A normal distribution of scores was,
however, obtained; the reliability figure of .79
seems adequate under these circumstances.
It would seem that the present instrument
could be readily available for use as a scale
after the omission of the less differentiating
items and possibly the rewriting of others. The 
validation would, of course, be a more difficult 
though not an insuperable problem. Of par- 
ticular interest would be the replications of 
researches into various functions, e.g., memory, 
social perception, problem solving, etc., using 
M instead of or along with E in selecting 
experimental Ss. Whether M would provide 
a basis for better differentiation of Ss differing 
in these functions is, of course, a matter for 
empirical determination. The M scale would
also appear to provide an instrument for 
identifying the nonmisanthropic among ethno- 
centrics, who would presumably be more sus- 
ceptible to re-education toward greater toler- 
ance. 


SUMMARY 


Twenty-nine items from existing scales of 
ethnic prejudice were rewritten so that the 
terms “people” or “most people” or
“human(s)” were substituted for the specific
minorities originally designated. The scale 
thus constructed (termed M for misanthropy) 
was found to be correlated .43 (.53 when cor- 
rected for attenuation) with a 20-item version 
of the UC-POS scale for general ethnic in- 
tolerance. The results of the study were dis- 
cussed with reference to the possible connec- 
tions between prejudice and misanthropy. 
The implications of misanthropic imagery for 
attitude scale construction were noted, and 
certain methodological considerations were 
raised. Some suggestions were made pertaining 
to the need for revising our present conceptual- 
ization of ingroup-outgroup identification and 
contraidentification. Finally, it was indicated 
that the present instrument could, after some 
work, be used for subsequent research in this 
area. 


REFERENCES 


1. Adorno, T. W., Frenkel-Brunswik, Else,
Levinson, D. J., & Sanford, R. N. The authoritarian
personality. New York: Harper, 1950.

2. Burke, K. A rhetoric of motives. New York:
Prentice-Hall, 1950.

3. Erikson, E. H. Childhood and society. New York:
Norton, 1950.

4. Fisher, J. The memory process and certain psychological
attitudes, with special reference to the
law of Prägnanz. I. Study of nonverbal content.
J. Pers., 1951, 4, 406-420.

5. Frenkel-Brunswik, Else, & Sanford, R. N.
Some personality correlates of anti-Semitism.
J. Psychol., 1945, 20, 271-291.

6. Gough, H. G. Studies of social intolerance: I.
Some psychological and sociological correlates
of anti-Semitism. J. soc. Psychol., 1951, 33,
237-246.

7. Gough, H. G. Studies of social intolerance: III.
Relationship of the Pr scale to other variables.
J. soc. Psychol., 1951, 33, 257-262.

8. Hartley, E. L. Problems in prejudice. New York:
King’s Crown Press, 1946.

9. Levinson, D. J., & Sanford, R. N. A scale for
the measurement of anti-Semitism. J. Psychol.,
1944, 17, 339-370.

10. Meehl, P. E., & Hathaway, S. R. The K factor
as a suppressor variable in the Minnesota Multiphasic
Personality Inventory. J. appl. Psychol.,
1946, 30, 525-564.

11. Parsons, T., & Shils, E. A. (Eds.) Toward a
general theory of action. Cambridge: Harvard
Univer. Press, 1951.

12. Rogers, C. R. Client-centered therapy; its current
practice, implications, and theory. Boston: Houghton
Mifflin, 1951.

13. Rokeach, M. Generalized rigidity as a factor in
ethnocentrism. J. abnorm. soc. Psychol., 1948,
43, 259-278.

14. Sullivan, P. L. An investigation of the concepts
governing the categorization of people, with
special reference to certain social attitudes and
values. Unpublished doctor's dissertation,
Univer. of California, Berkeley, 1950.


Received January 23, 1953. 


CONFORMING BEHAVIOR OF PSYCHIATRIC AND
MEDICAL PATIENTS¹


JACOB LEVINE, JULIUS LAFFAL, MARTIN BERKOWITZ,
VA Hospital, West Haven, Connecticut

JAMES LINDEMANN,
Pennsylvania State University

AND JOHN DREVDAHL
University of Nebraska

THE symptoms of the neurotic or emotionally
disturbed individual usually
find expression in difficulties in his
interpersonal relationships, particularly the 
more intimate ones. We observe that the emo- 
tional difficulties of the neurotic are manifest 
also in some ways in his group behavior, either 
by failure to integrate into a group or by in- 
ability to conform to group standards. It is 
probably safe to postulate that the more 
severely disturbed an individual is in his 
interpersonal relationships, the more likely it is 
that that disturbance will be reflected in his 
group behavior. We may expect at least some 
disturbance in the neurotic’s integration into 
a group or in his ability to conform to group
demands. But we have few systematic studies 
that would provide substance to this type of 
expectation. 

The present study was designed to test the 
assumption that the neurotically disturbed 
person will show some difficulty in his group
behavior. On the basis of this assumption we
may predict that one expression of this dif-
ficulty will be the relative inability to adjust
to or be influenced by group standards. The 
disturbed individual, hampered by anxiety and 
emotional conflict, will not readily shift his 
behavior to accord with that of the group of 
which he is nominally a part. 

In order to measure the influence of group
norms upon the neurotic’s judgments, we have
made use of the autokinetic phenomenon. In
a study of group influences upon the median
autokinetic judgments, Sherif (5) found
that the successive judgments of individuals
tended to converge toward a common group
norm when they were tested in groups. He
interpreted this finding as indicating a tend-
ency of individuals as members of a group to
conform to group standards. He further found
that the convergence of the medians was not as
close when subjects started with an individual
session and were then brought into a group as
when they began in the group situation.
Finally, he observed that when a group mem-
ber was subsequently tested alone he adhered
closely to the group norm and range of judg-
ments. Jenness (3) also found this type of
convergence resulting from group influence in
the estimation of the number of beans in a jar.

1 This study was conducted at the VA Hospital,
West Haven, Connecticut, and is submitted with the
approval of the Chief Medical Director, Veterans Ad-
ministration, Washington, D. C. The opinions ex-
pressed here are those of the authors and not of the
Veterans Administration.

Bovard (1) later found that group influence 
on convergence toward a group norm was 
greater in a group-centered than in a leader- 
centered group. He interpreted these findings 
to mean that there was much greater inter- 
personal interaction in the group-centered than 
in the leader-centered group. In another study, 
Levine and Butler (4) confirmed, with judg- 
ments of performance ratings, Lewin's earlier 
finding that a group-centered group was more 
effective in modifying individual behavior to- 
ward more socially acceptable norms than was 
a leader-centered group. These studies all sug- 
gest that the degree of convergence of the 
judgments of individuals toward a group norm 
is largely determined by the amount of inter-
personal interaction.


On the basis of these findings, the present 
study was designed to test the hypothesis that 
the emotionally disturbed individual will inter- 
act less in the group situation than will the 
individual who is not so incapacitated. Spe- 
cifically, a comparison will be made with re- 
spect to the degree of convergence toward a 
group norm of autokinetic judgments between 
subjects (Ss) who were hospitalized because
of emotional difficulties and those who were
hospitalized for minor medical illnesses. If our 
assumption is correct, the autokinetic judg- 
ments of the neurotics will not converge in the 
group situation to the same degree as will 
those of the nonneurotics. 


METHOD 


The method of inducing the autokinetic effect in 
the present study was essentially similar to that used 
by Sherif (5). The Ss, either singly or in groups of three 
to six, depending on the experimental session, were 
brought into a completely darkened room. They were
told that they would see a pin point of light which would 
move, and that they were to estimate the greatest 
distance it moved in any one direction from the start- 
ing point. The instructions were: 

We want to see how well you can judge the 
movement of a point of light in complete dark- 
ness. In a moment the light will go out and 
when I say “Ready,” you will see a point of light 
before you. Watch this point. It will begin to 
move. A few seconds later it will disappear. 
When it disappears 

(For individual sessions): I want you to call 
out immediately how far it moved. 

(For group sessions): Dr. —— will call out
your names individually and when he does I 
want you to call out how far you saw the light 
move. 

The point may move in several directions. 
I want you to give the furthest distance it 
moves in any one direction. Give your judg- 
ment in inches. 

After you have given your judgment the 
light will appear again when I say “Ready.” A 
few seconds later it will disappear. When it 
disappears... (appropriate instructions for 
individual and group).... We will do this 
several times. 

There were ten trials within each session, and there 
were four separate sessions. In each trial the pinpoint 
of light was exposed for 30 seconds. One E remained 
in the experimental lightproof room, and two Es were 
outside the room in an alcove separated from the ex- 
perimental room by opaque blanket drops. One of the 
outside Es operated the pin-point and room light 
switches in response to signals called by E in the room, 
and one called out the names of Ss in the room when 
the time came for reporting judged distances, and re- 
corded the judgments which were given aloud by Ss. 

The Ss were all hospitalized patients at a VA general
hospital and were divided into two major groups. One 
group of ten patients came from the open neuropsy- 
chiatric service. Those with psychotic symptoms were 
excluded from this group. The other group consisted 
of seven patients from the medical wards of the hospital 
with minor organic complaints but with no evident 
disabling emotional difficulties. Cases with the common 
psychosomatic illnesses were ruled out of this latter
group. For purposes of differentiation between the two
groups, we shall refer to the first as the “neurotic” and 
the second as the “nonneurotic” group. 




The Ss in each of these two major groups were di- 
vided in random fashion into two subgroups which then 
differed according to the sequence of individual (I)
and group (G) sessions. Thus one neurotic subgroup and
one nonneurotic subgroup started the experiment with
an individual session in which each member of the
group observed the pin-point light alone for ten trials,
followed by three successive group sessions, spaced one-
half day apart, in which the subgroup as a whole
observed the pin-point light. The other two subgroups
started the experiment with three group sessions spaced
at half-day intervals, followed by an individual
session at a one-half-day interval. Four groups were
thus obtained, identified henceforth as group A, the 
neurotic IGGG group; group B, the neurotic GGGI 
group; group C, the nonneurotic GGGI group; and 
group D, the nonneurotic IGGG group, according to 
the sequence of individual and group sessions. 

The fundamental datum in this study was the
amount of variability shown within each trial by the
groups of Ss. For each of the ten trials per session,
variability was calculated by the formula $\sum x^2/(n - 1)$,
which will be recognized as an estimate of variance.
These variability scores, of which there were ten in
each session, were then used as the basic scores in an
analysis of variance to determine if the groups differed
from each other in mean variability from session to
session. A table similar to Table 1 was constructed
showing the means of the variabilities for the groups
and combinations of groups, and the standard devia-
tions of these means, for each session. In this initial
table it was evident that the means were related to
their standard deviations, and that the data conse-
quently failed to meet the requirements of normal
distribution and homogeneous variance in analysis of
variance. A logarithmic transformation of the vari-
ability scores (2, p. 203) was effected by adding 1 to
each score and converting to its logarithm. Table 1
presents the means and standard deviations of the
log-transformed variability scores. The effect of the
transformation was to reduce the rank-order correla-
tion between means and standard deviations from .45
to .36, the latter being statistically nonsignificant. As
a result of the log transformation it was possible to
assume that the variances of the variability scores were
homogeneous and normal among all groups, thereby
meeting the requirements of the analysis of variance
which was then carried out.
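The scoring just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the judgments are hypothetical, the function names are invented here, and the logarithm is taken to base 10, which the text does not specify.

```python
import math
from statistics import variance

def trial_variability(judgments):
    """Variance estimate for one trial: sum of squared deviations / (n - 1)."""
    return variance(judgments)

def log_transform(score):
    """Add 1 to the score and take its logarithm (base 10 is an assumption)."""
    return math.log10(score + 1)

# Hypothetical judgments, in inches, from a four-person group on one trial.
judgments = [2.0, 4.0, 4.0, 6.0]

raw = trial_variability(judgments)   # squared deviations sum to 8; 8 / 3
transformed = log_transform(raw)

print(round(raw, 3), round(transformed, 3))
```

Ten such transformed scores per session would then serve as the basic scores entering the analysis of variance.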


RESULTS 


Table 1 gives the logarithmically trans-
formed mean variabilities of judgments of the
autokinetic effect for each group and combina-
tion of groups in each session. Group A, the
neurotic IGGG group, had the greatest over-all
variability. As a matter of fact, the variability
of this group increased from session to session
despite the shift to group sessions. Group D,
the nonneurotic IGGG group, was next in
over-all variability. But for this group there
appeared to be a general decrease in variability
in successive group sessions, a trend in a direc-
tion opposite to that of group A. The other
neurotic group, group B (GGGI), ranked third
in over-all variability, but there did not appear
to be any consistent trend from session to ses-
sion. The nonneurotic GGGI group, group C,
ranked lowest in over-all variability. The great-
est variability in this group appeared in the
final session after the three group sessions,
which otherwise showed no consistent trend.


TABLE 1
MEAN VARIABILITY OF JUDGMENTS OF THE AUTOKINETIC EFFECT
(Logarithmically transformed scores)

                                      SEQUENCE                  SESSIONS               ALL
GROUP*      TYPE OF S                 OF SESSIONS           1      2      3      4     SESSIONS

A (6 Ss)    Neurotic                  IGGG            M    .67    .86   1.05   1.17    .94
                                                      SD   .31    .33    .32    .16   1.65
B (4 Ss)    Neurotic                  GGGI            M    .79    .68    .72    .82    .74
                                                      SD   .20    .25    .34    .24   1.29
C (4 Ss)    Nonneurotic               GGGI            M    .37    .51    .22    [?]    .45
                                                      SD   .25    .27    .15    .22    .81
D (3 Ss)    Nonneurotic               IGGG            M    .95    .96    .86    .85    .90
                                                      SD   .34    .39    .40    .39   1.58
A and D     Neurotic and nonneurotic  IGGG            M    .81    .91    .95   1.01    .92
                                                      SD   .36    .36    .38    .34   1.62
B and C     Neurotic and nonneurotic  GGGI            M    .58    [?]    .47    [?]    .60
                                                      SD   .31    .27    .36    .23   1.09
A and B     Neurotic                  IGGG and GGGI   M    [?]    [?]    .88    .99    .84
                                                      SD   .27    .31   1.16    .26   1.48
D and C     Nonneurotic               IGGG and GGGI   M    .66    .78    .54    .79    .68
                                                      SD   .42    .40    [?]    .32   1.28
All groups                                            M    .70    .74    [?]    .89    .76

* Means of groups A, B, C, and D are based on 10 variability scores within each session. Means of the combinations of groups are
based on 20 scores, and means of all groups are based on 40 scores.


Table 2 shows the significance of the vari-
ability trends for each of the individual groups
and combinations of groups, as compared to a
horizontal line, based on analysis of variance.
Groups A, C, and D showed significant dif-
ferences in variability from session to session,
whereas group B showed no such difference.
Inspection of the mean values for group A
showed that the significant trend lay in the
direction of a general increase in variability
from session to session. For group D the trend
was in the opposite direction, toward decrease
of variability from session to session. In the
case of group C the trend was not uniform,
being significant probably because of the large
difference between the final group session,
which was low in variability, and the last
(individual) session, which had relatively high
variability.


TABLE 2
TRENDS IN VARIABILITY OF JUDGMENTS OF THE
AUTOKINETIC EFFECT FROM SESSION TO SESSION,
BASED ON ANALYSIS OF VARIANCE

                                      SEQUENCE         DIRECTION
GROUP       TYPE OF S                 OF SESSIONS      OF TREND      p

A           Neurotic                  IGGG             Increase      .01
B           Neurotic                  GGGI             None          n.s.
C           Nonneurotic               GGGI             Mixed         .01
D           Nonneurotic               IGGG             Decrease      .01
A and D     Neurotic and nonneurotic  IGGG             None          n.s.
B and C     Neurotic and nonneurotic  GGGI             Mixed         .05
A and B     Neurotic                  IGGG and GGGI    Increase      .05
D and C     Nonneurotic               IGGG and GGGI    None          n.s.
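Because each single-group mean in Table 1 rests on the same number of scores (10 per session), a combined-group mean should equal the simple average of its two component means, apart from rounding. The short check below illustrates this for the "A and D" row, using the session means printed in Table 1; the variable and function names are illustrative only.

```python
# Session means (log-transformed variability) as printed in Table 1.
group_a = [0.67, 0.86, 1.05, 1.17]          # neurotic IGGG
group_d = [0.95, 0.96, 0.86, 0.85]          # nonneurotic IGGG
printed_a_and_d = [0.81, 0.91, 0.95, 1.01]  # combined row, 20 scores per session

def pooled_means(first, second):
    """Average two equally weighted sets of session means."""
    return [(x + y) / 2 for x, y in zip(first, second)]

computed = pooled_means(group_a, group_d)

# Largest disagreement between recomputed and printed values.
max_gap = max(abs(c - p) for c, p in zip(computed, printed_a_and_d))
print([round(c, 3) for c in computed], round(max_gap, 3))
```

Agreement to within about .01 is what one would expect, since the printed means are themselves rounded to two decimals.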

In Table 3 the various groups and combina-
tions of groups are compared with respect to
their mean variabilities and the significance
of the differences between them. A comparison
between groups A and D showed no significant
difference in over-all mean variability, but
there was a significant difference in the inter-
action between groups and sessions. As Table
1 indicates, for group A the variability from
session to session increased, whereas for group
D it decreased. In a comparison of groups B
and C both interaction and over-all mean
variability were found to be significant. The
two neurotic groups combined were also com-
pared to the two nonneurotic groups com-
bined. There was no significant interaction
between groups and sessions, but the over-all
mean variabilities were significantly different,
the neurotic groups being significantly more
variable in their judgments than the non-
neurotic. Finally, the combined trend of the
IGGG neurotic and nonneurotic groups was
compared with the combined trend of the
GGGI neurotic and nonneurotic groups (AD
vs. BC). There was no significant interaction
between groups and sessions, but the over-all
group variabilities were significantly different,
with the IGGG groups being far more variable
than the GGGI groups.


TABLE 3
THE SIGNIFICANCE OF THE DIFFERENCES IN VARI-
ABILITY OF JUDGMENTS OF THE AUTOKINETIC EFFECT,
BASED ON ANALYSIS OF VARIANCE

                                                                   INTERACTION       OVER-ALL MEAN
GROUPS                                           SEQUENCE          BETWEEN GROUPS    DIFFERENCES
COMPARED               TYPE OF S                 OF SESSIONS       AND SESSIONS      BETWEEN GROUPS

A vs. D                Neurotic vs. nonneurotic  IGGG              .05               n.s.
B vs. C                Neurotic vs. nonneurotic  GGGI              .05               .01
A and D vs. B and C    Mixed                     IGGG vs. GGGI     n.s.              [?]
A and B vs. D and C    Neurotic vs. nonneurotic  Mixed             n.s.              .01


DISCUSSION 


On the basis of the present findings we may 
conclude that the neurotic group of patients 
were more variable in their perceptual judg- 
ments than the nonneurotic. This variability 
was less affected by group influences in the 
neurotics than in the nonneurotics. As a matter 
of fact, the tendency for the judgments of the
individuals to converge toward a group norm
as observed by Sherif (5) was not found at all
in the neurotic group. It was found only in the 
case of the nonneurotics who began with an 
individual session and then were exposed to 
three successive group sessions. 

These results confirm our original hypothesis 
that individuals beset by neurotic difficulties 
will be less influenced by group forces than 
those who are not. On an empirical basis, one
might perhaps have predicted that a bimodal
type of distribution of reactions by the neurot- 
ics would have taken place. The reasoning 
would be that the neurotic who is withdrawn 
and isolated from the group and whose contact 
with others is minimal would be little in- 
fluenced by the group, whereas the neurotic 
who is anxious about his acceptance by the 
group would strive vigorously to conform to
group demands. The latter would very likely
be even more influenced by the group norms
than the individuals who are less socially
anxious. Despite our small sample, the results
are unequivocal. The group influence was
either absent or even tended to be negative for
the neurotics, since variability of their judg-
ments tended to increase with a shift to group
sessions. And contrary to what might have
been expected, none of the neurotics could be
identified as social isolates. Every one of them
was to a greater or lesser degree a member of
a 36-patient ward and had established some
group relationships prior to this study. It may
be that these group relationships were more
superficial and casual than is normally the
case, but no basic difference was apparent.
However, an increased self-preoccupation was
clearly observable in these neurotic patients.
This self-preoccupation may have blunted
their responsivity to interpersonal interaction
and thereby reduced the group influence. But
there was no evidence of psychotic withdrawal,
and no observable psychotic symptoms among
the neurotic Ss which could have accounted
for blunting of responsivity.

The demonstration in this study of the isola-
tion of the emotionally disturbed person from
the group leads one to wonder whether or not
any attempts to help these suffering individuals
to integrate more into group activity would
have any fundamental therapeutic effect. Cer-
tainly the important group work of Slavson
(6) with children supports such a notion. It is
also likely that where individual therapy is ef-
fective, improved interpersonal and group rela-
tionships would be discernible. But that
attempts to increase group integration of
emotionally disturbed patients could affect
their underlying difficulties remains to be
proved. It is evident that what is needed are
objective methods for evaluating the effect
upon behavior of various therapeutic pro-
cedures. Studies such as the present one sug-
gest that one method which might have possi-
bilities is that of group integration and inter-
action.


SUMMARY 


A group of ten nonpsychotic psychiatric pa- 
tients in the open ward of a general hospital 
was compared with seven medical patients 
without disabling psychiatric symptoms, with 
respect to the variability of their judgments of 
the autokinetic phenomenon. All Ss were given 
four successive sessions consisting of ten trials 
each. Sessions were varied from individual to 
group, and from group to individual. The re- 
sults indicated that neurotic Ss were con- 
sistently more variable than nonneurotic Ss 
in their judgments and were less affected by 
the group influence. The judgments of one sub- 
group of nonneurotics showed the tendency to 
converge to a group norm found by Sherif (5). 
The present findings further suggested that
individuals who first approach an unstructured
situation in a group will show less difference in
their judgments than when they first approach
a similar situation individually and then in a
group.


REFERENCES


1. Bovard, E. W., Jr. Group structure and perception.
J. abnorm. soc. Psychol., 1951, 46, 398-405.

2. Edwards, A. L. Experimental design in psychological
research. New York: Rinehart, 1950.

3. Jenness, A. The role of discussion in changing
opinion regarding a matter of fact. J. abnorm.
soc. Psychol., 1932, 27, 279-296.

4. Levine, J., & Butler, J. Lecture vs. group decision
in changing behavior. J. appl. Psychol., 1952, 36,
29-33.

5. Sherif, M. A study of some social factors in per-
ception. Arch. Psychol., 1935, No. 187.

6. Slavson, S. R. Introduction to group therapy. New
York: Commonwealth Fund, 1943.


Received January 23, 1953. 


“MANIFEST” ANXIETY, NEUROTIC ANXIETY, AND THE RATE
OF CONDITIONING¹


HUBERT SAMPSON AND DALBIR BINDRA
McGill University 


THE suggestion that the rate of simple
conditioning (conditioned eyelid response,
for example) might be a correlate
of anxiety has aroused considerable
interest among experimental psychologists 
(1, 2, 4, 5, 6, 7). In most of the recent studies, 
designed to reveal a possible relation between 
anxiety and the rate of conditioning, the 
measure of anxiety used is what has come to 
be known as the Taylor Scale of Manifest 
Anxiety. The usual procedure is to place 
individuals who score high on the Taylor 
scale in one group labelled “anxious,” and 
those who score low into a second group 
labelled “nonanxious.” Following this alloca- 
tion, the subjects (Ss) in each group are ob- 
served individually in a conditioning situation, 
and their rates of conditioning (and subse- 
quent extinction) are determined. The data 
are then analyzed to see if the rates of condi- 
tioning and extinction are different for the two 
groups. Because these studies have failed to 
produce unequivocal results, and since they 
are of considerable theoretical interest, we 
consider it important to seek the reasons for 
the discrepancies, and to reconcile the results 
insofar as possible. The present paper is an 
attempt in this direction. 

The Taylor scale (6) purports to measure 
manifest anxiety. It is made up of a number of 
items, selected from the Minnesota Multi- 
phasic Personality Inventory, which were 
judged by clinical psychologists to reflect 
manifest anxiety. The items deal with gen-
erally accepted overt “signs” of anxiety, such
as sweating, restlessness, tenseness, etc., and
the score (i.e., the number of positive re-
sponses) is assumed to indicate the degree of
manifest anxiety. Two forms of the scale have
been in use. The second (“new”) form was de-
veloped from an item analysis of the 65 items
that constituted the original (“old”) scale.
Fifty of the most discriminating items were
selected to make up the new scale (5).

1 This study was supported in part by the National
Research Council of Canada (Grant A-P. 12).

Taylor (6), using the old form of the scale 
and a simple eyelid-conditioning situation, 
found the anxious group to have a more rapid 
conditioning rate than the nonanxious group. 
Spence and Taylor (5) worked with the new
form, and also found significant differences in
amount of conditioning between the two
groups. However, Hilgard, Jones, and Kaplan
(2), in one phase of a similar study, did not
find a significant difference between anxious
and nonanxious Ss. Finally, Bitterman and
Holtzman (1), working with conditioning of
galvanic skin response, also were unable to
demonstrate any difference in rates of condi-
tioning and extinction between groups sepa-
rated on the basis of scores on the Taylor
scale.

As Bitterman and Holtzman (1) have sug-
gested, part of the disagreement in results
may be due to differences in the ranges of
scores used to define the anxious and non-
anxious categories. These ranges are shown in
Table 1. It is evident that Bitterman and
Holtzman’s nonanxious and anxious groups
(ranges: 2 to 11; 12 to 40) were not comparable
to those of Taylor or Hilgard et al. (ranges: 7
and below; 23 and above). But these differ-
ences in scores mean little unless we know
what the scores represent. Therefore, before
attempting to reconcile these findings in terms
of differences in defining scores, it would seem
wise to examine the validity of the Taylor
scale, and to delineate as clearly as possible
the variable or variables represented in it.
This was the aim of the following study,
which is based on the assumption that neu-
rotics (especially anxiety patients) would
score high on any scale that “truly” measures
manifest anxiety.


TABLE 1
DEFINITIONS OF “ANXIOUS” AND “NONANXIOUS”
GROUPS IN TERMS OF RANGES OF SCORES MADE ON
TAYLOR’S SCALE OF MANIFEST ANXIETY

                                                   RANGE
INVESTIGATOR                    NONANXIOUS GROUP             ANXIOUS GROUP                  FORM USED

Taylor (6)                      1-7 (lower 9 percentiles)    24-36 (upper 12 percentiles)   Old scale
Spence & Taylor (5)             9 & below (lower 20%)        24 and above (upper 20%)       New scale
Hilgard, Jones, & Kaplan (2)    7 or below                   23 or above                    Old scale
Bitterman & Holtzman (1)        2-11                         12-40                          Old scale


PROCEDURE 


The new 50-item form of Taylor’s scale was ad- 
ministered to 51 male, hospitalized, neurotic patients 
at a Veterans Hospital in the Montreal area. Not less 
than six nor more than 15 patients were present at any 
one testing session. The patients were assured that 
their answers to the test items would have no bearing 
on their treatment, and in general, every effort was 
made to obtain truthful responses. Subsequently, each 
patient's hospital record was studied, and the most 
prominent symptoms present upon admission, together 
with the diagnosis, were noted. 

The same scale was administered to several large 
groups of college students, both men and women. There 
were 223 normals in all. 


RESULTS 


Neurotics vs. normals. The maximum score
possible on the new Taylor scale is 50. The
scores of the neurotic group ranged from 2 to
45; of normals, from 2 to 43. The ranges over-
lap almost completely. But, as may be ex-
pected, the means of the two groups are sig-
nificantly (p < .001) different from each
other, being roughly 26 (SD = 10.02) for
neurotics, and 16 (SD = 8.05) for normals.
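The size of this difference can be gauged from the summary figures alone. The sketch below computes an unequal-variance (Welch-type) ratio from the reported means, standard deviations, and group sizes (51 neurotics, 223 normals). The paper does not say which test the authors used, so the choice of statistic, and the function name, are assumptions made only for illustration; the means are the rounded values given in the text.

```python
import math

def welch_t(mean1, sd1, n1, mean2, sd2, n2):
    """Difference between two means divided by its unpooled standard error."""
    standard_error = math.sqrt(sd1 ** 2 / n1 + sd2 ** 2 / n2)
    return (mean1 - mean2) / standard_error

# Reported summary statistics: neurotics first, then normals.
t_ratio = welch_t(26, 10.02, 51, 16, 8.05, 223)
print(round(t_ratio, 2))
```

A ratio of this magnitude, with samples of this size, is in keeping with the reported significance level.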

Anxiety patients vs. other neurotics. Anxiety
was a prominent symptom in 27 of the 51
patients. These 27 “anxiety neurotics” were
compared with the remaining 24 “nonanxiety
neurotics” (i.e., those patients for whom
anxiety was not a prominent symptom).
A biserial correlation between these
ratings and scores on the Taylor
scale showed only a chance relation (r_bis =
.003) between the two variables. This
finding is consistent with that of Bitterman
and Holtzman (1), who report no significant
relation between scores on the Taylor scale
and clinical ratings of anxiety.




However, a closer examination of the dis-
tribution of scores made by the neurotics re-
vealed a tendency for the nonanxiety neurotics
to fall at the two extremes of the distribution,
and for the anxiety patients to occupy the
middle range (Q1 to Q3) of the distribution.
To check the statistical significance of this
tendency, all patients having a score of below
19 (i.e., below Q1) and above 33 (i.e., above
Q3) were placed in one group, and the number
of anxiety neurotics in this group was com-
pared with the number of anxiety neurotics
among the patients who scored between 19 and
33 (i.e., Q1 to Q3). The chi-square test showed
a statistically significant (p < .02) tendency
for the anxiety patients to fall within the
middle range of the distribution of the scores
for neurotics. The symptom other than anxiety
symptoms that was most frequently mentioned
in the patients whose scores fell at the two
extremes of the distribution (below Q1 and
above Q3) was depression.
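The comparison described above amounts to a chi-square test on a 2 x 2 table (middle range vs. extremes, anxiety vs. nonanxiety neurotics). The paper reports only the marginal totals (27 anxiety and 24 nonanxiety neurotics, 51 in all) and the significance level, not the cell counts. The counts in the sketch below are therefore hypothetical, chosen only to respect those totals, and the function name is illustrative.

```python
def chi_square_2x2(a, b, c, d):
    """Chi-square (1 df, no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    numerator = n * (a * d - b * c) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator

# HYPOTHETICAL cell counts, not taken from the paper:
#                      anxiety   nonanxiety
# middle (19 to 33)       18          8
# extremes                 9         16
chi_square = chi_square_2x2(18, 8, 9, 16)
print(round(chi_square, 3))
```

With one degree of freedom, the obtained value would be compared with the tabled chi-square value for the chosen significance level.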


DISCUSSION


1. What exactly do the scores on the Taylor 
Scale of Manifest Anxiety represent? Taylor 
(6), as well as Holtzman, Calvin, and Bitter- 
man (3), assume that, so far as the overt 
signs of manifest anxiety are concerned, the 
only difference between neurotics and normals 
is one of degree. Inasmuch as the Taylor 
scale differentiates between normals (mean = 
16) and neurotics (mean = 26), it may appear 
to be a reasonably valid indicator of anxiety. 
But, according to our results, this validity 
does not hold through the whole range of 
scores. The data on the hospitalized neurotics 
show clearly that extremely high scores are no 
longer associated with manifest anxiety, but 
mainly with depression.” Most of the anxiety 
patients fall within Qı and Qs (i.e., from 19 to 
33) of the neurotic group. On the basis of 
these findings, we are inclined to believe that 
Whereas the scores of from 19 to 33 on the 
Taylor scale are associated with manifest 
anxiety, the scores outside this range do not 
represent manifest anxiety. It would seem 
that a person making a score of between 
roughly 19 and 33 is most likely to be classified 

2The impression of clinical psychologists and 
psychiatrists that anxiety and depression are closely 
allied conditions is irrelevant to this discussion, for we 
are concerned only with the correlates of symptomatic, 
manifest anxiety. 


258 


as anxious by clinical criteria (though not 
necessarily as neurotic). 

This view is supported by the fact that the 
total range of scores on the Taylor scale is not 
related to clinical criteria of anxiety (1), and 
is contrary to Taylor’s assumption (6) that 
different scores represent different degrees 
of anxiety. We suggest that the Taylor scale 
does not measure the degree of manifest 
anxiety, but can be used only to separate 
those individuals who are likely to be classified 
as anxious by clinical criteria from those who 
are not. Specifically, an individual who 
scores within the range of 19 to 33 shows those 
behavioral signs of manifest anxiety which 
make it most probable that clinically he will 
be judged as anxious rather than as non- 
anxious. 
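The proposed use of the scale is thus a band rule rather than a graded measure. A minimal sketch of that rule follows; the function and constant names are illustrative, and treating the limits 19 and 33 as inclusive is an assumption based on the description of the middle range as scores "between 19 and 33."

```python
# Limits of the middle range (Q1 to Q3) of the neurotic group's scores.
ANXIETY_BAND = (19, 33)

def likely_judged_anxious(score, band=ANXIETY_BAND):
    """True when a new-form Taylor score falls within the band (limits inclusive)."""
    low, high = band
    return low <= score <= high

# Scores of 9 and 24 are the cut-offs used by Spence and Taylor; 40 is the
# upper limit of Bitterman and Holtzman's anxious group.
for score in (9, 24, 40):
    print(score, likely_judged_anxious(score))
```

Under this rule a very high score is no more indicative of manifest anxiety than a very low one.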

2. We turn now to the contradiction in the 
results concerning the relation between con- 
ditioning and manifest anxiety. Spence and 
Taylor (5) used the new Taylor scale, and 
selected for their anxious group individuals 
who had a score of 24 or above. Since they 
worked with normal Ss, and the mean score 
for the normals is roughly 16, it is a reasonable 
guess that very few, if any, of their Ss had 
scores above 33. That is to say, almost all 
Ss in the anxious group used by Spence and 
Taylor fell within the range which, according 
to our results, is most closely associated with 
manifest anxiety. Their nonanxious group con- 
sisted of Ss with scores of nine or below, that is, 
outside the 19 to 33 range. Thus the two groups 
were well separated with respect to manifest 
anxiety. Even though we do not have the com- 
parable scores for the old scale, it appears 
certain that the anxious and the nonanxious 
groups in Taylor's original study (6) were 
equally well separated (anxious: 24-36; non- 
anxious: 1-7). Thus in the two studies where 
the anxious and the nonanxious groups were 
well separated with respect to manifest 
anxiety, the differences between the two 
groups in susceptibility to conditioning were 
also significant. 

In Bitterman and Holtzman's study (1),
on the other hand, the scores of the anxious
group ranged from 12 to 40, violating, at 
both ends, the limits (19 and 33) within which 
scores on the Taylor scale are associated with 
manifest anxiety. This may account for their 
failure to find a significant difference between 
the two groups. The finding that is not con-
sistent with this interpretation is that of Hil-
gard and his collaborators (2). They selected
20 Ss and divided them into two well-sepa-
rated groups (nonanxious: 7 and below;
anxious: 23 and above). But they found that
in terms of simple conditioning there was no
difference between the two groups. The dis-
crepancy between this result and those of
Taylor is difficult to explain. Hilgard et al.
attribute it to differences in the experimental
arrangements. Equally likely, it may be be-
cause of the small number of cases used in
their study.

Of special relevance to this discussion is the
series of studies in which rate of conditioning
was related directly to clinical ratings. Welch
and collaborators (4, 7) used normal, neurotic,
and psychotic groups and were able to demon-
strate differences in conditioning rate within
and between these groups. Bitterman and
Holtzman (1) also found a difference in the
rate of conditioning (and extinction) when
their groups were divided in terms of clinical
criteria of anxiety. This suggests that the
rate of conditioning itself may be a more
sensitive indicator of anxiety than are scores
on the Taylor scale.


SUMMARY 


The validity of the Taylor Scale of Mani-
fest Anxiety was examined with a view to
reconciling the contradictory results of the
studies of the relation between anxiety and
the rate of conditioning. The scale was admin-
istered to 51 hospitalized male neurotics and
to 223 college students. The results indicate
that different scores on the scale do not repre-
sent different degrees of manifest anxiety,
though the scores within a limited range (19
to 33) are more likely to be associated with the
clinical diagnosis of “anxious” than are
scores above and below this range. This
interpretation helps to reconcile the contra-
dictory earlier findings concerning the relation
between conditioning and anxiety. It is sug-
gested that the rate of conditioning itself may
be more closely related to differences in mani-
fest anxiety than are scores on the Taylor
scale.

REFERENCES

1. BITTERMAN, M. E., & HOLTZMAN, W. H. Conditioning
and extinction of the galvanic skin response
as a function of anxiety. J. abnorm. soc. Psychol.,
1952, 47, 615-623.

2. HILGARD, E. R., JONES, L. V., & KAPLAN, S. J.
Conditioned discrimination as related to anxiety.
J. exp. Psychol., 1951, 42, 94-99.

3. HOLTZMAN, W. H., CALVIN, A. D., & BITTERMAN,
M. E. New evidence for the validity of Taylor's
Manifest Anxiety Scale. J. abnorm. soc. Psychol.,
1952, 47, 853-854.

4. SCHIFF, E., DOUGAN, C., & WELCH, L. The conditioned
PGR and the EEG as indicators of anxiety.
J. abnorm. soc. Psychol., 1949, 44, 549-552.

5. SPENCE, K. W., & TAYLOR, JANET. Anxiety and the
strength of the UCS as determiners of the amount
of eyelid conditioning. J. exp. Psychol., 1951, 42,
183-188.

6. TAYLOR, JANET. The relationship of anxiety to the
conditioned eyelid response. J. exp. Psychol., 1951,
41, 81-92.

7. WELCH, L., & KUBIS, J. Conditioned PGR (psychogalvanic
response) in states of pathological anxiety.
J. nerv. ment. Dis., 1947, 105, 372-381.

Received January 26, 1953.


PREDICTING HOSPITALIZATION OF PSYCHIATRIC OUTPATIENTS 


DONALD R. PETERSON 
University of Illinois¹


THERE is a definite clinical need for an
instrument which can aid in isolating
from the heterogeneous population
“psychiatric outpatients,” those cases who
not only fail to profit from therapy but who 
become even more seriously disturbed as 
treatment continues. One of the patients in 
the present study had killed his wife. Two had 
attempted murder. Six had attempted suicide. 
Treating such cases on an outpatient basis 
seems rather imprudent, and means for pre- 
dicting their behavior should permit wiser 
disposition. More generally, increased accuracy 
in predicting such an event as hospitalization 
should lead to improvement in the efficiency 
of outpatient treatment from the point of 
view of the patient, his therapist, and society 
as a whole. 

This study constitutes an attempt to meet
the need for greater predictive accuracy, but 
it was also designed to provide a partial 
definition of the concept, "latent psychiatric 
illness." It is ordinarily assumed that any 
patient who develops a personality disorder 
severe enough to require institutionalization 
has, before the manifest outbreak of symptoms, 
certain predispositions to illness. Meaningful 
definition of these predispositional tendencies 
is dependent upon measurement of behavior 
during the period of "latency," and it is this 
measurement which constitutes the second aim 
of the investigation. 

A survey of research on prognosis reveals 
only one empirical study (4) where hospitaliza- 
tion was employed as a criterion of psychiatric 
outcome, and for it the sample was drawn from 
a population not already in the hospital. 
While the results of that study suggest the 
possibility of predicting hospitalization 
through use of a psychometric test (the 

1 Published with the permission of the Chief Medical 
Director, Department of Medicine and Surgery, 
Veterans Administration, who assumes no responsi- 
bility for the opinions expressed or the conclusions 
drawn by the author. The paper is an abstract of a 
Ph.D. thesis done while the author was at the Uni- 
versity of Minnesota. The constant aid and encourage- 
ment of Dr. Paul E. Meehl is gratefully acknowledged. 




Cornell Selectee Index), general inference is
limited by the size of the sample (N = 10)
and the fact that the procedures used are not
commonly employed in present clinical practice.
The literature on prognosis from psychometric
data has recently been surveyed by
Windle (23). The studies examined in that 
review have offered hypotheses for test in this 
investigation; they have served as a fund of 
possible psychometric predictors. A list of 
possible nonpsychometric factors was com- 
piled by examining the studies of Wittman 
(24), Dunham and Meltzer (3), Clark (2), 
Jenkins (9), Kant (11), Fisher and Hayes (5), 
Chase and Silverman (1), Mayer-Gross and 
Moore (16), and Lewis (14). 


METHOD 


Plan of investigation. In most respects, the pro- 
cedure employed in this study follows that outlined by 
Horst (8) for general prediction problems. The following
steps have been carried out:

1. A criterion was defined as admission to a psychiatric
hospital following psychological testing and
two or more interviews by staff members at a VA
mental hygiene clinic.

2. Every case in the clinic file was examined, and 
all those who met the criterion were placed in the
“hospitalized” category. Certain nonpsychometric
data and the results of the Wechsler-Bellevue, MMPI,
and Rorschach were recorded for each member of the
sample. Hereinafter, these hospitalized patients shall be
referred to as Group I (N = 108).

3. A sample of nonhospitalized patients at the
clinic was gathered by going through the psychological
test file and selecting every tenth card which showed
that the patient had had all three of the tests considered
in the study. All subjects (Ss) who had been hospitalized
were left in Group I. Again, those cases who were
interviewed less than two times were excluded, but no
further restrictions were made. Results of the tests and
nonpsychometric data were recorded for the remaining
cases, hereinafter referred to as Group II (N = 114).

4. In accordance with a double cross-validation
design (12, 18), each group was split into two subsamples
by alphabetizing the data sheets and placing
them alternately into two piles. Four groups were
consequently formed, specified as follows: Group I-A,
hospitalized (N = 54); Group II-A, not hospitalized
(N = 57); Group I-B, hospitalized (N = 54); Group
II-B, not hospitalized (N = 57).

5. Next, a sort of gross item analysis was performed.
One set of significantly differentiating signs was isolated
by comparing Group I-A with Group II-A (the
combination of these two groups shall be referred to as 
Sample A). Similarly, a set of discriminating signs was 
derived by comparing Group I-B with Group II-B 
(this combination shall be referred to as Sample B). 

6. For each sample, the following indices were made 
up: (a) a device consisting only of nonpsychometric 
signs; (b) a device consisting only of signs from the 
MMPI; (c) a device comprising both nonpsychometric 
and MMPI signs; (d) a comprehensive device, including 
all signs which discriminated between groups. 

7. Each index derived from Sample A was cross- 
validated on Sample B; conversely, those derived from 
Sample B were cross-validated on Sample A. This pro- 
cedure resulted in an estimate of the efficiency which 
the lists of signs would probably have if employed with 
new samples. 

8. Groups of false positives and false negatives were 
selected and case histories carefully studied in an effort 
to find ways of improving prediction. 

9. Finally, indices combining data from both samples 
were constructed by including only signs which differentiated
at the .05 level of significance for both samples.²
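
The splitting procedure in step 4 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the case identifiers are hypothetical placeholders for the alphabetized data sheets, and the group sizes (108 hospitalized, 114 nonhospitalized) are taken from the subsample counts reported in the text.

```python
from typing import Sequence


def alternate_split(names: Sequence[str]) -> tuple[list[str], list[str]]:
    """Alphabetize the identifiers and deal them alternately into two piles."""
    ordered = sorted(names, key=str.lower)
    return ordered[0::2], ordered[1::2]


# Hypothetical identifiers standing in for the actual data sheets.
group_i = [f"hosp_{i:03d}" for i in range(108)]      # hospitalized
group_ii = [f"nonhosp_{i:03d}" for i in range(114)]  # not hospitalized

i_a, i_b = alternate_split(group_i)
ii_a, ii_b = alternate_split(group_ii)

# Signs derived on Sample A are checked on Sample B, and the reverse.
sample_a = i_a + ii_a
sample_b = i_b + ii_b
print(len(i_a), len(i_b), len(ii_a), len(ii_b))
```

Dealing an alphabetized list alternately is a simple stand-in for random assignment; it assumes that surname order is unrelated to the outcome.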

Subjects and procedure. All Ss had been patients at 
the Veterans Administration Mental Hygiene Clinic, 
St. Paul, Minnesota, at some time during the period 
from 1947 through 1951. The clinic is available only to 
veterans with a “service-connected” psychiatric disability.
“Service connection,” in this sense, is a rather
broad concept which includes origin of a neuropsy- 
chiatric disorder, aggravation of such a disorder, or 
“emotional” contribution to some nonpsychiatric disa- 
bility. Nearly all the patients are male, white, and live 
in the vicinity of Minneapolis and St. Paul. Veterans of 
both world wars are included in the sample, although by 
far the majority served only in the second one. Almost 
without exception they had had psychotherapy, 
mainly with psychiatric residents, less often with clinical 
psychology trainees and social workers. All therapy
was supervised by psychiatric and psychological
consultants.

Most of the psychometric tests had been adminis- 
tered, scored, and interpreted by trainees in clinical 
psychology, but scoring of the Wechsler-Bellevue and
Rorschach was checked in an unusually careful way by 
the chief psychologist. For the Rorschach, the scoring 
system employed most nearly approximates that of 
Hertz (7) for location, form, and specification of popular 
responses. Determinants were scored more like the
Klopfer system (13) than any other. 

The item analysis. The statistical procedure most
extensively used was chi square. Calculation was done
in accordance with the method suggested by McNemar
(15, formula 87, p. 207) for 2 × k tables. Wherever any
expected frequency in a 2 × 2 table fell below 10, a
formula incorporating Yates's correction (15, formula
852, p. 207) was applied. Cutting points were established
as rationally as available literature and common
sense would permit before the tests were calculated.
For certain unimodal continuous distributions, the
t test was employed (10, formula 5.03, p. 74). Where
heterogeneity of population variance seemed likely, the
Cochran-Cox approximation method was used (10, pp.
74-75). Generally speaking, if no population difference
was suggested by t, the variable was henceforth ignored.
If, however, the t test did suggest a reliable difference
between means, chi square was also used, with a cutting
point approximately midway between the two sample
means. Only if the latter test indicated significant
discrepancy was the sign included as a member of a
predictor set.

2 Katzell (12) has suggested using less stringent
standards for selection of items in the final scale. His
suggestion was employed, but the resulting indices
were found to have only slightly greater discriminating
power than the shorter ones discussed here. Since the
shorter indices work nearly as well, are less cumbersome,
and less subject to shrinkage, they alone are reported.
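
The chi-square test described under “The item analysis” can be illustrated with a short Python sketch for a single 2 × 2 table. It uses the standard textbook formula and applies Yates's correction when any expected frequency falls below 10, as the text specifies. It is not a transcription of McNemar's numbered formulas, and the cell counts in the example are invented for illustration only.

```python
import math


def chi_square_2x2(a, b, c, d, yates_below=10.0):
    """Chi square (1 df) for the table [[a, b], [c, d]].

    Rows might be hospitalized / not hospitalized and columns
    sign present / sign absent. Yates's correction is applied when
    any expected cell frequency is below `yates_below`.
    """
    n = a + b + c + d
    rows = (a + b, c + d)
    cols = (a + c, b + d)
    expected = [r * k / n for r in rows for k in cols]
    use_yates = min(expected) < yates_below

    diff = abs(a * d - b * c)
    if use_yates:
        diff = max(diff - n / 2.0, 0.0)

    chi2 = n * diff ** 2 / (rows[0] * rows[1] * cols[0] * cols[1])
    # Upper-tail probability of chi square with 1 degree of freedom.
    p = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p, use_yates


# Invented counts: 30 of 54 hospitalized and 15 of 57 nonhospitalized
# cases show a hypothetical sign.
chi2, p, corrected = chi_square_2x2(30, 24, 15, 42)
print(round(chi2, 2), p < 0.05, corrected)
```

A sign would enter a predictor set only if `p` fell below the chosen significance level in the derivation sample.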

Statistical tests were performed with respect to the 
following nonpsychometric variables: age, present job 
status (employed or unemployed), education, job 
stability (number of jobs held since discharge from the 
service), occupational level, marital status, number of 
children (if married), place of residence (if single, 
widowed, or divorced), parents (living or dead), siblings 
(present or absent), birth order, evidence of broken 
home (arbitrarily defined as dissolution of the home by 
separation of the parents, or death or institutionali- 
zation of one or both parents before the patient was 15), 
number of previous hospitalizations, mention of drink- 
ing as a problem in the intake interview, impressions of 
patient behavior during testing (psychologist's de- 
scription), and diagnosis (service-connected disability, 
beginning diagnosis at the clinic, and closing diagnosis). 

For the Wechsler-Bellevue, no type of complex 
pattern analysis was attempted. Reviews by Rabin 
(19), Rabin and Guertin (20), and Schofield (22) 
suggested that such analyses were likely to be un- 
profitable. In consequence, only the full scale IQ and 
the relationship between verbal and performance IQ's 
were examined. 

For the MMPI, the literature on pattern analysis 
(6, 17, 21) suggested several possible discriminators. 
The relationship between the “neurotic” end of the 
profile (Hs, D, and Hy) and the “psychotic” end (here 
defined as including Pa, Sc, and Ma), the relative 
elevation of Sc and Pt, the height of D in relation to 
Hs and Hy, the difference score calculated by sub- 
tracting F from K (T scores), the total number of 
scores over 70, and gross elevations on F and Pd were 
all tested by chi-square methods. 

Tests were made for each of the following Rorschach
signs and scores: R, emphasis on W, D, Dr, and S, F%,
F+%, A%, W:M, (H + A):(Hd + Ad), M:ΣC, ΣC,
FC, CF and C, FC:(CF + C), M, M+:M−, M:FM,
Fm, (VIII-X)%, Fc, Fk, FK, FC′, cloud, fire, blood, or
smoke content, sex, food, positional responses, and
abstractions. Both original and additional responses
were counted for the last two signs. For all others only
the original responses were considered.


RESULTS³

Formation of indices. In all, 56 variables
were examined by chi-square techniques. Of
these, nine appeared to discriminate between
hospitalized and nonhospitalized patients in
Sample A; 11 appeared effective in Sample B.
Overlap occurred for seven variables. All
signs were weighted in accordance with observed
frequency trends, and the indices
shown in Table 1 were made up.

3 The basic data for those variables which yielded
significant differentiation between hospitalized and
nonhospitalized patients in either Sample A or Sample
B have been deposited with the ADI. Order Document
No. 4053 from ADI Auxiliary Publications Project,
Photoduplication Service, Library of Congress, Washington
25, D. C., remitting in advance $1.75 for 35 mm.
microfilm or $2.50 for 6 by 8 in. photocopies. Make
checks payable to Chief, Photoduplication Service,
Library of Congress. A microfilm copy of the complete
thesis can be obtained by ordering Publication No. 4348
from University Microfilms, Ann Arbor, Michigan.

TABLE 1
PREDICTIVE INDICES DERIVED FROM CHI-SQUARE ANALYSIS

                          SAMPLE IN WHICH
                          DISCRIMINATION
VARIABLE                  OCCURRED          CLASS                               WEIGHT

Nonpsychometric
  Diagnosis               A and B           Psychosis                              2
                                            Psychoneurosis, mixed or
                                              unclassified, depressive
                                              reaction, or obsessive-
                                              compulsive reaction                  1
                                            Any other diagnosis                    0
  Previous                A and B           Two or more                            2
    hospitalization                         One                                    1
                                            None                                   0
  Marital status          A and B           Single                                 2
                                            Divorced or widowed                    1
                                            Married                                0
  Drinking as a           A only            Mentioned in intake interview          1
    problem                                 Not mentioned                          0
  Employment status       B only            Unemployed                             1
                                            Employed                               0
MMPI
  “Neurotic” scores:      A and B           Pa or Sc or Ma > Hs or D or Hy         1
    “psychotic” scores                      Pa and Sc and Ma < Hs and D and Hy     0
  Sc:Pt                   A and B           Sc > Pt                                1
                                            Sc < Pt                                0
  F                       A and B           > 60                                   1
                                            ≤ 60                                   0
  Pd                      A and B           > 65                                   1
                                            ≤ 65                                   0
  D:Hs and Hy             B only            D > Hs or Hy                           1
                                            D < Hs and Hy                          0
  Number of scores        B only            Four or more                           1
    over 70                                 Less than four                         0
Wechsler-Bellevue
  Full scale IQ           A only            < 105                                  1
                                            ≥ 105                                  0
Rorschach
  A%                      B only            < 35                                   1
                                            ≥ 30                                   0

Estimates of predictive and discriminatory 
efficiency. On the basis of the signs listed in 
Table 1, every patient was given four scores, 
one based solely on nonpsychometric data, 
one based only on the MMPI, one obtained by 
adding those two scores, and one based on all 
signs which seemed to show differentiating 
power. Cutting scores were established by 
equalizing false positives with false negatives 
in each derivation sample, and cross-validation 
was carried out by examining the percentage 
of correctly classified cases in the samples 
from which the indices had not been derived. 
Thus, indices derived from Sample A were 
cross-validated on Sample B; those derived
from Sample B were tested with Sample A.
Final indices were formed by considering only 
those signs which discriminated between hos- 
pitalized and nonhospitalized patients in 
both samples. For these indices, percentages of 
accurate classification over the entire sample 
were computed. Results are presented in
Table 2. 
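
The way a cutting score is fixed on one half-sample and then checked on the other can be sketched as follows. This is a minimal illustration, assuming that “equalizing false positives with false negatives” means choosing the cut that minimizes the absolute difference between the two error counts. The scores and outcomes below are invented, not data from the study.

```python
def classify(scores, cut):
    """Predict hospitalization for every score strictly above the cut."""
    return [s > cut for s in scores]


def choose_cut(scores, hospitalized):
    """Cutting score that most nearly equalizes false positives and
    false negatives in the derivation sample."""
    candidates = [min(scores) - 1] + sorted(set(scores))
    best_cut, best_gap = None, None
    for cut in candidates:
        preds = classify(scores, cut)
        fp = sum(p and not h for p, h in zip(preds, hospitalized))
        fn = sum(h and not p for p, h in zip(preds, hospitalized))
        gap = abs(fp - fn)
        if best_gap is None or gap < best_gap:
            best_cut, best_gap = cut, gap
    return best_cut


def accuracy(scores, hospitalized, cut):
    """Percentage of cases correctly classified at the given cut."""
    preds = classify(scores, cut)
    hits = sum(p == h for p, h in zip(preds, hospitalized))
    return 100.0 * hits / len(scores)


# Invented index scores and outcomes for two small half-samples.
scores_a = [0, 1, 1, 2, 3, 3, 4, 5]
hosp_a = [False, False, False, True, False, True, True, True]
scores_b = [0, 2, 2, 3, 1, 4, 5, 3]
hosp_b = [False, False, True, True, False, True, True, False]

cut_a = choose_cut(scores_a, hosp_a)            # derived on Sample A
cross_val = accuracy(scores_b, hosp_b, cut_a)   # checked on Sample B
print(cut_a, cross_val)
```

Running the same two steps with the samples exchanged gives the second half of the double cross-validation design.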

TABLE 2
PERCENTAGE OF CASES CORRECTLY CLASSIFIED BY THE VARIOUS INDICES

                                                ACCURACY       ACCURACY
                                                PERCENTAGE,    PERCENTAGE,
                                                DERIVATION     CROSS-
INDEX                                           SAMPLE         VALIDATION

Nonpsychometric index derived from Sample A     70.37          [illegible]
Nonpsychometric index derived from Sample B     66.04          [illegible]
Final nonpsychometric index                     64.95 (entire sample)
MMPI index derived from Sample A                67.62          66.61
MMPI index derived from Sample B                71.00          68.57
Final MMPI index                                67.32 (entire sample)
Combined index (nonpsychometric plus MMPI)
  derived from Sample A                         71.57          73.00
Combined index derived from Sample B            73.20          73.74
Final combined index                            71.36 (entire sample)
Comprehensive index (all signs)
  derived from Sample A                         73.68          75.0
Comprehensive index derived from Sample B       73.68          [illegible]

Investigation of false negatives and false
positives. In an effort to isolate additional
variables which might aid prediction and to
account for the predictive failures that occurred,
an intensive case study was undertaken
for certain patients who were incorrectly
classified by all devices. A set of false
negatives was selected by searching Group I 
(i.e., the total hospitalized sample, consisting
of Groups I-A and I-B) for cases in which 
scores on all four predictive devices were be- 
low the critical level for predicting hospitaliza- 
tion. Eleven individuals who fit that descrip- 
tion were found. The nonhospitalized group 
was similarly examined for patients who were 
inaccurately categorized by all statistical 
methods. Twelve such cases were isolated. 

Telephone conversations with ward secre- 
taries revealed that four of the apparent 
"misses" were spurious. Prediction for them 
would actually have been correct, but did not 
appear so because of clerical errors. Two of the 
false negatives had never been under neuro- 
psychiatric treatment. They had gone to the 
hospital, but not to the psychiatric wards, 
even though the records indicated that they 
had. Conversely, two of the apparent false 
positives were found to have received treat- 
ment in the psychiatric section of the hospital 
after examination and therapy at the clinic, 
but notation of the event was not found in the 
case history (in one instance because the 
author missed it). 

Prediction for the remaining group was 
genuinely wrong. Careful rereading of the 
material in the clinical file and examination of 
all available collateral data suggested that incorrect
prediction resulted variously from
patently faulty diagnosis, the operation of
contingency factors, and omission of individually
important variables. Inspection of all
the records yielded only one quantifiable factor,
out of several examined, which might
profitably have been included as a general
predictor. Five of the 10 married patients in
the hospitalized group were expectant fathers.
Only one of the five married veterans in the
set of false positives was in that situation.
One cannot expect that addition of a “pregnant
wife” sign will increase predictive accuracy
to any great extent, but it may offer
enough promise to warrant further examination.

Analysis of the factors which led to incorrect
prediction, however, clearly indicates the
need for more careful diagnosis and inclusion
of such nebulous but important factors as the
relationship between patient and therapist if 
prediction is to be materially improved. 

Use and interpretation of final indices. By
considering those signs in Table 1 which were 
found to be effective discriminators in both 
half-samples, three scores can be assigned to 
any case: one can be derived from his stand- 
ing with respect to the three nonpsychometric 
variables, one can be obtained through exam- 
ination of his MMPI profile, and one can be 
obtained by adding those two numbers. 
Scores above two for the nonpsychometric 
index, above one for the MMPI index, and 
above three for the combined index suggest 
the likelihood of future hospitalization. Use 
of these cutting scores should result in ap- 
proximately equal percentages of accurate 
prediction in the positive and negative direc- 
tions. For a large sample of new cases, however, 
the absolute number of false positives will
probably be considerably larger than the 
number of false negatives, and cutting scores 
can be altered in terms of the demands of the 
particular situation where prediction is re- 
quired. 
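
The scoring rule just described can be sketched in Python. The weights and cutting scores follow Table 1 and the paragraph above, but several details are assumptions made for illustration: the category labels and the `Case` structure are hypothetical, and the “neurotic” versus “psychotic” sign is read here as “any of Pa, Sc, or Ma exceeds any of Hs, D, or Hy.”

```python
from dataclasses import dataclass

# Weights from Table 1 for signs that discriminated in both samples.
# "weight_one_diagnosis" is a hypothetical label for the diagnoses
# that Table 1 assigns a weight of 1.
DIAGNOSIS_WEIGHT = {"psychosis": 2, "weight_one_diagnosis": 1, "other": 0}
MARITAL_WEIGHT = {"single": 2, "divorced": 1, "widowed": 1, "married": 0}


@dataclass
class Case:
    diagnosis: str
    previous_hospitalizations: int
    marital_status: str
    t: dict  # MMPI T scores keyed by scale abbreviation


def nonpsychometric_score(case):
    return (DIAGNOSIS_WEIGHT[case.diagnosis]
            + min(case.previous_hospitalizations, 2)
            + MARITAL_WEIGHT[case.marital_status])


def mmpi_score(case):
    t = case.t
    signs = [
        # Assumed reading of the "neurotic" : "psychotic" sign.
        max(t["Pa"], t["Sc"], t["Ma"]) > min(t["Hs"], t["D"], t["Hy"]),
        t["Sc"] > t["Pt"],
        t["F"] > 60,
        t["Pd"] > 65,
    ]
    return sum(signs)


def predict(case):
    """Return (score, hospitalization predicted) for each final index."""
    nonpsych = nonpsychometric_score(case)
    mmpi = mmpi_score(case)
    combined = nonpsych + mmpi
    return {
        "nonpsychometric": (nonpsych, nonpsych > 2),
        "mmpi": (mmpi, mmpi > 1),
        "combined": (combined, combined > 3),
    }


# Invented example case.
example = Case(
    diagnosis="psychosis",
    previous_hospitalizations=1,
    marital_status="single",
    t={"Hs": 60, "D": 72, "Hy": 58, "Pd": 68, "Pa": 66,
       "Pt": 70, "Sc": 75, "Ma": 55, "F": 64},
)
print(predict(example))
```

Raising or lowering the three thresholds in `predict` is how the cutting scores would be altered to suit a particular situation.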

Cautious interpretation is recommended. 
The indices are “final” only in the context of 
this study. The devices, as they stand, have 
not actually been used to predict, although the 
items which make them up have been cross- 
validated with more than usual rigor. Validity 
needs to be checked on other groups, but con- 
sidering the almost infinitesimal shrinkage 
which occurred for each half-sample instru- 
ment on cross-validation, it seems reasonable 
to assume that they will hold up quite well 
with other cases drawn from the same popula- 
tion. 

On the “average,” the person who not only 
fails to profit from outpatient psychotherapy, 
but who actually becomes worse, is single, has 
been previously hospitalized, is likely to have 
a “psychotic” diagnosis, and has a seriously
disturbed MMPI profile. From these behavior
patterns, we ordinarily infer strong
tension and anxiety, without a healthy, or 
even a neurotic, complement of defenses. The 
mechanisms employed can more accurately be 
described as psychotic; they include with- 
drawal from the world of objective affairs, 
various distortions in perception of the en- 
vironment, and emotional frigidity. 

FIG. 1. MEAN MMPI PROFILES FOR THE ENTIRE SAMPLE OF HOSPITALIZED AND NONHOSPITALIZED PATIENTS (---- Hospitalized; - - - Not Hospitalized)

In all important respects, the pre-illness
MMPI profile conforms to the characteristic
pattern found in manifest psychosis (17). We
can probably assume that, at the time these 
patients were tested, psychotic behaviors were 
not clinically apparent in full form; that, if 
they had been, the patients who showed them 
would have been sent to the hospital immedi- 
ately. If this is a safe assumption, predisposi- 
tional tendencies to severe psychiatric illness 
can be defined in terms of the following test 
conditions: (a) clinical observation reveals no 
obvious symptoms severe enough to warrant 
institutionalization, but tendencies in that 
direction may still be present, as evidenced by 
increased likelihood of a “psychotic” diagnosis; 
(b) examination of biographical data shows a
history of adaptive failure, as evidenced by
prior hospitalization and single marital status;
(c) scrutiny of the MMPI record reveals a
profile that is different in no essential respect
from that found in manifest psychosis.

The heterogeneity of the sample, however,
reduces the meaning of interpretations of this
sort, and studies of more specifically defined
groups are badly needed. To this end, an investigation
of a group of latent schizophrenics
is now under way.


SUMMARY 


This study constitutes an attempt to devise 
simple, widely applicable, and maximally 
precise indices to aid in predicting hospitaliza- 
tion of psychiatric outpatients, as well as to 
formulate a partial operational definition of 
the concept, “latent psychiatric illness.” 

Data were gathered at a VA mental hygiene 
clinic for all patients who underwent psycholog- 
ical examination, were seen for at least two 
interviews by clinic staff members, and later 
were admitted to psychiatric hospitals. In 
all, 108 such patients were found. They were 
compared with 114 nonhospitalized patients 
at the same clinic in terms of certain non- 
psychometric data and the results of three 
psychological tests, the Wechsler-Bellevue, 
the MMPI, and the Rorschach. To minimize 
chance effects, samples were split in accordance 
with a double cross-validation design. Indices 
were derived separately for each half-sample 
and cross-validated on the other. Percentages 
of accurate classification ranged from 63 for 
one of the nonpsychometric devices to 75 
for an index composed of four nonpsycho- 
metric signs, four MMPI signs, and one sign 
from the Wechsler-Bellevue. Generally, predic- 
tion made on the basis of nontest data alone 
or from the MMPI alone would have been 
correct about two-thirds of the time; predictions
based on indices comprising both nonpsychometric
and MMPI signs would have
been correct for a little less than three-fourths 
of the cases. 

Individual false positives and false nega- 
tives were then selected from the hospitalized 
and nonhospitalized groups. Such patients, 
for whom prediction was wrong in terms of all 
devices, were submitted to intensive case 
study. Clerical errors regarding hospitalization
had been made in some of the individual
records; accurate prediction would in fact
have been made for four of the 23 apparent 
misses. For the remaining 19, contingency 
factors, inaccurate diagnosis, and omission of 
uniquely important variables seemed to ac- 
count for most of the predictive failures. 

Predictors derived from the subsamples 
were combined into single indices by including 
only those signs which discriminated at the 
.05 level of significance between hospitalized
and nonhospitalized patients in both half-samples.
Three forms of index were derived,
one consisting only of nonpsychometric variables,
one consisting only of MMPI signs, and
one comprising both nontest and MMPI factors.
Choice will depend on the amount of
data available.

The findings contain a partial basis for 
definition of predisposition to severe psychiat- 
ric illness. The mean MMPI profile of the 
subsequently hospitalized patients is that of 
manifest, not latent, psychosis, even though 
such deviations could not have been flagrantly 
obvious in other clinical behavior at the time 
of testing. Inference of this nature, however, is 
limited by the heterogeneity of the group, and 
a study of the prepsychotic behavior of a group 
of latent schizophrenics is now in progress. 


REFERENCES 


1. CHASE, L., & SILVERMAN, S. Prognostic criteria in
schizophrenia. Amer. J. Psychiat., 1941, 98,
360-368.

2. CLARK, R. E. Psychoses, income, and occupational
prestige. Amer. J. Sociol., 1949, 54, 433-440.

3. DUNHAM, H. W., & MELTZER, B. S. Predicting
length of hospitalization of mental patients.
Amer. J. Sociol., 1946, 52, 123-131.

4. DYNES, J. B. Mental breaking point. New Eng. J.
Med., 1946, 234, 42-45.

5. FISHER, W., & HAYES, S. P., JR. Maladjustment in
college predicted by Bernreuter Inventory
scores and family position. J. appl. Psychol.,
1941, 25, 86-96.

6. GOUGH, H. G. Diagnostic patterns in the Minnesota
Multiphasic Personality Inventory. J. clin.
Psychol., 1946, 2, 23-37.

7. HERTZ, MARGUERITE R. Frequency tables to be used
in scoring responses to the Rorschach ink-blot test. 
Hep Ed.) Cleveland: Western Reserve Univer., 

8. HORST, P. The prediction of personal adjustment.
Soc. Sci. Res. Coun. Bull., 1941, No. 48.

9. JENKINS, W. L. The Minnesota Multiphasic
Personality Inventory applied to the problem of
prognosis in schizophrenia. Unpublished Ph.D.
thesis, Univer. of Minnesota, 1952.

10. JOHNSON, P. O. Statistical methods in research. New
York: Prentice-Hall, 1949.

11. KANT, O. The evaluation of prognostic criteria in
schizophrenia. J. nerv. ment. Dis., 1944, 100,
598-605.

12. KATZELL, R. A. Cross-validation of item analysis.
Educ. psychol. Measmt, 1951, 11, 16-22.

13. KLOPFER, B., & KELLEY, D. M. The Rorschach
technique. Yonkers, New York: World Book Co.,
1946.

14. LEWIS, N. D. C. Research in dementia praecox. New
York: Nat. Comm. Ment. Hygiene, 1936.

15. McNEMAR, Q. Psychological statistics. New York:
Wiley, 1949.

16. MAYER-GROSS, W., & MOORE, N. P. Schizophrenia.
J. ment. Sci., 1944, 90, 231-255.

17. MEEHL, P. E. Profile analysis of the MMPI in
differential diagnosis. J. appl. Psychol., 1946, 30,
517-524.

18. MOSIER, C. I. Problems and designs of cross-validation.
Educ. psychol. Measmt, 1951, 11,
5-11.

19. RABIN, A. I. The use of the Wechsler-Bellevue scales
with normal and abnormal persons. Psychol.
Bull., 1945, 42, 410-422.

20. RABIN, A. I., & GUERTIN, W. H. Research with the
Wechsler-Bellevue test: 1945-1950. Psychol.
Bull., 1951, 48, 211-242.

21. SCHMIDT, H. O. Test profiles as a diagnostic aid:
the Minnesota Multiphasic Personality Inventory.
J. appl. Psychol., 1945, 29, 115-131.

22. SCHOFIELD, W. S. Critique of scatter and profile
analysis of psychometric data. J. clin. Psychol.,
1952, 8, 16-22.

23. WINDLE, C. Psychological tests in psychopathological
prognosis. Psychol. Bull., 1952, 49,
451-482.

24. WITTMAN, PHYLLIS. A scale for measuring prognosis
in schizophrenic patients. Elgin Papers, 1941, 4,
20-33.


Received January 30, 1953. 


THE PERFORMANCE OF SCHIZOPHRENICS ON SOCIAL CONCEPTS¹


MARTIN WHITEMAN 
University of Delaware 


IN RECENT years there has been a growing
tendency to view schizophrenia as a
thinking disorder. This impairment of
thought processes in schizophrenia has been
demonstrated in both verbal tasks (1, 19, 21,
22) and performance tasks (3, 4, 14, 20), and
has been characterized by some as an inability
to generalize or to form concepts.

However, the origin of this impairment
is as yet open to conflicting interpretations.
For example, Vigotsky (20) believes that the
disturbance in concept formation is caused by
an underlying organic process. Goldstein
wrote in 1939, “this similarity (in test performances
of organic and schizophrenic cases)
does not permit the rash assumption that
schizophrenia is fundamentally an organic
disease; the great differences in the behavior
of organic and schizophrenic patients in
various directions will prevent such conclusions.
However, the similarity points to an
organic process as cause of this impairment,
whether this process be primary or of secondary
origin” (11, p. 583). In a later publication
(12), however, Goldstein seems to view the
problem as currently indeterminate. Both
Goldstein and Vigotsky stress the general
nature of the defect. Indeed, Goldstein considers
the abstract and concrete attitudes not
merely as cognitive functions but as “capacity
levels of the total personality, each furnishing
the basis for all performances on a certain


1 This report is based upon a dissertation submitted 
to the New York University Graduate School in partial 
fulfillment of the requirements for the degree of Doctor 
of Philosophy. The writer is indebted to Dr. Irwin N. 
Katz, under whose direction this study was undertaken,
for his advice and assistance. Grateful acknowledgement
is also made of the suggestions and criticisms
of Drs. Richard Christie, J. McV. Hunt, Lawrence
Karlin, Leonard S. Kogan, and Zygmunt A. Piotrowski. 
The writer is grateful for the cooperation offered by 
Miss Phoebe Overstreet and William Shelby of the 
Vocational Advisory Service of the YMCA, Dr. M. B.
Lipton of the Brooklyn State Hospital, and Dr. James 
F. Garrett of the New York University-Bellevue Medi- 
cal Center. A summary of this paper was read at the 
1952 meeting of the American Psychological Associa- 
tion. 


plane of reference to the outer world situation” 
(12, p. 18). 

For Cameron (5, 6, 7, 8, 9), the disorganiza- 
tion in schizophrenic thinking is a symptom 
of the patient's "social disarticulation," 
initially occasioned by defective role-taking 
ability. This isolation from common environ- 
mental influences leads to a progressive sub- 
stitution of asocial fantasy for realistic inter- 
change of attitudes and viewpoints, resulting 
in a gradual impairment of organized, socially 
acceptable thinking. 

It is interesting to note, in view of the above 
conflicting interpretive considerations, that 
there has been no attempt to study systemati- 
cally the test performance of schizophrenics 
on problems involving social concepts, as 
compared to their test performance on prob- 
lems involving formal concepts, i.e., the more 
impersonal, less socially-toned type of concept.²
A functional-behavioral stress, such as
Cameron's, in the problem of schizophrenic 
thinking would imply a selective impairment 
of cognitive functioning, one more apt to be 
revealed to the extent that the content of the 
concept to be formed by the patient in the 
experimental situation is related to his aware- 
ness of the realities of interpersonal situations 
about him. 

The purpose of this experiment is to investigate
the hypothesis that the social conceptual
performance of schizophrenics is impaired
relative to that of a normal group, even
though both populations are equated on formal
conceptual performance; or to put it differently,
that the schizophrenic deficit relative to
normals is greater on social conceptual problems
than on formal conceptual problems. By
formal concept is meant an abstraction which is
common to a number of noninterpersonal
situations, and which is capable of describing
aspects of such situations in (a) physical,
psychophysical, or quantitative terms, such as
volume, color, number, or (b) logical-relational
terms, such as concepts based on principles
of hierarchical classification like species-genus,
or relational concepts like “middleness.”
A social concept is an abstraction which
is common to a number of situations involving
people, and which is capable of describing
aspects of such situations in terms of the functional
relationships exhibited by people in
their mutual interaction, i.e., cooperation,
encouragement, courtesy.

2 The terms social concept and formal concept will
be explicitly defined below.


SCHIZOPHRENIC PERFORMANCE ON SOCIAL CONCEPTS


TABLE 1
AGE, VOCABULARY LEVEL, AND EDUCATION OF THE
CONTROL AND SCHIZOPHRENIC GROUPS

                        AGE          VOCABULARY*    EDUCATION**
GROUP            N    Mean   SD     Mean   SD      Mean   SD
Controls         31   29.4   5.9    9.7    2.1     11.1   2.0
Schizophrenics   31   29.8   5.5    9.7    2.3     10.9   2.4

* In Wechsler-Bellevue weighted score units.
** Years of schooling completed.

Social concepts are abstracted typically from 
situations in which people are reacting to one 
another. They are derived from such situations. 
It should be pointed out, however, that the 
distinction drawn between social concepts and 
formal concepts is not an absolute one. All 
concepts are social in the sense that they are 
learned and applied in social contexts. Also, 
the function of concepts, social or formal, is 
to provide a tool whereby hitherto unrelated 
situations may be related. What is stressed 
here is the relatively greater application of 
so-called social concepts than of so-called 
formal concepts to interpersonal situations. 


METHOD


Subjects. The experimental group consisted of 31 
schizophrenic patients at the Brooklyn State Hospital. 
Each member of this group was matched with a non- 
psychotic control on the basis of age, education, and 
intelligence as measured by the Wechsler-Bellevue 
vocabulary score. All subjects (Ss) were male, white, 
native born, and had lived most of their lives in an
urban environment. Table 1 presents means and 
standard deviations of the two groups for age, educa- 
tion, and vocabulary score. It will be noted that for 
each comparison the groups are approximately equal. 

Only those patients were selected whose diagnoses 
were unanimous and uncomplicated by neurological 
involvement or physical disability. Other criteria 
for selection were the testability and degree of co- 
operativeness of the patient. As a whole, the patient 
population could be characterized clinically, or in a 
Kraepelinian sense, as relatively undeteriorated. The 
median number of months of hospitalization was 11.3 
with 74 per cent of the group having had 2 years or less 


267 


of hospitalization. The range was from two months to 
nine years. The subdiagnostic breakdown was as 
follows: 13 catatonics, 12 paranoids, 4 mixed, 1 hebe- 
phrenic, and 1 simple. 

For the control group, 24 Ss were recruited from 
factories and business establishments in greater New 
York City. The remaining seven controls were ob- 
tained from the Vocational Advisory Service of the 
YMCA. Only those clients who were adjudged by the 
vocational counselor as not exhibiting undue emotional 
difficulties were utilized in the study. As a check upon 
the possibility of previous mental illness in the control 
group, each control was asked to enumerate any diseases 
or physical disabilities he had had both as a child and 
as an adult, whether he had spent any time in an institu- 
tion or a hospital for tuberculosis or heart disease, 
and finally whether he had ever had a “nervous break- 
down,” been treated by a psychiatrist, or spent time 
in a mental hospital. None of the controls used in the 
study gave indications of mental abnormality either 
in overt behavior, quality of verbalization, or from 
previous history. 

With respect to occupation, the majority of the 
members of both groups fall into the semiskilled cate- 
gory. It was not possible to make a man-for-man match- 
ing for occupation, but the groups as a whole are fairly 
comparable with respect to this variable. 

Tests. All tests were administered individually by 
the writer in one session. To obtain indices of formal 
conceptualization ability, it was decided to use a verbal 
scale and a performance scale. The ones selected were 
the verbal analogies and picture reasoning scales from 
Jastak’s battery, Psychometric Patterns (16), which 
was designed for clinical use. These tests have been 
standardized on the same normal population and lend 
themselves to objective scoring. Furthermore, the writer 
had used them in a clinical setting for over a year and 
had found them effective with psychotic subjects. The 
verbal analogies test consists of 32 items arranged in 
order of difficulty; each item is in the form, “the hand 
has fingers, the foot has...” The picture reasoning 
test comprises 11 items, also arranged in order of difficulty,
with each item consisting of five cards. The
S's task is to place each set of five cards in proper 
sequence according to a logical-relational concept, e.g., 
arrangement according to the size of the box shown in 
each picture. For these formal concept tests, differential 
credits for each item are assigned, depending upon ac- 
curacy of response and time. 

A preliminary study was carried out before the ex- 
periment was begun in order to devise a social concept 
test with a suitable range of item difficulty and to 
standardize test procedure. The final edition of the 
social concept test consisted of 21 items, the first 18 
items containing four cards each and the last three 
items containing six cards each. In both the four- and 
six-card items, three of the cards depicted instances of 
one social concept, while one card (in the case of the 
four-card items) and three cards (in the case of six- 
card items) represented negative instances. For ex- 
ample, in the case of the “rescue” item, the three con- 
cept cards included (a) a picture of a man rescuing a 
girl from the path of an advancing truck; (b) a scene
depicting a fireman rescuing a person from a burning
building; and (c) a picture of a man throwing a life
belt to a swimmer in distress. For the nonconcept card,
there was a scene showing two children on sleds racing
toward a busy intersection. The S's task was to separate 
the three cards that had the same idea from those that 
did not have that particular idea, and then to explain 

It was decided before the experiment proper to 
score the social concept test as simply and as objectively
as possible. As in the case of the formal concept
tests, differential credits for each item were assigned 
on the basis of accuracy of response (i.e., the sorting of 
the three concept cards) and time. However, when the 
occasion demanded it, the schizophrenics were permitted
to run over the time limit in order to minimize
frustration and to afford material for qualitative analy- 
sis. An untimed social concept accuracy score was also 
utilized in the analysis of the data. 

Statistical treatment. One test of the hypothesis in- 
volves: (a) statistically equating the picture reasoning 
and verbal analogies scores of the two groups; (b) com- 
puting the corrected difference between the control and 
schizophrenic groups on the social concept test; (c) 
testing the significance of this difference by means of 
the extension of the standard error formula for one 
matching variable given by Peters and Van Voorhis 
(18, pp. 463-469). There are two major assumptions 
which should be met before this statistical treatment 
can be applied. The assumption of normally distributed 
variables was met by converting the raw scores of the 
two groups on the three concept tests into T scores, 
i.e., normalized standard scores with a mean of 50 and
a sigma of 10. The second assumption is linearity of 
regression among the three variables. The scattergrams 
revealed no evidence of curvilinearity but to check the 
point more carefully, statistical tests were performed 
to determine whether the hypothesis of curvilinearity 
of regression could be sustained. The method is based 
on an analysis of variance technique and is described 
by McNemar (17, pp. 255-258). The resulting F's 
revealed that the hypothesis of curvilinearity of regression
cannot be sustained. This outcome does not refute curvilinearity, of course, nor
does it prove rectilinearity. However, this evidence,
coupled with the fact that there is no a priori reason for 
expecting a curvilinear relationship between tests of 
cognitive ability, lends support to the hypothesis of 
rectilinearity of regression. 
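
The conversion to T scores described above (normalized standard scores with a mean of 50 and a sigma of 10) can be sketched as follows. This is a minimal illustration, not the author's procedure: the mid-rank percentile rule and the sample scores are assumptions, and the function name is hypothetical.

```python
from statistics import NormalDist


def normalized_t_scores(raw_scores):
    """Convert raw scores to normalized T scores (mean 50, SD 10).

    Assumed method: each score is replaced by its mid-rank percentile
    in the sample, and the percentile is mapped through the inverse
    normal distribution function.
    """
    n = len(raw_scores)
    t_scores = []
    for x in raw_scores:
        below = sum(1 for y in raw_scores if y < x)
        ties = sum(1 for y in raw_scores if y == x)
        # Mid-rank percentile stays strictly between 0 and 1.
        p = (below + 0.5 * ties) / n
        z = NormalDist().inv_cdf(p)
        t_scores.append(50 + 10 * z)
    return t_scores


# Hypothetical raw scores, not data from the study.
raw = [3, 7, 7, 10, 12, 15, 21]
t = normalized_t_scores(raw)
for score, t_score in zip(raw, t):
    print(score, round(t_score, 1))
```

Because the transformation works on ranks, tied raw scores receive the same T score and the median raw score maps to 50 regardless of how skewed the raw distribution is.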


RESULTS 


The differences between the control and 
schizophrenic groups on the three concept 
tests are presented in Table 2. The first three 
critical ratios in this table were obtained 
through the application of the formula for the 
standard error of the difference between 
correlated means (17, p. 74). The fourth and 
fifth critical ratios, for the corrected social 
concept scores, were evaluated by means of 
the Peters and Van Voorhis formula men- 
tioned in the previous section. 
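
As a reading aid, the critical ratios in Table 2 can be recomputed as the mean difference divided by its standard error. The sketch below uses the figures printed in Table 2; the helper names are hypothetical, and the formula shown for the standard error of a difference between correlated means is the usual textbook form, assumed here rather than quoted from McNemar.

```python
from math import sqrt


def se_diff_correlated(se_a, se_b, r):
    """Standard error of the difference between two correlated means.

    Assumed textbook form: sqrt(SEa^2 + SEb^2 - 2 * r * SEa * SEb).
    """
    return sqrt(se_a ** 2 + se_b ** 2 - 2 * r * se_a * se_b)


def critical_ratio(mean_diff, std_error):
    """Critical ratio: a mean difference expressed in standard-error units."""
    return mean_diff / std_error


# (mean difference, standard error) as printed in Table 2, in T-score units.
table2 = {
    "Verbal analogies": (3.645, 1.329),
    "Picture reasoning": (8.387, 1.488),
    "Social concepts (unadjusted)": (13.806, 1.636),
    "Social concepts I (adjusted)": (11.945, 1.538),
    "Social concepts II (adjusted)": (11.613, 1.806),
}

ratios = {name: critical_ratio(d, se) for name, (d, se) in table2.items()}
for name, cr in ratios.items():
    print(f"{name:32s} CR = {cr:.2f}")
```

The recomputed values agree with the printed critical ratios to within about .01; a last-digit difference is to be expected because the printed differences and standard errors are themselves rounded.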

It will be seen that all the tests discriminate 
between the two groups in favor of the controls,
that the social concept test shows the


Martin WHITEMAN 


greatest disparity between the control and 
schizophrenic groups, and that the difference 
between these groups on the social concept 
scale is highly significant even when differences 
on the formal concept tests are partialled out. 

These results are supported by the findings 
from the simpler technique of comparing the 
proportion of schizophrenics whose social 
concept scores were below the lowest social 
concept score of the controls with the pro- 
portion of schizophrenics whose formal con- 
cept scores fell below the poorest formal con- 
cept scores of the control group. Whereas 21 
of the 31 schizophrenics obtained social con- 
cept scores which were below the poorest social 
concept score of the controls, only nine schizo- 
phrenics secured picture reasoning or verbal 
analogies scores below the poorest control 
scores on the latter scales. The difference 
between these proportions yields a chi square 
for correlated proportions of 10.08, which is 
significant at the .01 level. 
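
The chi square for correlated proportions can be illustrated with a short sketch. The text does not report how the 21 and 9 cases overlap, so the split used below (all nine schizophrenics who fell below the controls on a formal test also fell below them on the social test, giving 12 and 0 discordant cases) is an assumption; with a continuity correction it happens to reproduce the reported 10.08, but whether the author computed it this way is not stated.

```python
def correlated_proportions_chi_square(b, c, continuity=True):
    """Chi square (1 df) for the difference between two correlated proportions.

    b: cases below the cutoff on the first measure only
    c: cases below the cutoff on the second measure only
    Cases that agree on both measures do not enter the statistic.
    """
    if b + c == 0:
        raise ValueError("no discordant cases; the statistic is undefined")
    diff = abs(b - c)
    if continuity:
        diff = max(diff - 1, 0)
    return diff ** 2 / (b + c)


# Assumed split, not reported in the text: 21 - 9 = 12 cases low on the
# social concept test only, 0 cases low on a formal concept test only.
chi_square = correlated_proportions_chi_square(12, 0)
CRITICAL_01_1DF = 6.635  # tabled chi square for 1 df at the .01 level
print(round(chi_square, 2), chi_square > CRITICAL_01_1DF)
```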

The social concept test was scored also for
accuracy of sorting only, without taking account
of time. This untimed accuracy score
enabled the schizophrenics, who, as a group,
were slower than the controls, to score relatively
higher than they would with an accuracy-time
score. Nineteen of the 31 schizophrenics
obtained untimed social concept
scores below the poorest social concept score
of the control group. The difference between
this proportion and the proportion of schizophrenics
scoring below the poorest control on
the formal concept tests is significant at the
.01 level (chi square is equal to 9.09).


TABLE 2
DIFFERENCES AND SIGNIFICANCE OF THE DIFFERENCES
BETWEEN CONTROL AND SCHIZOPHRENIC GROUPS
ON THE VERBAL ANALOGIES, PICTURE REASONING,
AND SOCIAL CONCEPT TESTS

                                                STANDARD
                                   MEAN         ERROR OF MEAN
TEST                               DIFFERENCE   DIFFERENCE      CR*
Verbal analogies                    3.645        1.329          2.74
Picture reasoning                   8.387        1.488          5.64
Social concepts (unadjusted)**     13.806        1.636          8.44
Social concepts I (adjusted)**     11.945        1.538          7.76
Social concepts II (adjusted)**    11.613        1.806          6.43

* The differences are all in favor of the controls, are expressed
in T-score units, and are all significant beyond the .01
level of confidence.

** The unadjusted social concept scores are those experimentally
obtained. The adjusted social concept scores have been
statistically corrected so as to equate the two groups (a) on both
verbal analogies and picture reasoning (social concepts I), or
(b) only on picture reasoning, which was the more discriminating
of the two formal concept tests (social concepts II).

We have also been able to match directly 
12 pairs of controls and schizophrenics whose 
formal concept scores (averaged verbal analo- 
gies and picture reasoning T scores) were 
exactly equal. This was done in order to make 
a direct comparison of their respective untimed 
social concept T scores without necessitating a 
prior statistical correction for inequality in 
formal conceptualization. The average differ- 
ence between the two groups of matched 
Ss is 9.83 (in favor of the controls), with a 
standard error of mean difference equal to 
1.014, yielding a t ratio for small correlated
samples (17, p. 226) of 9.69. This t is significant
at the .01 level for 11 df, and substantiates by
a direct equation on formal concepts what was 
found by the statistical matching reported 
above. 

Finally, 16 pairs of controls and schizophren- 
ics were matched directly on the timed picture 
reasoning test alone, since this latter scale 
proved to be more discriminating between the 
two groups than the verbal analogies test. 
The mean difference between controls and 
schizophrenics on the untimed social concept 
test was 6.78 (in favor of the controls), with a 
standard error of mean difference equal to 
2.378. The obtained t of 2.85 is significant at
the .02 level, or at the .01 level if a one-tailed
test is used. The use of a
one-tailed significance test is defensible since
the hypothesis presupposed a difference in a
particular direction, namely, in the direction of
schizophrenic decrement. 
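
The two matched-pair comparisons can be checked from the reported summary figures. The sketch below recomputes each t as the mean difference over its standard error and compares it with critical values copied from a standard table of Student's t; the dictionary layout and names are illustrative assumptions.

```python
def t_ratio(mean_diff, std_error):
    """t for correlated samples from a mean difference and its standard error."""
    return mean_diff / std_error


# Critical values of Student's t from a standard table.
# Key: (degrees of freedom, kind of test, significance level).
T_CRITICAL = {
    (11, "two-tailed", 0.01): 3.106,
    (15, "two-tailed", 0.02): 2.602,
    (15, "two-tailed", 0.01): 2.947,
    (15, "one-tailed", 0.01): 2.602,
}

# 12 pairs matched on averaged formal concept T scores (11 df).
t_12_pairs = t_ratio(9.83, 1.014)
# 16 pairs matched on timed picture reasoning alone (15 df).
t_16_pairs = t_ratio(6.78, 2.378)

print(round(t_12_pairs, 2), t_12_pairs > T_CRITICAL[(11, "two-tailed", 0.01)])
print(round(t_16_pairs, 2), t_16_pairs > T_CRITICAL[(15, "one-tailed", 0.01)])
```

With 15 df the one-tailed .01 point coincides with the two-tailed .02 point, which is why the second t clears the .01 level only when the test is one-tailed.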

In each of the above t tests, the hypothesis
of homogeneity of variance was not refuted by
the significance level corresponding to the F
ratio obtained by dividing the variances of
the schizophrenics' and controls' untimed social
concept scores.

It may be argued that the social concept 
test may make heavier demands upon ab- 
stractive capacity than the formal concept 
tests, i.e., may require greater generalization 
ability, more planning skill, keener isolation of 
parts, or may make greater demands upon 
symbolic processes (13). If this is so, then the 
schizophrenic decrement on the social concept
test could be predicted solely from the hypothesis
of schizophrenic impairment of abstract
attitude. However, if this is true, then it 
follows that (a) the controls would find the 
social concept test more difficult than the 
formal concept tests, and (b) the schizophrenic 
group would show greater decrement on those 
items of the social concept test which are more 
difficult for the controls than on the easier 
items of this test. As a check upon the first 
deduction, we have compared the number of 
failures per item (total number of failures 
divided by the number of items on the test) 
of the verbal analogies, picture reasoning, and 
social concept tests. The number of failures 
per item were 13.5, 7.2, and 7.8, respectively. 
There is therefore no evidence to support the 
hypothesis that the schizophrenic decrement 
on the social concept test is due to the greater 
intrinsic difficulty of this test. As a check upon 
the second deduction, the items of the social 
concept test were ranked in order of difficulty 
for the control group (percentage passing) and 
in order of intergroup discriminative power. 
The obtained rho was actually negative
(-.30), implying that there was no tendency
for the more difficult items to prove more 
potent as differentiators between the two 
groups than the easier ones. 
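
The rank-order check described above can be sketched as a Spearman rho between item difficulty and item discriminative power. The item values below are hypothetical placeholders (the study's item-level data are not reported), and the function names are illustrative.

```python
def average_ranks(values):
    """Ranks starting at 1 for the smallest value; ties share the mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def spearman_rho(x, y):
    """Spearman rank correlation: Pearson correlation of the two rank sets."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Hypothetical item data: per cent of controls failing each item
# (difficulty) and an index of how well the item separates the groups.
difficulty = [10, 25, 30, 45, 60, 80]
discrimination = [0.50, 0.42, 0.55, 0.30, 0.38, 0.20]
rho = spearman_rho(difficulty, discrimination)
print(round(rho, 2))
```

A negative rho, as in this made-up example and in the reported result, means the harder items were not the better discriminators between the groups.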


Discussion 


The results lend support to the hypothesis
in that they demonstrate a pronounced decre- 
ment in schizophrenic performance on social 
concepts relative to formal concepts. These 
results were predicted from Cameron’s theory, 
but the logic of the prediction bears closer 
scrutiny. An attempt will be made to phrase 
the argument with greater explicitness, and, 
it is hoped, with a consequent increase in 
rigor. The paradigm for the argument is 
borrowed from Cohen and Nagel (10). 

More schizophrenics than normals exhibit 
decrements in conceptualization. These are 
observations or statistical inferences from 
observations. 


Rival Hypotheses 

1. If more schizophrenics suffer from “social 
disarticulation,” resulting in an impairment of 
organized, socially acceptable thinking, then 
schizophrenics would tend to exhibit significant 
decrements in conceptualization. (General 
rule—Cameron’s hypothesis.) 




2. If more schizophrenics than controls 
suffer from a decrement in the "abstract 
attitude," then schizophrenics would tend to 
exhibit significant decrements in conceptuali- 
zation. (General rule—Goldstein's hypothesis.) 


Deductive Elaboration 


1. From Cameron’s hypothesis: 

a. More schizophrenics than controls 
suffer from social disarticulation. 

b. Those who suffer from social dis- 
articulation would have greater difficulty in 
the conceptualization of socially toned ma- 
terial than more neutrally toned material. 

c. Therefore schizophrenics would evi- 
dence greater decrements in the conceptuali- 
zation of socially toned material than more 
neutrally toned material. 

2. From Goldstein’s hypothesis: 

a. More schizophrenics than controls 
suffer decrements in abstract attitude. 

b. Those who suffer from abstract atti- 
tude decrement may or may not show selective 
decrement in social concepts. 

The expression “may or may not" refers to 
the fact that it is impossible to deduce proposi- 
tions about social concept decrement from 
Goldstein's hypothesis with the same degree 
of probability that such an inference can be 
made from Cameron's hypothesis. Indeed the 
presumption would be that since the abstract 
attitude represents a capacity level of the 
total personality, there would be little selective 
impairment within sizable groups, regardless 
of the type of conceptualization task used, 
provided they are comparable in difficulty. 

It would appear that this decrement can be 
assimilated best into a theoretical framework 
which also stresses the importance of experience
and learning as determinants of
cognitive functioning. Irrespective of one's
systematic orientation in the field of learning,
whether one is a cognitive theorist or a re- 
inforcement theorist, it can be agreed that the 
less one comes into contact with a given type 
of situation, the less one will know about it 
and the less one will be able to respond ade- 
quately to it. As Hunt and Cofer put it, “as 
must all thought processes like classifying, 
inducing relations, etc., we should guess that 
the ‘abstract abilities’ assessed by these tests 
involve habitual skills which may either be 
strengthened or weakened by an individual’s
social experience” (15, p. 1019). Bleuler,
despite his recognition of constitutional determinants,
emphasizes that “endowment
with the capacity to form many sharply de- 
fined concepts is an empty potentiality in the 
absence of rich experience to supply the ma- 
terial for the concepts and their delineation” 
(2, p. 430). Just why the schizophrenic is 
socially estranged cannot be answered. The 
question of asocialization is part of the broader 
problem of socialization, a convergent area for 
the fields of learning and personality, neither 
of which is advanced as yet to the stage of 
individual prediction or even delineation of 
crucial factors. However, if we were to assume 
such a process of social disarticulation, despite 
admitted ignorance of its exact determinants, 
then the obtained selective decrement might 
be considered as a result of schizophrenic 
withdrawal, possibly acting in conjunction 
with, or as a reaction to, a deficit in abstract 
attitude. 


SUMMARY 


A hypothesis was derived from Cameron’s 
view of schizophrenic thinking as a product of 
the social disarticulation of this group, as 
contrasted with Goldstein’s interpretation of 
the defect in schizophrenic thought as the 
result of an impairment of the abstract atti- 
tude. The hypothesis was that schizophrenics 
would exhibit a greater decrement relative to 
normals on a test of social concepts than on 
tests of formal concepts. 

Two formal concept tests (one verbal and 
one performance) and a pictorial social con- 
cept test were administered to 31 schizophren- 
ics and 31 normal controls matched with 
respect to age, education, sex, and vocabulary 
score. The populations were then equated
statistically and through direct matchings on 
the formal concept scores. 

Significant differences in favor of the con- 
trols were obtained between the two groups 
on both types of test. However, schizophrenic 
decrement on the social concept test proved 
significantly greater than decrement on the 
formal concept tests. This differential decre- 
ment obtained whether timed or untimed 
scores of the social concept test were used as
dependent variables. 

It is believed that as an explanatory hypothesis,
the concept of an impairment of abstract
attitude is, by itself, insufficient to
account for the selective schizophrenic impairment
on the social concept test. The latter
decrement has been interpreted as lending 
presumptive support to a theoretical position 
which also stresses the importance of social 
withdrawal as a determinant of cognitive func- 
tioning in schizophrenia. 


REFERENCES 


1. Benjamin, J. D. A method for distinguishing and
evaluating formal thinking disorders in schizophrenia.
In J. S. Kasanin (Ed.), Language and
thought in schizophrenia. Berkeley: Univer. of
California Press, 1946. Pp. 65-90.

2. Bleuler, E. Autistic thinking. In D. Rapaport
(Ed.), Organization and pathology of thought.
New York: Columbia Univer. Press, 1951. Pp.
399-437.

3. Bolles, M. M. The basis of pertinence. Arch.
Psychol., 1937, No. 212.

4. Bolles, M. M., & Goldstein, K. A study of the
impairment of "abstract behavior" in schizophrenic
patients. Psychiat. Quart., 1938, 12,
42-65.

5. Cameron, N. A study of thinking in senile deterioration
and schizophrenic disorganization.
Amer. J. Psychol., 1938, 51, 650-655.

6. Cameron, N. Reasoning, regression, and communication
in schizophrenics. Psychol. Monogr.,
1938, 50, No. 1 (Whole No. 221).

7. Cameron, N. Deterioration and regression in
schizophrenic thinking. J. abnorm. soc. Psychol.,
1939, 34, 265-270.

8. Cameron, N. Schizophrenic thinking in a problem-solving
situation. J. ment. Sci., 1939, 85, 1012-

9. Cameron, N. The functional psychoses. In J.
McV. Hunt (Ed.), Personality and the behavior
disorders. Vol. II. New York: Ronald, 1944.
Pp. 861-921.

10. Cohen, M. R., & Nagel, E. An introduction to
logic and scientific method. New York: Harcourt
Brace, 1934.

11. Goldstein, K. The significance of special mental
tests for diagnosis and prognosis in schizophrenia.
Amer. J. Psychiat., 1939-1940, 96, 575-588.

12. Goldstein, K. Methodological approach to the
study of schizophrenic thought disorder. In
J. S. Kasanin (Ed.), Language and thought in
schizophrenia. Berkeley: Univer. of California
Press, 1946. Pp. 17-40.

13. Goldstein, K., & Scheerer, M. Abstract and
concrete behavior. Psychol. Monogr., 1941, 53,
No. 2 (Whole No. 239).

14. Hanfmann, Eugenia, & Kasanin, J. Conceptual
thinking in schizophrenia. Nerv. ment. Dis.
Monogr., 1942, No. 67.

15. Hunt, J. McV., & Cofer, C. N. Psychological
deficit. In J. McV. Hunt (Ed.), Personality and
the behavior disorders. Vol. II. New York: Ronald,
1944. Pp. 971-1032.

16. Jastak, J. A rigorous criterion for feeble-mindedness.
J. abnorm. soc. Psychol., 1949, 44, 367-378.

17. McNemar, Q. Psychological statistics. New York:
Wiley, 1949.

18. Peters, C. C., & Van Voorhis, W. R. Statistical
procedures and their mathematical bases. New
York: McGraw-Hill, 1940.

19. Rashkis, H. A. Three types of thinking disorders.
J. nerv. ment. Dis., 1947, 106, 650-670.

20. Vigotsky, L. Thought in schizophrenia. (Translation.)
Arch. Neurol. Psychiat., 1934, 31, 1063-1077.

21. Wegrocki, H. Generalizing ability in schizophrenia.
Arch. Psychol., 1940, No. 254.

22. Wentworth, M. M. 200 cases of dementia praecox
tested by Stanford Revision. J. abnorm. soc.
Psychol., 1924, 18, 378-384.


Received January 31, 1953. 


PSYCHOLOGICAL PROGNOSIS OF OUTCOME IN THE
MENTAL DISORDERS¹


JOSEPH ZUBIN AND CHARLES WINDLE 
Psychiatric Institute, Columbia University


WHEN mental disorders are regarded
as continuing processes running a 
regular course with definite charac- 
teristics and not as haphazard vicissitudes of 
life, it becomes possible to study the initial, 
middle, and ultimate stages of these processes 
and hopefully to obtain indicators in each stage 
of the probable subsequent course of the disorder.
This is the essence of the prognostic problem.
The personality of the patient, his genetic
and environmental background, his family re- 
lationships, vocational and social adjustment, 
in fact all his assets and liabilities at any given 
period, may become important indicators of 
the probable course of his illness from then on. 
The wise clinician has always been aware 
that the characteristics of the patient at the 
time he comes for treatment have an important 
relationship to his chances for eventual im- 
provement, provided the stage of the illness 
can be determined. One able diagnostician, 
Kraepelin, has even proposed for one illness, 
dementia praecox, that diagnosis be based on 
the probable outcome of the disease. Present- 
day clinicians still follow this pattern to some 
extent; they examine disease processes with an 
eye to their eventual outcome and diagnose accordingly.
But the observations on which these
prognoses are made are usually impressionistic.
Whether the psychologist can do any better
with tests than the clinician with his intuition 
is perhaps debatable, but there is such a 
definite demand for improving prognostic 
criteria that tests are well worth trying. The 
demand for improvement arises in the wake of 
the general finding that modern specific ther- 
apies of both the psychotherapeutic and soma- 
totherapeutic variety seem to have an advan- 
tage over nonspecific therapies, such as “total 
1 Read at the symposium on "Interrelations of Biochemistry,
Psychiatry, and Psychology in Prognosis,"
American Psychological Association, September, 1951,
Chicago, Illinois. The preparation of this article was
facilitated by a grant from the National Institute of
Mental Health, Public Health Service, Department of
Health, Education, and Welfare. Dr. Windle is now a
Research Scientist at HumRRO.


push," only for the period immediately follow- 
ing the therapy. In long-range follow-up 
studies, no conclusive evidence is available for
any advantage in favor of specific therapies
(84). There are two possible explanations for
the failure of the specific therapies to stand up 
under long-range scrutiny: (a) the specific 
therapies help only those patients who would 
improve anyhow, and (b) each of the specific 
therapies is effective only for a specific type of 
patient. Perhaps, if prognostic tests were avail- 
able to predict the different outcomes to be 
expected for a given patient under each of the 
specific therapies, better results than those af- 
forded by nonspecific therapies would follow. 
In either case, prognostic information seems 
highly desirable. Consequently, psychological 
tests have been employed with increasing frequency
in order to provide more standard indications
of probable outcome, and this has
made possible confirmatory studies and sta- 
tistical evaluation of the reliability and valid- 
ity of the measures. 

Despite the great increase in interest in prog- 
nosis today, most studies have preferred to 
follow the pattern laid down by the earlier diag- 
nostic studies rather than blaze new trails 
which would lead more directly to the goals of 
prognosis. There are some problems common
to both diagnostic and prognostic studies, such
as, (a) procedures for establishing validity of 
instruments; (b) classification of patients into 
homogeneous categories such as age, sex, etc.; 
(c) proper statistical treatment of the data. 
Among the problems peculiar to prognosis are:
(a) proper design of prognostic study; (b) specification
of therapeutic agent or type of treatment
to which patient is to be exposed; (c)
duration of illness at time when prognosis is 
made; (d) stage of illness at which prognosis is 
made; (e) duration of follow-up; and (f) criteria 
for evaluating outcome. We shall deal with 
each of these difficulties in turn. 

The psychological tests used to evaluate 
functioning of patients cannot now be validated 
directly but must depend for their ultimate
validation on interviews and direct observations
of patient behavior. In contrast with
physiology, which also evaluates functioning, 
psychology has no anatomy to which the ob- 
served behavior can be related. The so-called 
structural elements portrayed in Freud’s 
anatomy of the mind, Lewin's “differentiated 
regions of the life space,” Rank’s will, Mc- 
Dougall’s sentiments, Stern’s self, Spearman’s 
and Thurstone’s factors, and the personality 
structure conjured by Rorschach and projec- 
tive technique workers, are little more than an 
assemblage of abstractions derived from de- 
scriptions of behavior, rather than independent 
constructs amenable to direct measurement. 
Perhaps the neurophysiologists will some day 
perfect models of brain function which may 
serve as a physiological substratum for percep- 
tion, thinking, and action, but such models are 
still in the future. Until that time, the only 
possible way to validate psychological tests is 
through direct observation of behavior and 
interviews. 

Prognostic studies, unless well designed and 
appropriately implemented, are of little value. 
Too often the prognostic aspects of a study are 
introduced as an afterthought when the major 
aim of the study has failed to materialize. In 
order for a prognostic study to be scientifically 
acceptable it must be planned for, and must 
satisfy certain elementary criteria in its pres- 
entation of the data. Of primary importance, 
of course, is the specification of the particular 
pathological condition under investigation, 
including both the specific disease and the 
condition of the patients with regard to age, 
sex, attitude toward illness, etc. Also important 
are the form of therapy and the time relations 
in the prognosis. It is well known that the 
stage of development of the illness is relevant 
to prognosis. The outcome immediately after 
therapy is often at variance with the outcome 
after several years, and for this reason the
period of follow-up must be specified. One 
might think that these elementary require- 
ments of scientific reporting are so self-evident 
that they hardly need mentioning. Unfor- 
tunately, many of the studies in the literature 
fail to provide these crucial bits of information. 
The use of suitable statistics for evaluating 
outcome is, of course, essential in prognostic 
studies. Nevertheless, some studies in this 
field suffer from the inadequate use of statistics,
many being content with an impressionistic
comparison of central tendencies.
Furthermore, most of the prognostic studies 
today are based on hindsight rather than fore- 
sight. It is to be hoped that as more experience 
is gained, a few hardy souls will attempt truly 
predictive studies. 

Another difficulty that hampers prognostic 
studies is the absence of uniformly acceptable 
criteria of improvement against which prog- 
nostic tests could be validated. Such terms as 
“cured” and “recovered,” appropriate as they 
may be in acute physical diseases like appendi- 
citis, are not at all applicable to chronic ill- 
nesses like schizophrenia. The goal of therapy, 
which is fairly well defined in the acute physical 
diseases—to restore the patient to his pre- 
morbid status—must be readjusted to a lower 
level of aspiration in the case of chronic ill- 
nesses (82). As a realistic basis for the evalua- 
tion of outcome of therapy one must still 
resort to such crude measures as being in or 
out of the hospital. 

In view of the above difficulties, the only 
method of ascertaining whether any relationships
exist between test performance and subsequent
outcome was to search for possible
consistencies in the various studies despite 
their shortcomings. Previous reviews of the 
literature on this topic (16, 18, 64, 66) have 
been of limited scope and have not attempted 
a thoroughgoing resolution of the conflicting 
claims made for the various tests. Conse- 
quently, a more complete review (78) was 
undertaken by one of the present authors to 
determine the validity of the prognostic claims 
for each of the psychological tests used. 

It was found that most of the reports of the
prognostic efficacy of psychological tests were 
concerned with three techniques: the Ror- 
schach, the Minnesota Multiphasic Personality 
Inventory (MMPI), and a variety of intelli- 
gence tests. Each of these techniques had been 
claimed to be effective in predicting outcome, 
but in each case several investigators failed to 
confirm the claims. In addition, a wide variety 
of specific indices had been claimed to be pre- 
dictive in a given test, and there was con- 
siderable diversity in the prognostic value or 
direction of any particular sign or index. This 
discordance can best be illustrated by pre- 
senting summaries of the findings employing 
these specific psychological tests. 

The review of the literature disclosed 22 
studies in which the Rorschach technique was
investigated as a prognostic agent in the psychoses.²
All but seven (18, 29, 60, 61, 63, 66,
76) of these studies reported the Rorschach to 
have predictive efficacy. Furthermore, most of 
the studies claimed that in general the more 
capable patients or those who were most nearly 
normal were more likely to have favorable 
outcomes. In spite of this agreement in the 
over-all interpretation of the Rorschach proto- 
cols, the empirical signs proposed for predic- 
tion exhibited little consistency. 

For example, one outstanding exponent of 
the prognostic use of the Rorschach, who has 
contributed eight papers to this field (38, 50, 
54, 55, 56, 57, 58, 59), has proposed in each of 
seven different experiments a different set of 
signs or indices, often varying considerably 
from the signs reported in previous investiga- 
tions. It appears that not one of the indices 
which was found to have prognostic power 
retrospectively, maintained this power in a 
new sample. The same inconsistency that 
occurs in the empirical findings is also present 
at the level of interpretation. Thus, running 
parallel with the seven different groups of 
prognostic indices are three slightly different 
interpretations of the personality patterns 
which are claimed to differentiate outcome 
groups (55, 57, 58). One can sympathize with 
the strivings to improve the test results, but 
one must also realize that the continued change 
of indicators may reflect the inappropriateness 
of the selected signs. Furthermore, the multi- 
plicity of possible indices afforded by the 
Rorschach guarantees that some pseudosignifi- 
cant differences will always present them- 
selves. 

Another example of the difficulties of the 
Rorschach technique is provided in a follow-up 
study by Sloan (73) of mental defectives discharged
on wage placement. Seven criteria
which had been suggested by another well- 
known Rorschach expert as prognostic for 
good adjustment were computed for contrasted 
outcome groups. Although the Rorschach 
criteria for good adjustment adequately differ- 
entiated the outcome groups, the predicted 
outcome was the very opposite of that which 
actually occurred. Those mental defectives 
who were successful in staying out on wage 
placement had a reliably smaller number of 
agreements with the criteria for good adjustment
than did those who were returned as
failures.

² These 22 studies included 18, 20, 22, 29, 30, 38, 40,
47, 49, 50, 54, 55, 56, 57, 59, 60, 61, 63, 66, 67, 75, 76.


JOSEPH ZUBIN AND CHARLES WINDLE

Additional difficulties in accepting the claims 
for the prognostic value of the Rorschach arise 
from the inadequate evaluation of the results 
of experiments in this field. Only six of the 
studies making prognostic claims with psy- 
chotics (40, 47, 54, 56, 59, 67) have utilized 
statistical methods to test significance, and 
two of these (47, 54) omitted the use of Yates’s 
correction in the chi-square test with small 
samples. The best clinical verdict one can give 
regarding the prognostic claims of the Ror- 
schach technique is: not proven. 
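
The effect of omitting Yates's correction in a small sample can be sketched as follows. The counts are hypothetical and are not taken from any of the studies cited; the function name `chi_square_2x2` is likewise an illustrative assumption. The sketch computes the chi-square statistic for a fourfold table with and without the continuity correction and compares each with the .05 critical value for one degree of freedom.

```python
def chi_square_2x2(a, b, c, d, yates=False):
    """Chi-square statistic (1 df) for the fourfold table [[a, b], [c, d]]."""
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        # Yates's continuity correction: reduce |ad - bc| by n/2, never below zero.
        diff = max(diff - n / 2, 0.0)
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    return n * diff ** 2 / denom


# Hypothetical counts: rows = sign present / absent, columns = improved / unimproved.
table = (8, 2, 3, 7)
CRITICAL_05 = 3.841  # chi-square critical value for 1 df at the .05 level

plain = chi_square_2x2(*table)
corrected = chi_square_2x2(*table, yates=True)
print(f"uncorrected: {plain:.2f}, exceeds critical value: {plain > CRITICAL_05}")
print(f"with Yates:  {corrected:.2f}, exceeds critical value: {corrected > CRITICAL_05}")
```

With these illustrative counts the uncorrected statistic exceeds the critical value while the corrected one does not, which is the kind of discrepancy that makes the omission matter when samples are small.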

It might be expected that studies of the 
prognostic value of the MMPI would show 
more consistency than those of the Rorschach. 
The scoring of this test is entirely objective, 
there are fewer standard indices that might 
appear to have prognostic value so that pure 
chance results play a smaller role, and less 
retrospective manipulation of the indices in 
the form of fractions, ratios, and weighted
scores is possible. Consequently, it is to be
expected that most of the MMPI studies would
be more likely to establish convincingly the
prognostic value of the test. It is, therefore,
disappointing to discover the large amount of
disagreement in the MMPI findings. In about
one third of the studies high scores on specific
scales were thought to signify good outcome
for psychoses, in another third high scores indicated
poor outcome, and in the remaining
third no prognostic value was found. Thus,
high scores on the schizophrenia scale, the one
most frequently cited, were thought by Harris,
Bowman, and Simon (26), Hales and Simon
(21), and Carp (8) to be favorable for outcome
in the psychoses, while Harris (24), Pearson
(51), and Feldman (16) felt high schizophrenia
scores were unfavorable. A similar situation
existed with the other MMPI scales, high
scores sometimes being claimed indicative of
good outcome, but as often found to be correlated
with poor outcome or of no prognostic
value.

It is clear that such evidence is not very 
helpful in providing a basis for prediction. It 
is quite likely that some of the studies provide 
more valuable evidence than others, but unless 
statistical estimates of the validity of the findings
are provided, it is difficult to make a
distinction between the good and the poor
studies. Perhaps of even greater importance
is the paucity of confirmatory studies with
new samples. Such confirmation has been 
attempted in four instances with the MMPI, 
but only two of these attempts proved success- 
ful (16, 52). One of the others failed to specify 
the criteria of outcome (26), while the second 
claimed validation on the basis of insignificant 
data (51). Again, the most accurate evaluation 
of the prognostic efficacy of the MMPI must 
be: not proven.

Studies of the use of ability tests for prog- 
nosis also fail to yield a consistent basis for 
prediction of outcome. Of 23 reports of the 
prognostic value of various psychomotor tests 
with psychotics, 10 indicated that favorable 
outcome correlated with efficient performance 
before therapy (6, 9, 20, 33, 38, 42, 49, 67, 77, 
83). On the other hand, five reports concluded 
that inefficient performance was prognostically 
favorable (35, 43, 44, 48, 84), three reported 
either a curvilinear relationship or contrary 
trends under different conditions (36, 68, 79), 
and five found no relationship between intelli- 
gence tests and outcome (19, 26, 53, 60, 74). 
Such a summary of the results of the various 
ability tests may do injustice to the particular 
types of tests employed or special abilities 
studied. There is little evidence, however, that 
any particular test is consistently more effective
for prognosis than another, or that the
nature of the test can account for the divergent 
conclusions found in the literature. Again, the 
verdict is: not proven. 
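
The tally behind this verdict can be reproduced directly from the reference numbers given in the paragraph above. The short sketch below is offered only as a bookkeeping aid; the dictionary labels are paraphrases introduced here, not the authors' terms.

```python
# Reference numbers exactly as grouped in the paragraph above.
reports = {
    "efficient performance favorable": [6, 9, 20, 33, 38, 42, 49, 67, 77, 83],
    "inefficient performance favorable": [35, 43, 44, 48, 84],
    "curvilinear or contrary trends": [36, 68, 79],
    "no relationship": [19, 26, 53, 60, 74],
}

total = sum(len(refs) for refs in reports.values())
shares = {label: len(refs) / total for label, refs in reports.items()}

for label, refs in reports.items():
    print(f"{label:35s} {len(refs):2d} of {total} ({shares[label]:.0%})")
```

The four groups account for all 23 reports, and no single conclusion is supported by a majority of them.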

We have found, then, that with each of the 
three techniques, the Rorschach, the MMPI, 
and tests of ability, there is a large amount of 
disagreement concerning both the signs to be 
regarded as prognostic and the direction of the 
outcome to be predicted from any particular 
sign. 

To some extent specific findings seem to be 
attributable either to the diagnosis of the 
patients or apparently to the type of treatment.
For example, Malamud and others (43, 44)
have reported that involutional psychotics of 
estimated inferior intelligence have better 
prognoses than do those of above average 
intelligence. Studies by Schnack, Shakow, and 
Lively (68) and by Zubin and Thompson (83) 
have indicated that different prognostic signs 
may apply to patients given insulin than to 
patients given metrazol therapy. Numerous 
investigators (59, 68, 85) have also pointed out 
that the duration of the follow-up period may
also play a role in determining the significance
of the prognostic signs. 

It is difficult to try to account for the di- 
vergent conclusions of the prognostic studies 
on the basis of any general characteristics, be- 
cause relatively few investigators specified the 
important conditions which characterized their 
experiments. Thus, of 28 prognostic studies 
with the Rorschach, 19 (all but 20, 22, 38, 40, 
47, 56, 59, 66, 76) did not specify the duration 
of illness of the patients and seven (30, 34, 54, 
55, 63, 67, 75) failed to state the time of follow-up.³
Of 10 MMPI prognostic studies (8,
15, 16, 21, 24, 26, 50, 51, 52, 69), five reported 
that the prognoses applied to the broad diag- 
nostic category of “psychiatric patients” (15, 
16, 24, 26, 52), only about one third specified 
the duration of illness of their patients (8, 21, 
26), and about one third did not indicate the 
duration of the follow-up period (8, 24, 69). 

None of these three factors, diagnosis, type 
of therapy, or duration of follow-up, seems to 
be able to account for all of the discrepancies 
in the literature. Apparently some other factor 
must be found to account for these contrary 
trends. The first question that needs to be 
answered is: Do the more efficiently function- 
ing or the less efficiently functioning patients 
have the better outcome or is there no relation- 
ship at all? 

In several instances in which inefficient pre- 
treatment test performance was correlated 
with good outcome, it was observed that the 
majority of the patients under study were 
chronically ill (26, 35, 43, 44, 48, 84). It was 
thought that, since duration of illness is well 
known to be an important factor in prognosis, 
it is highly probable that radically different 
prognostic signs would apply to patients differ- 
ing in duration of illness. To test this hypothe- 
sis, data were gathered for both chronically ill 
psychotics (long duration of illness) and 
acutely ill psychotics (short duration of illness) 
on the Complex Reaction Time Test (79), a 
test which in essence provides a measure of the 
patient’s ability to react quickly and appropri- 
ately to complex tasks presented continuously. 

Comparisons of the relationships between 
initial CRT scores and final outcome status 
revealed divergent trends; initially, the improved
among the acutely ill patients had the
higher scores while the improved among the
chronically ill had the lower scores. The divergence
between these relationships was found
to be significant at the .05 level when tested
by analysis of variance. Even more striking,
individual analyses revealed that a cut-off
score of 30 was significantly differential for
both groups of patients, but in opposite directions.

³ These 28 studies include 7, 11, 18, 20, 22, 29, 30,
34, 38, 40, 47, 49, 50, 54, 55, 56, 57, 59, 60, 61, 63, 64,
66, 67, 70, 71, 75, 76.

FIG. 1. THE RELATION BETWEEN SCORE ON THE CRT TEST AND OUTCOME FOR ACUTE (EARLY) AND CHRONIC PATIENTS
(Panels: Acutely Ill, Chronically Ill; score axis: High, Low; legend: Improved and a second, illegible entry.)



An individual analysis of the scores revealed 
that in the chronically ill, none of those who 
later left the hospital had scores of 30 or more, 
whereas half of those who were still in the 
hospital at the time of the follow-up had 
scores above this critical point. In the acutely
ill, this same critical score was found signifi- 
cantly differential in the reverse order. In this 
case, all of the improved fell above the critical 
score of 30, while of the unimproved about one 
third fell below that point. 
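
The reversal of the sign across the two groups can be stated compactly. The following is a minimal sketch, not a validated decision rule: the function name `favorable_sign` and the example scores are hypothetical, and a score of exactly 30 is assumed to count as falling above the critical point.

```python
CUTOFF = 30  # critical CRT score reported in the text


def favorable_sign(crt_score, chronic):
    """Return True when the score lies on the side associated with improvement.

    Acutely ill: scores at or above the cut-off went with improvement.
    Chronically ill: scores below the cut-off went with improvement.
    """
    above = crt_score >= CUTOFF
    return (not above) if chronic else above


# Hypothetical scores, used only to show that the same score reads in opposite ways.
for score in (22, 30, 41):
    print(score,
          "acute:", favorable_sign(score, chronic=False),
          "chronic:", favorable_sign(score, chronic=True))
```

The point of the sketch is that the same score carries opposite prognostic meaning depending on chronicity, so a rule fitted on a mixed group could easily mislead.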

This study indicates that a possible explana- 
tion of the contradictions in the literature deal- 
ing with the use of psychological tests for prog- 
nosis may be the neglect of the factor of 
chronicity. Because relatively few studies have 
made a point of specifying this factor, it is 
hard to determine whether or not this variable 
can satisfactorily account for all the observed 
differences in results. But it can be seen to be
important in several cases, especially for those
studies using ability tests or the MMPI. 


Apparently most of the studies employed 
either groups of acutely ill psychotics or groups 
of psychotics of various degrees of chronicity. 
Since patients of longer duration typically 
have poorer prognoses and perform less efficiently
than acutely ill patients, it is to be expected
that a positive relationship between pretherapy
performance and outcome would be
found when the group under study consists of 
a mixture of chronically and acutely ill pa- 
tients. Many of the studies reporting an inverse 
relationship between performance level and 
outcome may have employed chronically ill 
patients only. The basis for the negative cor- 
relation between test score and outcome in such 
patients will be discussed later. 

The findings that different prognostic signs 
apply to metrazol and insulin therapy (68, 83) 
may also be accounted for on this basis, since 
the more severely ill patients were more likely 
to be treated with metrazol and the less 
severely ill with insulin. It must be remem- 
bered, however, that all of the conflicting con- 
clusions found in the literature cannot be ex- 
plained on this basis alone, especially since our 
studies have dealt only with psychotics. Fur- 
thermore, it is not to be expected that all con- 
flicts would be resolved with the control of 
merely this one factor, since other factors are 
undoubtedly of importance even if we have 
not as yet identified them and their contribu- 
tions to prognosis. 




There is little difficulty in accounting for
the prognostic findings with acutely ill psy- 
chotics since it is to be expected that patients 
capable of efficient performance would tend to 
have the better outcome. For the chronically 
ill, however, it is contrary to common sense 
to predict that psychotics in whom the devia- 
tion from normal efficiency is relatively great 
should have better outcomes than do psy- 
chotics less altered in performance ability by 
the disease process. Consequently, it was de- 
sired to study more thoroughly the 48 chroni- 
cally ill Columbia-Greystone patients (45) to 
verify the prognostic value of inefficient per- 
formance, and, if possible, to specify more 
clearly the nature and extent of this ineffi- 
ciency (85). 

Of the 44 pretreatment measures available, 
it was found that five differentiated between 
criterion outcome groups of 38 patients who 
had been cooperative enough to yield repre- 
sentative data on more than a third of the 
tests. In each case inefficient performance 
presaged favorable outcome. In general, it 
appeared that on conceptual tests those pa- 
tients who subsequently improved functioned 
less well than those who did not improve, 
but on perceptual tests the former functioned 
somewhat better. 

Type of therapy was unrelated to either out- 
come or prognostic indices, but, interestingly, 
was related to the psychiatric ratings of prognosis
made at the time of testing. Patients who
were expected to improve were found significantly
more often than the others to have had
brain operations, either immediately or during
the follow-up period. The psychiatric prognostic
ratings were found to be significantly related
to outcome when all the patients were considered,
regardless of testability, but were not
found to be related in the cooperative cases
alone.

This brings us to a discussion of the 10 uncooperative
patients. All but one of these patients
had an unfavorable outcome and a
diagnosis of hebephrenic schizophrenia. (This
one relation was, incidentally, the only connection
found between diagnosis and outcome.)
Apparently, it was on the basis of these
deteriorated cases that the psychiatric prognoses
achieved their relation with outcome in
the entire group. Furthermore, these patients
are apparently even less efficient than are
those who later improve. We are reminded of
Langfeldt’s (36) conclusion that both clever
patients and intellectually debilitated patients
have catastrophic outcomes. Apparently the
same situation exists in our study. Chronic
patients who performed relatively efficiently
and those from whom no representative performance
could be elicited had remained continuously
in, or had been permanently returned
to, the hospital during the more than three
years since testing. The hypothesized relationship
between levels of performance and the
stages of chronicity can best be shown graphically.

FIG. 2. HYPOTHETICAL RELATION BETWEEN PRETREATMENT OR INITIAL PSYCHOLOGICAL TEST PERFORMANCE
AND OUTCOME IN EARLY AND CHRONIC MENTAL PATIENTS
(Vertical axis: performance, arbitrary scale, with 100% marking the premorbid level; horizontal axis: stage of disease process, ending with the chronic uncooperative group; legend: Improved, Unimproved.)

Most patients may be expected to function 
somewhat below the normal in psychological 
tests, this decrement increasing with the 
chronicity of illness. In general, acutely ill 
patients function better than the more chron- 
ically ill and have better prognoses. Within 
the chronic group the better functioning seem 
to have rather poor prognoses, those of inter- 
mediate functioning have relatively good prog- 
noses, and those who seem deteriorated have 
very poor prognoses.

That even such global techniques as the
Rorschach might prove effective as prognostic
agents when the protocols they afford are 
investigated systematically and objectively 
without the restrictions imposed by orthodox 
scoring and interpretation is demonstrated by 
McCall’s study (40) of the preoperative proto- 
cols of the Columbia-Greystone patients (45). 
Utilizing a system of psychometric scales for 
scoring the responses (80), he found that of the 
35 scales which were applied, five showed a 
significant prognostic value in retrospect at the 
.05 level.⁴ Two scales in which high scores
might be regarded as reflecting perceptual 
clarity were positively correlated with out- 
come in a four-year follow-up (surface color 
and perception of plant life); three scales in 
which high scores might be regarded as re- 
flecting conceptual clarity were negatively re- 
lated to outcome (dehumanization tendency, 
popularity of percept, reaction time). On the 
basis of this retrospective study it may be 
tentatively concluded that the perceptually 
clear but conceptually confused patients had 
a better prognosis than the perceptually con- 
fused but conceptually clear patients. It should 
also be noted that not one of the orthodox 
scoring categories proved to be prognostic, 
though some derivatives of the orthodox sys- 
tem were found to have prognostic value when 
scaled psychometrically. 
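
How likely it is that five of 35 scales would reach the .05 level by chance alone can be checked with a binomial tail sum. The sketch below assumes, as the authors' footnote does, that the scales are independent; the helper name `binomial_tail` is introduced here for illustration.

```python
from math import comb


def binomial_tail(n, k, p):
    """Probability of k or more successes in n independent trials."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


# 35 scales, each tested at the .05 level; five came out significant.
p_five_or_more = binomial_tail(n=35, k=5, p=0.05)
print(f"P(at least 5 of 35 significant by chance) = {p_five_or_more:.3f}")
```

Under the independence assumption the result rounds to .03, in agreement with the figure the authors report. Correlated scales would change this probability, so the figure is best read as a rough guide.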


⁴ Obtaining five significantly differential scales
out of 35 at the .05 level of confidence has a probability
of .03 of occurring by chance if the scales are assumed
to be independent.


SUMMARY


A review of the literature on the use of psychological
tests for prognosis revealed an extreme
amount of variation and contradiction
in the indices considered prognostic. In an
attempt to partially explain these contradictions,
the factor of chronicity was investigated
in psychotics. The prognostic index for chron- 
ically ill psychotics was found to be the op- 
posite of that for acutely ill psychotics on the 
Complex Reaction Time Test. Further analysis 
of the preoperative data of the chronically ill 
psychotics revealed a certain amount of gen- 
erality for the finding that those who func- 
tioned well on psychological tests had a poor 
outcome. This prognosis applied regardless of 
therapy, but was limited to the cooperative 
patients. The uncooperative or deteriorated 
patients were almost always diagnosed as 
hebephrenic schizophrenics and had a very 
poor expectancy of remission. There was, then, 
a curvilinear relationship between performance 
and outcome in the chronic patients, those 
performing best and those unable to cooperate 
having a less favorable outcome than patients 
of low-level efficiency. 

It is important to emphasize that these 
findings have been exploratory and require 
validation from confirmatory studies before 
confidence can be placed in the conclusions. 
The fact that we have found support in the 
literature for our findings is of relatively little 
importance since so many different claims have 
been made that almost any hypothesis can find 
considerable agreement. It is, then, essential 
to confirm or refute our hypotheses with addi- 
tional patients in whom more factors can be 
controlled than was the case in our studies. 
This need for confirmation applies to most of 
the studies of psychological prognosis. 

Although it may be premature to speculate 
on the implications of our hypotheses concern- 
ing different prognostic signs for patients differ- 
ing in chronicity, it is not too soon to point out 
some of the criteria for a good prognostic 
study. First, it is important to control and
specify the conditions of the study. Among 
those which may be important are: (a) diag- 
nosis, (b) type of therapy, (c) duration of 
illness, (d) stage of disease, (e) time of follow- 
up, and (f) opportunities for, and criteria of, 
outcome. Second, what is most needed is con- 
firmation or refutation of existing claims of 
prognostic efficiency rather than new claims. 
Perhaps studies designed to test the range of 
conditions within which particular signs are 
prognostic would most satisfactorily establish 
the validity of previous claims. Last, the 
evaluation of the results should be statistically
acceptable, in contrast to the majority of
present-day reports of prognostic psychological
tests. 

When investigators begin to apply some of 
these criteria, it may be possible to place some 
degree of confidence in the reports of prog- 
nostic efficacy for psychological tests. 
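
The conditions (a) through (f) listed above lend themselves to a simple reporting checklist. The following sketch is one possible way to record them; the class name `StudyReport`, the field names, and the example values are all hypothetical and serve only to illustrate the idea.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass
class StudyReport:
    """Conditions (a)-(f) from the text; None means the report did not specify it."""
    diagnosis: Optional[str] = None
    type_of_therapy: Optional[str] = None
    duration_of_illness: Optional[str] = None
    stage_of_disease: Optional[str] = None
    time_of_follow_up: Optional[str] = None
    outcome_criteria: Optional[str] = None


def unspecified(report):
    """Names of the conditions a report leaves unstated."""
    return [f.name for f in fields(report) if getattr(report, f.name) is None]


# Hypothetical report, for illustration only.
example = StudyReport(diagnosis="schizophrenia",
                      type_of_therapy="insulin",
                      time_of_follow_up="3 years")
print(unspecified(example))
```

A reviewer could fill in one such record per study and count how often each condition goes unreported, which is the kind of tabulation the preceding sections carried out by hand.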


REFERENCES

1. Apter, D. L. Normal vs. schizophrenic perception
of similarities. Psychol. Bull., 1942, 39, 507-508.

2. Balinsky, B. Factors in the vocational adjustment
of schizophrenics after mental hospital discharge.
J. clin. Psychol., 1947, 3, 341-349.

3. Beck, S. J. Rorschach F plus and the ego in treatment.
Amer. J. Orthopsychiat., 1948, 18, 395-401.

4. Benjamin, J. D. A method for distinguishing and
evaluating formal thinking disorders in schizophrenia.
In J. S. Kasanin (Ed.), Language and
thought in schizophrenia. Berkeley: Univer. of
California Press, 1946. Pp. 65-90.

5. Billig, O., & Sullivan, D. J. Personality structure
and prognosis of alcohol addiction. Quart. J.
Stud. Alcohol., 1943, 3, 554-573.

6. Bolles, Marjorie M., Rosen, G. P., & Landis, C.
Psychological performance tests as prognostic
agents for the efficacy of insulin therapy in
schizophrenia. Psychiat. Quart., 1938, 12, 733-737.

7. Bradway, K. P., Lion, E. G., & Corrigan, H. G.
The use of the Rorschach in a psychiatric study
of promiscuous girls. Rorschach Res. Exch., 1946,
10, 105-110.

8. Carp, A. MMPI performance and insulin shock
therapy. J. abnorm. soc. Psychol., 1950, 45, 721-726.

9. Carp, A. Performance on the Wechsler-Bellevue
scale and insulin shock therapy. J. abnorm. soc.
Psychol., 1950, 45, 127-136.

10. Davidson, G. M., & Conkey, R. C. The Rorschach
test and the questions of "prognosis" and "recovery"
in syphilitic meningo-encephalitis. Psychiat.
Quart., 1950, 24, 243-258.

11. Dickson, L. P. The Rorschach test as an indicator
of prognosis in psychoneurosis. Unpublished
master’s thesis, Catholic Univer., 1949.

12. Earl, C. J. C. A psychograph for morons. J.
abnorm. soc. Psychol., 1940, 35, 428-448.

13. Eisner, E., & Orwison, W. D. An objective evaluation
of metrazol therapy. Proc. Amer. Psychiat.
Ass., Chicago, May, 1939.

14. Ellis, R. W., & Brown, G. G. The nature of
Rorschach responses from pulmonary tuberculosis
patients. J. clin. Psychol., 1950, 6, 298-300.

15. Feldman, M. J. A prognosis scale for shock therapy.
Amer. Psychologist, 1948, 3, 348. (Abstract)

16. Feldman, M. J. Some scales to predict and evaluate
the results of several types of shock therapy.
Unpublished doctor’s dissertation, Univer. of
California, 1949.

17. Fiedler, F. E., & Siegel, S. M. The Free Drawing
Test as a predictor of non-improvement in
psychotherapy. J. clin. Psychol., 1949, 5, 386-389.

18. Filmer-Bennett, G. T. A study of prognostic
indices in the Rorschach records of hospitalized
patients. Unpublished doctor’s dissertation,
Univer. of Pittsburgh, 1950.

19. Fisher, K. A. Changes in test performance of
ambulatory depressed patients undergoing
electro-shock therapy. J. gen. Psychol., 1949, 41,
195-232.

20. Graham, Virginia T. Psychological studies of
hypoglycemia therapy. J. Psychol., 1940, 10,
327-358.

21. Hales, W. M., & Simon, W. Minnesota Multiphasic
Personality Inventory patterns before
and after insulin shock therapy. Amer. J.
Psychiat., 1948, 105, 254-258.

22. Halpern, Florence C. Rorschach interpretation
of the personality structure of schizophrenics who
benefit from insulin therapy. Psychiat. Quart.,
1940, 14, 826-833.

23. Hamlin, R. Test pattern of high grade mentally
defective girls. Proc. Amer. Ass. ment. Def., 1938,
48, 161-165.

24. Harris, R. E. Measured personality characteristics
of convulsive therapy patients: a study of
diagnostic and prognostic criteria. Psychol. Bull.,
1945, 42, 535. (Abstract)

25. Harris, R. E. Psychodiagnostic testing in psychiatry
and psychosomatic medicine. In M. R.
Harrower (Ed.), Recent advances in diagnostic
psychological testing. Springfield, Ill.: Charles C
Thomas, 1950. Pp. 13-29.

26. Harris, R. E., Bowman, K. M., & Simon, A.
Studies in electronarcosis therapy. III. Psychological
test findings. J. nerv. ment. Dis., 1948, 107,
371-376.

27. Harris, R. E., & Christiansen, C. Prediction of
response to brief psychotherapy. J. Psychol.,
1946, 21, 269-284.

28. Kamman, G. R. The Rorschach method as a
therapeutic agent. Amer. J. Orthopsychiat., 1944, 14,
21-27.

29. Katz, H. Untersuchungen an insulinbehandelten
Schizophrenen mit dem Rorschach’schen
Formdeutversuch. Mschr. Psychiat. Neurol., 1941, 104,
15-33.

30. Kisker, G. W. A projective approach to personality
patterns during insulin-shock and
metrazol-convulsive therapy. J. abnorm. soc. Psychol.,
1942, 37, 120-124.

31. Klopfer, W. G. The efficacy of group therapy as
indicated by group Rorschach records. Rorschach
Res. Exch., 1945, 9, 207-209.

32. Klugman, S. F. Test scores of psychoneurotic
returnees before and after convalescent furlough.
J. clin. Psychol., 1948, 4, 195-200.

33. Kriegman, G., & Hilgard, Josephine R. Intelligence
level and psychotherapy with problem
children. Amer. J. Orthopsychiat., 1944, 14, 251-265.

34. Krugman, M. Rorschach examination in a child
guidance clinic. Amer. J. Orthopsychiat., 1941,
11, 503-511.

35. Landis, C., & Erlick, D. An analysis of the
Porteus Maze Test as affected by psychosurgery.
Amer. J. Psychol., 1950, 63, 557-566.


36. Langfeldt, G. The prognosis in schizophrenia and
the factors influencing the course of the disease.
London: Oxford Univer. Press, 1937.

37. Levi, J. Rorschach patterns as a tool in predicting
success or failure in the rehabilitation of the
physically handicapped. Amer. Psychologist, 1950,
5, 320-321. (Abstract)

38. Lourie, R. S., Pacella, B. L., & Piotrowski, Z.
A. Studies on the prognosis in schizophrenic-like
psychoses in children. Amer. J. Psychiat., 1943,
99, 542-551.

39. Low, A. A., Siwozn, H. D., & McCorry, C. L.
Life situation tests as aids in psychiatric prognosis.
Amer. J. Psychiat., 1939, 96, 147-162.

40. McCall, R. J. Psychometric evaluation of Rorschach
records in brain-operated patients.
Unpublished doctor's dissertation, Columbia Univer.,
1951.

41. Malamud, W., & Gorre, J. S. Course and prognosis
in the psychoneuroses. J. Mt. Sinai Hosp.,
New York, 1942, 9, 630-639.

42. Malamud, W., & Render, N. Course and prognosis
in schizophrenia. Amer. J. Psychiat., 1939,
95, 1039-1057.

43. Malamud, W., Sands, S. L., & Malamud, Irene.
The involutional psychoses: a socio-psychiatric
study. Psychosom. Med., 1941, 3, 410-426.

44. Malamud, W., Sands, S. L., Malamud, Irene
T., & Powers, P. J. P. The involutional psychoses:
a socio-psychiatric follow-up study. Amer.
J. Psychiat., 1949, 105, 567-572.

45. Mettler, F. A. (Ed.) Selective partial ablations
of the frontal cortex; a correlative study of its effects
on human psychotic subjects. New York: Paul B.
Hoeber, 1949.

46. Miles, H. H. W., Barrabee, Edna L., & Finesinger,
J. E. Evaluation of psychotherapy.
Psychosom. Med., 1951, 13, 83-105.

47. Morris, W. W. Prognostic possibilities of the
Rorschach method in metrazol therapy. Amer. J.
Psychiat., 1943, 100, 222-230.

48. O'Connell, J. J., & Penrose, L. S. Tests of
psychomotor efficiency in patients treated with
metrazol. J. ment. Sci., 1941, 87, 183-191.

49. Oppenheim, Sadi, & Brower, D. The effects of
electric shock therapy as revealed by the Rorschach
technique. Psychiat. Quart., 1948, 22 (supplement),
318-325.

50. Pacella, B. L., Piotrowski, Z., & Lewis, N. D.
C. The effects of electric convulsive therapy on
certain personality traits in psychiatric patients.
Amer. J. Psychiat., 1947, 104, 83-91.

51. Pearson, J. S. Prediction of the response of
schizophrenic patients to electro-convulsive therapy.
J. clin. Psychol., 1950, 6, 285-287.

52. Pearson, J. S., & Swenson, W. M. A note on
extended findings with the MMPI in predicting
response to electro-convulsive therapy. J. clin.
Psychol., 1951, 7, 288.

53. Peters, H. N. Traits related to improved adjustment
of psychotics after lobotomy. J. abnorm.
soc. Psychol., 1947, 42, 383-392.

54. Piotrowski, Z. The prognostic possibilities of the
Rorschach method in insulin treatment. Psychiat.
Quart., 1938, 12, 679-689.

55. Piotrowski, Z. Rorschach manifestations of im. 

$ provement in insulin treated i: 

: Psychosom. Med., 1939, 1, 508-526. 


JOSEPH ZUBIN AND CHARLES WINDLE 


56. Prorrowskt, Z. A simple experimental device f 
the prediction of outcome of insulin treatn 
in schizophrenia. Psychiat. Quart., 1940, 14, 67 
273. 

57. Prorzowsxr, Z. The Rorschach method as a pro 3, 
nostic aid in the insulin shock treatment of 
schizophrenics. Psychiat. Quart., 1941, 15, 8 
822 f 


Prorgowsxr, Z. The modifiability of personali 
revealed by the Rorschach method: methodo 
cal considerations. Rorschach Res. Exch., 1942, 
160-167. 

. Prorrowski, Z., & Lewis, N. D. C. An exp 
mental criterion for the prognostication of 
status of schizophrenics after a three-year int 
val based on Rorschach data. In P. H. Hoch ¢ 
J. Zubin (Eds.), Relation of psychological tests 
psychiatry. New York: Grune and Stratton, 19 
Pp. 51-72. Y 

Rasin, A. I. Effects of electric shock treatm 
upon some aspects of personality and intelli 
Amer. Psychologist, 1947, 2, 284. (Abstract) l 

- Rees, W. L., & Jones, A. M. An evaluation of the 

Rorschach test as a prognostic aid in the tre 

ment of schizophrenia by insulin coma thet 

electronarcosis, electro-convulsive therapy 


58. 


60. 


63. 


of psychotherapy. Unpublished doctor's disserta- — 
tion, Univer. of Iowa, 1950. m 
. RUESCH, J., CHRISTIANSEN, C., PETTERSON, L. Cy 
Dzwxzs, S., & Jacosson, A. Psychological in- 
validism in thyroidectomized patients. Psycho- 


66. 


67. ; I. W. Prognoses and psychological scores 
in electroconvulsive therapy, psychosurgery, and 
Spontaneous remission. Amer. J. Psychiat., 195! 
107, 926-931. 

68. Scunack, G. F., SHAKOW, D., & Livety, M. i 


Studies in insulin and metrazol therapy: I. The 
differential prognostic value of some psychologi- 
cal tests. J. Pers., 1946, 14, 106-124. i 
Scaormrp, W. Changes in responses to the Min- 
nesota Multiphasic Inventory following certain 
therapies. Psychol. Monogr., 1950, 64, No. 5 — 
(Whole No. 311). E. 
70. Sizczz, Mrrraw G. The diagnostic and prognostic _ 
validity of the Rorschach test in a child guidance - 
clinic. Amer, J. Orthopsychiat., 1948, 18, 119-133. 
. SIEGEL, S. M. Personality factors in psychothera- 
peutic improvement as identified by the Ror- 
schach test. Amer. Psychologist, 1951, 6, 341-342. 
(Abstract) d 
- SILLMAN, L. R. Chronic alcoholism, J. nerv, ment. D 
Dis., 1948, 107, 127-149. A 


69. 


73. 


74. 


T5. 


76. 


77. 


78. 


T9. 


PSYCHOLOGICAL PROGNOSIS or OUTCOME IN MENTAL DISORDERS 


Sroan, W. Prediction of extramural adjustment of 
mental defectives by use of the Rorschach test. 
J. consult. Psychol., 1948, 12, 303-309. 

Terry, G. C, & Rennt, T. A. C. Analysis of 
parergasia. Nerv. ment. Dis. Monogr. 1938, 
No. 64. 

Van Bark, B., & BARON S. Neurotic elements in 
the Rorschach records of psychotics. Rorschach 
Res. Exch., 1943, T, 166-168. 

WECHSLER, D., HALPERN, FLORENCE C., & Jaros, 
E. Psychometric study of insulin-treated schizo- 
phrenics. Psychiat. Quart., 1940, 14, 466-476. 

Wrtenson, D. Relation of subtest performance 
on the Wechsler-Bellevue Intelligence Scale to 
the effects of psychosurgery. Unpublished mas- 
ter’s thesis, Columbia Univer., 1950. 

WiwprE, C. Psychological tests in psychopatho- 
logical prognosis. Psychol. Bull., 1952, 49, 451- 
482. 

WiwpLE, C. & Hawwr, Vioret. An exploratory 
study of the prognostic value of the Complex 
Reaction Time Test in early and chronic psy- 
chotics. J. clin. Psychol., 1953, 9, 156-161. 


281 


80. ZUBIN, J. Quantitative techniques and methods in 
abnormal psychology. Unpublished manuscript, 
Columbia Univer., 1950. 

81. Zusix, J. Discussion of contributions dealing with 
diagnosis and prognosis of mental disorders. In 
P. H. Hoch & J. Zubin (Eds.), Relation of psycho- 
logical tests to psychiairy. New York: Grune & 
Stratton, 1951. Pp. 96-104. 

82. Zurn, J. Evaluation of therapeutic outcome in 
mental disorders. J. nerv. ment. Dis., 1953, 117, 
95-111. 

83. ZUBIN, J. & THOMPSON, JANE. Sorting tests in rela- 
tion to drug therapy in schizophrenia. New York: 
New York State Psychiatric Institute, 1941. 

84. ZUBIN, J., & Wrote, C. The prognostic value of 
the Metenym Test in a follow-up study of psy- 
chosurgery patients and their controls. J. din. 
Psychol., 1951, T, 221-223. 

85. ZUBIN, J., Wrote, C., & Hamwi, VioreT. Retro- 
spective evaluation of psychological tests as 

rognostic instruments in mental disorders. us 

Pers., 1953, 21 342-355. 


Received February 12, 1953. 


INDIVIDUAL CONFORMITY TO ATTITUDES OF
CLASSROOM GROUPS¹


WILBERT J. McKEACHIE 
University of Michigan 


AN INDIVIDUAL’S attitudes are influenced
to some extent by the groups of 
which he is a member. Evidence for 

this statement dates back to Moore's experi- 
ments in 1921, which demonstrated that 
people will reverse their judgments if told 
that they differ from those of the majority 
of the group. In a similar experiment Marple 
(7) found, on a measure of attitude toward a 
number of controversial issues, that his 300 
high school seniors, college seniors, and adults 
made over half of the possible changes of 
attitude toward the majority attitude as 
compared with about 15 per cent made by a 
control group not told the majority attitude. 
Both of these experiments provide evidence to 
support the hypothesis that the individual 
tends to adopt attitudes corresponding to 
those held by the majority of the group. This 
tendency has usually been termed
“conformity.”

The perceived group norm. What is the in- 
dividual conforming to? Most investigations 
of the relationship of attitudes to group norms 
have dealt only with conformity to the group 
norm as perceived by the experimenter (E). 
For example, Marple (7) found that some 
members of his group changed their attitudes 
toward those designated as the attitudes of 
the majority of the group, while other group 
members did not change their attitudes in 
this direction or changed away from the atti- 
tudes of the majority. Why did some indi- 
viduals change in one direction, some in 
another, and some not at all? 

We might improve our ability to answer this 
question if we knew what group members 
perceived the group norm to be—both at the 
time of the pretest and at the time of the 
posttest. It seems probable that even before 
the opinion of the majority was announced, the 

1 This article is based upon research conducted for a 
Ph.D. dissertation completed in 1949 and carried out 
under the direction of Professor Donald G. Marquis. 


Drs. Harold Guetzkow, Everett Bovard, and Mr. Lee 
Danielson assisted in various aspects of the research. 


group members had some vague perception of 
the opinion of the majority. We can then ex- 
plain failure to conform thus: not all members 
of the group had the same perception of the 
group norm when the pretest was given. When 
confronted with what purported to be the 
group norm, some group members saw that it 
lay in one direction from their original
perception² and changed their attitudes
accordingly; others saw that it lay in another
direction from their original perception and hence
changed their attitudes in the opposite direc- 
tion from the first group. Those who did not 
change may have found that the announced 
norm was in accordance with their original 
guess, or else failed to believe E when he 
announced a different one. Hence, what Marple 
discovered may not have been the "prestige 
value" of majority opinion, but rather the 
effect of a change or lack of change in a per- 
ceived group norm. 

A study of the factors involved in conformity 
will, then, be more definite if one studies con- 
formity to the perceived group norm, rather 
than to a norm perceived by E. We shall call 
this relationship between the individual's 
attitude and his perception of the group norm 
congruence, retaining “conformity” as the 
term referring to the relationship of attitude 
to the objective group norm (see Fig. 1). 
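
The distinction between congruence and conformity can be made concrete with a small numerical sketch. The scores below are hypothetical and are not data from this study. The sketch assumes that congruence is indexed by the Pearson correlation between members' own attitudes and their perceptions of the norm, and that conformity is indexed by the mean absolute distance of attitudes from the group mean; the choice of absolute differences is an assumption made only for illustration.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

def mean_distance_from_norm(attitudes):
    """Mean absolute difference of each score from the group mean (objective norm)."""
    norm = sum(attitudes) / len(attitudes)
    return sum(abs(a - norm) for a in attitudes) / len(attitudes)

# Hypothetical scale scores for five group members (not data from the study).
own_attitude = [30, 42, 35, 50, 38]
perceived_norm = [33, 40, 36, 44, 37]

# Congruence: how closely own attitude tracks the norm as each member perceives it.
congruence = pearson_r(own_attitude, perceived_norm)

# Conformity: how close attitudes lie to the objective norm (smaller = more conformity).
conformity_gap = mean_distance_from_norm(own_attitude)
```

A group may score high on one index and low on the other, since members can agree closely with a norm they have misperceived.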

The reference group—a matter of degree.
Newcomb (9) has extended our question, 
“what is the individual conforming to?” by 
introducing the term “reference group.”
According to his hypothesis, the individual tends
to conform to the norm, not necessarily of the 
groups of which he is a member at a given 

2 This theory assumes that when an individual takes
an attitude test in a group, the group acts as a con- 
straint upon him even if the group norm exists for him 
in only a vague way. An experiment by F. H. Allport 
(1) demonstrated that an individual’s judgments in a 
group were less extreme than his judgments alone. This 
indicates that the group norm exists for individuals, at 


least in some vague form, even when it has not been 
made explicit. 




[Figure 1: diagram with three columns (PRETEST, GROUP PROCESSES,
POSTTEST). The objective (“real”) group norm, the individual’s
perception of the group norm, and the individual’s behavior or
attitude at pretest are linked to the same three variables at
posttest by norm change, perceived norm change, and attitude
change, respectively.]

FIG. 1. VARIABLES INVOLVED IN CONFORMITY
EXPERIMENTS


moment, but to the norm of a group to which 
he refers his attitude. 

When is a group a reference group? The 
usual solution is to use an all-or-none classi- 
fication. Either the individual is a member of 
the group or he is not a member; either he 
uses a group as a referent for his attitude or 
he does not use that group as a referent. 

It seems probable, however, that a number 
of groups may be influencing an individual at 
any moment in time. Thus, two individuals 
who refer to the same group may be influenced 
to a different degree or in different directions 
by that group; therefore, quantitative meas- 
ures of the individual's group membership are 
needed. Those who have discussed group
identification or similar concepts (e.g., Krech
and Crutchfield [5], Festinger [4], and
Newcomb [10]) have indicated that group
membership, group identification, or group
belongingness are not simple all-or-none concepts.
Festinger, for example, defines “cohesion” as
“attraction of the group,” and has evidence
that the more cohesive the groups, the greater
the conformity to the group norms. Does this
hold true for norms not directly related to
group goals? Do the individuals who are most
attracted to the group show greater conformity?

Conformity and group process. In any dis- 
cussion of the influence of groups upon the 
individual's behavior, Lewin's classic group 
decision experiments (6) inevitably come 
to mind. Lewin and his associates found that 
women who had held a discussion and then
raised their hands to indicate that they would 
serve certain desired foods did serve the foods 
to a much greater extent than women who had 




simply listened to a lecture inducing them to 
serve the foods. Other experiments showed that 
in bringing about changes in behavior, this 
procedure, which Lewin called group decision, 
was also superior to discussion without de- 
cision or to individual instruction. 

Group decision evidently is an effective 
method of influencing behavior. Conformity 
to the group norm is high. But even though no 
norm was announced, conformity to the group 
norm of not serving the desired foods was 
even higher in the lecture groups. Obviously 
more research is needed to isolate the effects 
of differing group procedures upon conformity. 
Does the discussion before the group decision 
promote greater tolerance for deviation from 
the norm, or does it help in mobilizing group 
pressures toward uniformity? 


HYPOTHESES 


In order to fill some of the gaps in our
knowledge about conformity, an experiment was
devised to test the following hypotheses: 

1. Attitude shifts of group members are 
positively correlated with changes in their 
perceptions of the group norms. (A group 
member’s perception of the group norm is 
defined as his estimate of the attitude of 
“most of the group.”) 

It should be pointed out that the confirma- 
tion of this hypothesis will not tell us the 
direction of causation. As we shall see in the 
discussion section, such a correlation may have 
two explanations. The purpose of testing this 
hypothesis is simply to provide evidence that 
anyone interested in conformity should pay 
attention to group members' perceptions of 
group norms. 

2. There will be a higher positive correlation 
between attitudes of group members and their 
perceptions of the group norms in groups in 
which there is a greater liking for the group 
than in groups in which there is less liking 
for the group by group members. Corollary: 
The greater an individual's liking for a group, 
the greater the congruence between his atti- 
tudes and perceived group norms. 

3. The correlation between group members’ 
attitudes and perceptions of the group norm 
will be lower after participating in a group 
decision preceded by a group discussion than 
after listening to a lecture and writing an 
essay about the problem. 

This hypothesis is based on the assumption 




that group discussion weakens forces toward
congruence, which group decision can only 
partially restore. It does not contradict 
Lewin's results. As we stated earlier, con- 
gruence was probably high in Lewin's lecture 
groups. But it does contradict the assumption 
that discussion contributes to congruence. 


METHOD 


In order to test our hypotheses we need to create 
groups whose members differ between groups in the 
degree of liking for the group. While this might con- 
ceivably be done in groups meeting but once, it seems 
desirable to have groups meeting over a period of time, 
especially since we need to subject each group to
differing procedures for changing attitudes. For these reasons,
and because of their availability, the experiment was 
carried out in elementary psychology classes at the 
University of Michigan. 

The measures of the attitudes used were Wang and 
Thurstone’s Attitude-toward-the-Treatment-of-Crimi- 
nals scale scored by Likert’s technique, Koch’s Attitude- 
toward-the-Freedom-of-Children scale scored by the 
Likert technique, and Likert’s Attitude-toward-the-
Negro scale.

Each of these tests was given as a pretest during the 
second and third weeks of the semester. Each student 
was asked to check his own attitude and to indicate by 
a zero the position on each item which “most of the 
class will check.” Approximately at the end of each 
month one of the three procedures for arriving at group 
norms described below was used in each section in ac- 
cordance with the experimental design (see Table 2). A 
week after the experimental treatment of each topic, 
the students took the attitude scale related to the topic, 
following the same procedure of indicating their own 
attitudes and the attitude of the class. Thus pre- and 
posttest scores for the attitudes of students and their 
perceptions of the attitude of “most of the class” were 
available. 

Students for the six sections involved in the experi- 
ment were not especially selected from the total en- 
rollment of the elementary psychology course at the 
University of Michigan. Each section consisted of 25 to 
35 undergraduate students with the largest number 
coming from the sophomore class of the literary college. 
Students enrolling for sections at these hours were 
assigned to these sections alternately, i.e., the first stu- 
dent registering was assigned to one section, the second 
student to a section taught by the alternative method. 
Students were not told that they were participating in 
an experiment. Each section met for one-hour periods 
three times weekly for a semester. 

Differences in cohesiveness. Three instructors each 
agreed to teach their two sections of the elementary 
psychology course in different ways. In order to build 
up differences in liking for the group and feeling of 
membership in the group, they agreed that their tech- 
niques should differ in (a) opportunity of class members 
to know other members of the class, (b) amount of 
direct interaction between class members, and (c) num- 
ber of decisions which the class would be allowed to 

make about its own goals and procedures.




TABLE 1
MEMBERS’ LIKING FOR THE GROUP IN CLASSES
TAUGHT BY DIFFERENT METHODS

Group                      Mean   SD     N
Leader-centered classes    2.0    1.55   [illegible]
Group-centered classes     3.0    1.56   64

Specific procedures used were as follows: 

a. Members of the experimental or group-centered 
class introduced themselves at the first meeting and 
each member made a seating chart identifying other 
members of the class. In the control or leader-centered 
class only the instructor possessed a seating chart and 
only he introduced himself. 

b. In group-centered classes the instructor referred 
as much as possible of the discussion from student to 
student and refrained from interrupting student ex- 
changes. In control groups he commented upon or an- 
swered each student participation, so that interactions 
between students were mediated through him. 

c. In group-centered classes the instructor gave 
direction at the beginning of the semester, but as the 
semester progressed, he referred more decisions to the 
group. Thus students in group-centered sections made 
group decisions on their assignments, the number and 
dates of tests, and even on having class and breakfast 
together in an especially reserved lunchroom. The 
decisions made in the experimental groups about tests 
and assignments were also carried out in the control 
groups in which the instructor simply announced the 
assignments and tests. Thus assignments and tests were 
the same in both groups.

These procedures were effective in producing the 
desired differences. On a scale on which students rated
from —5 to 5 their dislike or liking for the group, 
students in the group-centered classes expressed 
significantly greater liking for their groups (p < .01). 
These results are presented in Table 1. 

Differences in group process of arriving at norm. The 
reason for using different techniques in presenting the 
attitude was to analyze more carefully some of the
factors involved in group decision experiments. 

The first technique used was that of an open vote. I 
call this “group decision.”³ Students discussed the
problem, preliminary votes were taken on the alterna- 
tive solutions to the problem presented by the instruc- 
tor, and compromises made until agreement on a solu- 
tion could be reached. Observers recorded the facts and 
arguments used in the discussion, and these formed the 
content of the lecture given in the groups which used 
the other two techniques. The order of procedures in
each group is indicated in Table 2. 

The second procedure was the giving of informa- 
tion and arguments on both sides of a problem in a 
lecture by the instructor. Following the lecture each 
student spent 10 to 12 minutes writing an essay on one 
of the six suggested alternative statements of attitude 
toward the problem. 


3 It should be pointed out that my group decisions
were in reference to attitudes. Usually this term refers 
to decisions about behavior rather than attitudes. 




TABLE 2
DESIGN OF EXPERIMENT

                                       INSTRUCTORS
PROCEDURES                             A           B           C

LEADER-CENTERED SECTIONS
Lecture and secret vote                Children    Negro       Criminals
Lecture and result of vote announced   Criminals   Children    Negro
Group decision                         Negro       Criminals   Children

GROUP-CENTERED SECTIONS
Lecture and secret vote                Children    Negro       Criminals
Vote announced                         Criminals   Children    Negro
Group decision                         Negro       Criminals   Children

Children = attitude toward the freedom of children; Negro = attitude
toward the Negro; Criminals = attitude toward the treatment of criminals.

TABLE 3
CORRELATION OF SHIFT OF ATTITUDE WITH CHANGE
IN PERCEPTION OF GROUP NORM

Test                                          r      N
Attitude toward the freedom of children       .350   121
Attitude toward the treatment of criminals    .422   121
Attitude toward Negroes                       .391   137


On the class day following the lecture, students were 
told, “Most of the students in this class chose the 
following alternative ...." In all cases the alternative 
chosen by the group decision was also that chosen by
the majority in this, which I shall refer to as the “vote 
announced,” group. This technique was designed to 
test the effect of making a group norm explicit. 

The third technique was identical with the “vote
announced” technique except that the results of the
vote were not announced. I shall call this the “lecture”
group.

Thus, information and arguments about the subject 
were equivalent for all three methods. The last two 
methods were equivalent in all respects except an- 
nouncement of the norm, and the norm announced was 
the same in the first two procedures.

Design. Since it would be inadvisable to treat each 
attitude by each method in each section, i.e., treat each 
topic three times in each section, it was not possible to 
use an ordinary factorial design. In a situation such as 
this, the latin-square design permits the maximum of 
sources of variance to be isolated. However, this
experiment as laid out in Table 2 is not a simple latin
square but is actually a double latin square with
[remainder of sentence illegible in source].

Hypotheses were tested by computing the significance
of differences between the within-groups correlations
derived by analysis of covariance.⁴


RESULTS AND DISCUSSION 


Probably the first question one asks,
although this was not the primary focus of the
experiment, is, “Did the variables produce
shifts of attitude?” Significant changes in
attitude toward the treatment of criminals and in
attitude toward Negroes had occurred, a
change which may not be startling but is all
too rare in the record of the effect of teaching
upon attitudes.⁵

Relationship of shift of attitude to change of 
perceived group norm. We have seen that the 
students liked their groups. In fact, only one 
student expressed dislike for the group. 
Thus we have a situation in which we would 
expect the individual’s attitude to be in- 
fluenced positively by the group norm. In 
the introduction, I suggested that in such a 
situation the important variable is the sub- 
ject’s perception of the change in the group 
norm. If E considers only the objective group 
norm, he fails to account for many changes 
in scores while the norm has remained con- 
stant, or must invoke genii called “degree of 
suggestibility” or “contrasuggestibility.” 

As we predicted, correlations of the shifts 
in attitude of individuals with changes in 
perceived group norm were significantly dif- 
ferent from zero (p < .01). These correlations 
are given in Table 3. 
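
As a rough check on the significance statement, the reported correlations can be converted to t statistics. This is a simplified sketch: it treats each r in Table 3 as an ordinary Pearson correlation with N - 2 degrees of freedom, whereas the correlations in this study were within-groups values derived by analysis of covariance, so the exact degrees of freedom would differ. The critical value used is the usual tabled two-tailed .01 value for about 120 degrees of freedom.

```python
from math import sqrt

def t_for_correlation(r, n):
    """t statistic for testing a Pearson r against zero, with n - 2 df."""
    return r * sqrt(n - 2) / sqrt(1 - r ** 2)

# (r, N) pairs as reported in Table 3.
table_3 = {
    "freedom of children": (0.350, 121),
    "treatment of criminals": (0.422, 121),
    "Negroes": (0.391, 137),
}

# Approximate two-tailed .01 critical value of t for about 120 df (standard tables).
T_CRIT_01 = 2.62

t_values = {name: t_for_correlation(r, n) for name, (r, n) in table_3.items()}
all_significant = all(t > T_CRIT_01 for t in t_values.values())
```

Under these simplifying assumptions all three t values exceed the critical value by a wide margin, which agrees with the reported p < .01.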

This finding indicates that our ability to 
predict attitude shifts is improved by con- 
sidering group norm perceptions. However, 
this finding does not show how this relation- 
ship is determined. It seems that either or 

4 Formulae for treating this design were developed by 
Professor Paul S. Dwyer, Consultant in the Statistical 
Research Laboratory of the University of Michigan. 

5 Another interesting question is this: how is con- 
gruence affected by the attitude involved? Taking the 
correlations between attitude and perceived group norm 


separately for the three tests used, we find that the


correlation is significantly less (p < .02) for attitude 
toward the Negro than for attitude toward the treatment 
of criminals. The correlation for attitude toward the 
freedom of children is not significantly different from 
either of the other two. This result, while peripheral to 
our main findings, is probably a good illustration of the 
fact that a given group does not have the same effect 
on all attitudes. In addition to the relevance of attitude 
to group functioning, some attitudes are undoubtedly 
more difficult to change because they were learned in 
the family or other important reference groups. 




both of the following processes are involved: 
(a) An individual who shifts his attitude 
projects his own attitude shift onto the 
group and tends to perceive the other group 
members as having changed similarly. (b) 
An individual whose perception of the group's 
attitude changes tends to shift his own attitude 
to maintain a similar relationship to the group 
norm. 

Further experimentation to reveal the exact 
operation of these processes should give us a 
clearer understanding of the traditional prob- 
lem of the "prestige" influence of majority 
opinion. 

The generality of this research is limited by 
the fact that all but one of the students in 
this experiment were either neutral or posi- 
tively oriented toward their groups as indi- 
cated by their responses on the liking for the 
group scale. Nevertheless, in some cases even 
the individual who is negatively oriented to- 
ward the group may shift his attitude in the 
same direction as he perceives the group's 
attitude to be changing. Ordinarily we think 
of the individual who is negatively oriented 
toward the group as shifting his attitude in 
the opposite direction from the group norm. 
Actually the direction of the shift may depend 
upon the situation. If the change in the per- 
ceived group norm increases the distance be- 
tween the individual and the disliked group, his 
attitude may not shift. But, if the perceived 
group norm shifts toward him, he may shift 
his attitude in the same direction to maintain 
the same degree of nonconformity. 

Relationship of attraction-to-group and congruence.
Our second hypothesis predicted that
in the classes in which there was more liking 
for the group and feeling of membership in 
the group there would be a higher correlation 
between attitudes and perceived group norms. 
This prediction about congruence on the
posttest was not only not verified, but as the data
in Table 4 indicate, if I had used a two-tailed
test, congruence would have been significantly
lower in the group-centered classes.
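
The remark about a two-tailed test can be illustrated with Fisher's r-to-z transformation applied to the two correlations in Table 4. This is a sketch under stated assumptions and not the procedure used in the study, which compared within-groups correlations derived by analysis of covariance. It treats the two correlations as coming from independent samples of the sizes shown in Table 4. The function name `compare_independent_correlations` is hypothetical.

```python
from math import atanh, erfc, sqrt

def compare_independent_correlations(r1, n1, r2, n2):
    """Fisher r-to-z test for two correlations from independent samples.

    Returns the z statistic and its two-tailed normal p value.
    """
    z1, z2 = atanh(r1), atanh(r2)          # Fisher transformation of each r
    se = sqrt(1 / (n1 - 3) + 1 / (n2 - 3))  # standard error of z1 - z2
    z = (z1 - z2) / se
    p_two_tailed = erfc(abs(z) / sqrt(2))   # two-tailed area under the normal curve
    return z, p_two_tailed

# Posttest congruence values as reported in Table 4:
# leader-centered r = .636 (N = 180), group-centered r = .370 (N = 207).
z, p = compare_independent_correlations(0.636, 180, 0.370, 207)
```

Under these assumptions the difference is well beyond conventional two-tailed significance levels, in the direction opposite to the one-tailed prediction.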

That congruence is not a simple function of 
cohesiveness is also indicated by the finding 
that within the groups the degree of liking 
for the group was not significantly related to 
the degree of congruence. 

Despite the fact that our hypothesis about 
congruence was not confirmed, students in




TABLE 4
POSTTEST CONGRUENCE IN LEADER-CENTERED AND
GROUP-CENTERED CLASSES

Group                      r      N
Leader-centered classes    .636   180
Group-centered classes     .370   207

TABLE 5
MEAN DIFFERENCE OF ATTITUDE FROM GROUP NORM

                     ATTITUDE TESTS
Group              CHILDREN   NEGRO   CRIMINALS
Leader-centered    6.2        5.6     7.9
Group-centered     7.8        3.6     6.5

F = 5.4 for 1 and 252 df.
p < .05.


group-centered classes actually did conform 
to the group norm⁶ more closely than did
students in leader-centered classes. (See Table
5.)⁷

How can these results be explained? 

One group of variables is composed of fac- 
tors affecting one's perception of the group 
norm. Here we have such factors as objective 
clarity of the norm, misperceptions due to 
personality defenses, etc. On the surface it 
would appear that these variables cannot alone 
account for our result because our groups were 
not significantly different in the accuracy 
of their perception. 

Another group of variables is that having 
to do with the relationship between one's 
own attitude and the perceived group norm, 
or congruence. Festinger and his colleagues 
have shown cohesiveness to be of importance 
in determining conformity. Because the mem- 
bers of a cohesive group are more strongly 
motivated to remain members, we would ex- 
pect greater fear of rejection and, conse- 
quently, greater conformity to group norms. 
Our group-centered classes were more cohesive 
than our leader-centered classes (using
Festinger’s definition of cohesion as “attraction of
group”). Yet these group-centered classes
showed less congruence than the leader-centered
classes. How can we explain this?

6 “Group norm” here refers to the mean score of the
group on an attitude test.

7 The significance tests for this table and Table 7
should be interpreted with caution. Since Bartlett’s test
indicated that the assumption of homogeneity was not
justified, a log transformation was applied. However,
even after this transformation, Bartlett’s test was
significant at the .05 level.


TABLE 6
CONGRUENCE AFTER DIFFERENT GROUP
PROCESSES WERE USED FOR
CONSIDERING A PROBLEM

PROCEDURE         r      N
Lecture           .577   132
Vote announced    .498   108
Group decision    .320   90

We have already suggested that need to be 
accepted by the group is one of the major 
motives for conformity. But groups differ 
in the degree to which nonconformity is 
punished. In a group such as ours, in which 
there has been a good deal of interaction be- 
tween members, the group member should 
be able to develop a fairly good idea of what 
behavior the group will reward, what it will 
ignore, and what it will punish. Perhaps a 
good deal of his feeling of security in a group 
depends upon his knowledge of these limits. 
It seems probable that in most democratic 
groups the pattern of rewards and punishments 
is such that the group member will learn to 
cooperate on issues where uniformity of
behavior is necessary to group progress.
However, such groups are likely to permit or even
reward individual variation on problems 
which require individual rather than group 
action. This ability to differentiate between 
areas where conformity is necessary and where 
it is not necessary may not only be a measure
of the security of the individual group member 
but also, when summed for the whole group, 
may be an important dimension related to 
the group’s effectiveness in problem solving. 

Relationship to congruence of procedures used
in arriving at norm. Our third hypothesis stated
that congruence will be less after a group
decision than after a lecture. As indicated in Table
6, our results showed that congruence was 
significantly lower (p = .03) following group 
decision than following a lecture. 

Again this result may be interpreted in
terms of rewards and punishments involved in
the group process. If we assume that one of
the primary motives for a group member's
conformity is his need to gain acceptance or
avoid rejection by the group, the function of a
discussion becomes more apparent. If the dis- 
cussion is one in which the group member 
hears many divergent attitudes expressed 
and if these deviations are tolerated by the 
group, the forces toward conformity will be 
weakened. On the other hand, if conforming 
statements are rewarded and deviation results 
in rejection, the forces toward conformity 
will be increased. Our group discussions pre- 
ceding group decisions were extremely per- 
missive, and it is not surprising that con-
gruence was reduced.

Let us turn now to the factors which result 
in a discrepancy between the "real" group 
norm and the perceived group norm. These, 
too, help us to understand the effectiveness of 
group decision. 

One of these factors is the ambiguity or 
"clarity" of the norm. We know from many 
studies that the individual's needs and past 
experiences are involved in his perception of a 
social situation. The more ambiguous the 
situation the more these individual factors 
enter into perception. Thus, an ambiguous 
norm can easily be interpreted differently by 
each group member. The result is a low degree 
of conformity. 

Too often we have assumed that if E told a 
group, “the group norm here is so and so,” 
each member would perceive the norm in the 
same way. Unfortunately, not all subjects 
trust psychologists, and if their needs to dis-
believe in a particular norm are strong, they
are likely to dismiss the announcement.
Consequently, they maintain conformity to a
group norm which is more agreeable to them.
This, I think, is one of the explanations for 
the high degree of congruence in our “lecture” 
procedure, where the norm was relatively 
vague and unknown. 

Nevertheless, the objective situation is an 
important factor in perception, and one of the 
features of group decision is that it makes the 
group norm clearly perceptible to members of 
the group. While congruence was low in our 
“group decision” procedure, students’ atti- 
tudes in this group were actually significantly 
closer to the "real" group norm than in the 
other techniques (see Table 7). These students 
were more accurate in their perception of the 
group norm as indicated in Table 8. While the 
discussion had weakened the pressure felt by
the individual to align his attitude with that of
the group, the vote had made clear the norm 
to which he was relating, and the resulting 
conformity was thus greater than in other 
groups. 

Thus our findings indicate that clarity of 
the norm is an important factor in conformity 
and that a permissive discussion weakens 
tendencies to conform. 

With these findings Lewin's "group dis- 
cussion" experiments may be more clearly 
interpreted. Lewin describes three phases of 
group decision—unfreezing, change of level, 
and freezing at a new level. 

The phase of "unfreezing" is accomplished 
by lessening the forces toward conformity to 
the old norm. If one of the forces toward con-
formity is the threat of nonacceptance by the
group when one diverges from the norm, letting 
individuals present varying points of view 
and accepting these divergent opinions with- 
out punishment should remove some of the 
fear of diverging from the norm. 

The step, "change of level" of behavior, 
requires strengthening forces directed toward 
the new level or reduction of forces directed 
away from the new level. In the women's 
groups in which Lewin was attempting to 
change food habits, the “unfreezing” discussion 
was accompanied by dietitians’ recipes and 
information aimed at weakening or removing 
some of the forces which had been preventing 
serving of the experimental foods. Thus, 
forces toward serving the experimental foods 
became proportionately stronger during the 
discussion. 

“Freezing behavior at the new level” in- 
volves reinvoking forces to conformity to the 
new norm. This is accomplished by a hand
vote and in at least one of Lewin’s experiments 
by the knowledge that a check on behavior 
would be made by E. In terms of my empha- 
sis upon the perceived norm, I interpret this 
as a method of making the new norm clearly 
perceived and unambiguous. Unanimity in 
the decision is important, therefore, not only 
because it makes the norm clearer, but because 
it re-emphasizes the necessity for conformity 
if group acceptance is to be obtained. 

In the light of this theory, Lewin’s lecture
groups were unsuccessful in changing behav-
ior because change of level was attempted
without unfreezing. As a result, conformity to
the old standard of behavior was very high.


TABLE 7
MEAN DIFFERENCE OF ATTITUDE FROM GROUP NORM

                         ATTITUDE TESTS
PROCEDURE          CHILDREN    NEGRO    CRIMINALS
Lecture 7.7
Vote announced 7.0
Group decision 5.6

F = 2.66 with 2 and 252 df.
p < .10.


TABLE 8
MEAN DIFFERENCE OF PERCEIVED GROUP NORM FROM
GROUP NORM

                         ATTITUDE TESTS
PROCEDURE          CHILDREN    NEGRO    CRIMINALS
Lecture              6.7        7.6        7.9
Vote announced       5.9        4.4        5.1
Group decision       5.8        4.4        4.4

F = 5.91 with 2 and 252 df.
p < .01.

Lewin’s discussion groups were less suc- 
cessful than group decision because although 
unfreezing and changes of level were carried 
out, the new group norm was not made clear 
and perceptions of the norm probably varied. 
In addition, in both groups the forces for con- 
formity were probably weakened if varying 
viewpoints were accepted during the discus- 
sion. 

The group-decision technique was successful 
because all three phases had been carried out. 
In addition, it is possible that the discussion 
before the group decision may actually have 
increased the attractiveness of the group, 
which may have helped counteract the effect 
of lack of punishment of divergent viewpoints. 


SUMMARY 


1. To study the relationship of the indi- 
vidual’s attitudes to group norms, experimen- 
tal classroom situations were set up involving 
three sets of variables: (a) The relationship
between attitude change and changes in per- 
ception of the group norm. (b) The relation- 
ship between attraction to the group and 
“congruence” of attitudes and perceived 
group norms. (c) The effect of different group 
processes used in considering a problem upon
congruence.

2. Attitude changes were found to be sig-
nificantly correlated with changes in group
norms.

3. Classes taught by a group-centered tech- 
nique created greater member-liking for the 
group than leader-centered classes, but con- 
gruence was less in group-centered classes. 

4. A group-decision technique resulted in 
less congruence but greater conformity than a 
lecture. 

5. The findings are interpreted in terms of a 
theory emphasizing the importance of the 
distribution of rewards and punishments ad- 
ministered by the group for conformity and 
the discrepancy between the objective group 
norm and the perceived group norm. 


REFERENCES 


1. Allport, F. H. Social psychology. Boston: Houghton
Mifflin, 1924.

2. Bovard, E. W., Jr. Group structure and perception.
J. abnorm. soc. Psychol., 1951, 46, 398-405.

3. Bovard, E. W., Jr. The experimental production
of interpersonal affect. J. abnorm. soc. Psychol.,
1951, 46, 521-528.

4. Festinger, L. Informal social communications.
Psychol. Rev., 1950, 57, 271-282.

5. Krech, D., & Crutchfield, R. Theories and
problems of social psychology. New York: Mc-
Graw-Hill, 1948.

6. Lewin, K. Group decisions and social change. In
T. Newcomb & E. Hartley (Eds.), Readings in
social psychology. New York: Henry Holt, 1947.
Pp. 330-344.

7. Marple, C. H. The comparative suggestibility of
three age levels to the suggestion of groups vs.
expert opinion. J. soc. Psychol., 1933, 4, 176-186.

8. Moore, H. T. The comparative influence of ma-
jority and expert opinion. Amer. J. Psychol.,
1921, 32, 16-20.

9. Newcomb, T. M. Attitude development as a func-
tion of reference groups. In M. Sherif (Ed.), An
outline of social psychology. New York: Harper,
1948. Pp. 138-155.

10. Newcomb, T. M. Social psychology. New York:
Dryden, 1950.


Received February 12, 1953. 


PROJECTIVE METHODS AND VERBAL LEARNING 


BARBARA NORFLEET COHN 
Harvard University¹


The projective techniques are commonly
believed to reveal dominant motiva- 
tional forces. They are believed to 
perform this function because the subject (S) 
is given "several degrees of freedom to organize 
a plastic medium in his own way, and since 
little external aid is provided from conven- 
tional patterns, he is all but obliged to give 
expression to the most readily available forces 
within himself" (14, p. 215). Two dimensions 
seem to be involved: (a) freedom, or a large 
number of response alternatives, and (b) 
conventionality, or degree of "familiarity." 
Actually, when viewed from these two dimen- 
sions, the projective techniques are not uni- 
formly “free” nor “unfamiliar.” The Rorschach 
seems to be at the free, unfamiliar poles; the 
TAT is located more at the free, quite familiar 
poles. Word-association tests are like the TAT 
in this respect. Tachistoscopic tests of per- 
ceptual sensitivity run the gamut of all 
possible combinations of the two dimensions. 
Of course it is realized that many of the 
verbal responses to projective tests represent 
mere familiarity or recency. It would be sur- 
prising to encounter a medical student who 
did not use some anatomical vocabulary in 
his imaginative productions. Thus, in tachisto- 
scopic studies, a very powerful variable, 
word frequency, demonstrates that much so- 
called “projective material” is merely a 
reflection of vocabulary habits. How, then, can 
we say that motivational forces are revealed 
by such projective techniques? The answer 
seems to lie partly in the specification of both 
the freedom-for-responding granted to S, 
and in the degree of familiarity of the stimulus
material presented to S. Probably the rest of
the answer lies in the hedonic or motivational
associations related to the stimulus material
as well as to the responses required of S.
These problems of specification can be readily
restated in terms of an S-R analysis of verbal
learning, and they suggest a defining experi-
ment.

¹ This research was supported by the Laboratory of
Social Relations, Harvard University. The author is
particularly indebted to Professor Richard L. Solomon
for many helpful suggestions in the planning and
carrying out of this study. Drs. Leo Postman and D.
O. Hebb suggested many ideas and improvements.


For this task two assumptions are neces-
sary: (a) familiarity of stimulus materials
and frequency of prior exposure are almost 
synonymous, and (b) the aftereffects of 
reward and punishment give stimulus materials 
a “motivational character" that mere fre- 
quency of repetition or exposure cannot give. 


The number of repeated presentations of
verbal materials, i.e., frequency (7, 11, 13),
and the rewarding, punishing, or neutral
aftereffects of each presentation, i.e., conse-
quence (2, 8, 9, 10), are known to affect the
amount of verbal learning. Further, it is 
known that the various methods of measuring 
verbal learning and retention, such as recog- 
nition and free recall following a constant 
learning procedure, give differing quantitative 
values (1, 4, 5, 6). However, it is not known 
how the different methods of measuring verbal 
learning interact with the systematic varia-
tion of antecedent conditions such as fre-
quency and consequence. This paper presents
data on the following questions: 

1. Are some methods of measuring verbal 
learning more sensitive than others in differ- 
entially revealing the effects of the antecedent 
conditions of frequency and consequence? 

2. Are there common dimensions among 
methods of measuring verbal retention which 
might account for differing sensitivities of 
this kind? 


It is clear that these two questions are
referable to the original problem concerning
the mode of operation of projective procedures.
If we vary degree of familiarity of verbal 
stimuli jointly with consequence (reward or 
punishment or neutrality), we are controlling 
S's relationship to certain stimulus materials. 
If we vary the methods of measuring verbal 
learning, we, in fact, vary S’s freedom of 
response; e.g., the recall test offers fewer
restraints than does the recognition test. 
In seeking to discover how the antecedent 
conditions (frequency and consequence) inter-
act with measuring procedures, we elicit a
further question: what are the conditions most
conducive to the revelation of the effects of
consequence and the motivational substrate 
of verbal responding? It is not clear to us 
whether we are asking about the character- 
istics of a "good" projective test; but that 
question is interesting to ask in the light of 
this study, and one answer is, most certainly, 
that the "goodness" of such a test is con- 
ditional upon the frequency and consequence 
of antecedents. 


METHOD 
The Learning Situation 


In order to demonstrate simultaneously the effects 
of frequency and consequence on verbal learning, as 
well as the role of the various measurement procedures 
in coloring the apparent effects of these two inde- 
pendent variables, it was necessary to subject all Ss 
to the same initial learning situation. The Ss were first 
exposed to the materials which they would later be re- 
quired to use in one of the measurement situations 
(test situations). However, E's actual purpose in carry- 
ing out the experiment was kept from the Ss. Each S 
was told that the experiment was on language and pro- 
nunciation; that in the course of language-training re- 
search it had been discovered that students had great 
difficulty in pronouncing foreign words; and that E 
was studying these findings. The S was told that the 
words he would be shown were from an Arabic dialect, 
and that his job was to pronounce each word as well 
as he could. He was also told that some words were
made up by E merely to see how S pronounced them
and that there was no correct or incorrect pronuncia-
tion of these words.

Each S was shown seven-letter nonsense words, one at
a time. Each word was typed on its own file card, and
was held up in front of S until S pronounced it. The S
was immediately rewarded or punished for his pro-
nunciation; or, if the word was one of those of the class
stated to have been made by E, neither reward nor
punishment was administered. The time interval be-
tween pronunciations varied with the ease of pronun-
ciation of a given nonsense word. The Ss’ responses
to neutral and punished words were somewhat more
delayed than were responses to rewarded words. The
words were made up in such a way that repeating letter
patterns and the associative strength with English
words were kept at a minimum. The cards were
shuffled before testing each S.

Each nonsense word had assigned to it a fixed fre-
quency of occurrence in the pack of word cards. There
was a total of 246 word cards: Six words appeared
once, six words appeared five times, six words ap-
peared ten times, and six words appeared 25 times. The
S's pronunciation of each nonsense word was followed
immediately by one of these consequences: He was
told that his pronunciation was correct, and he was
rewarded with a poker chip (exchangeable for money
at the end of the experiment); he was told that his
pronunciation was incorrect, and he was punished by
forfeiture of a poker chip to E; he was neither rewarded
nor punished, and no poker chips changed hands. The
effects, or consequences of pronunciation, were actu-
ally applied to a specific word according to a prepared
design, and the administration of reward and punish-
ment had no relation to the pronunciation behavior of
the Ss. Permutated across these three conditions were
the four frequency conditions.


TABLE 1
EXPERIMENTAL DESIGN AND STIMULUS WORDS

                                  FREQUENCY
CONSEQUENCE       1            5            10            25
Rewarded       jasonup      tecugaw      mehiroj       puhasej        8
               gimadaz      dofukal      fonaped       rimakeb
Punished       hebosam      kuvalog      labeduf       pikoven        8
               nevucog      wudizom      silucer       vozimet
Neutral        rujowet      zakebif      zupajic       sujefiv        8
               batiwos      gewirac      noguvis       howanil
               6 words      6 words      6 words       6 words       24
Exposures     6 X 1 = 6   6 X 5 = 30   6 X 10 = 60   6 X 25 = 150   246

The assignment of particular nonsense words to 
frequency-consequence categories was varied from S 
to S. The general design is shown in Table 1. Eight 
words were followed by reward, eight were followed by 
punishment, and eight were followed by neutral con- 
sequences. Actually, in order to control for possible 
inherent differences in difficulty of pronunciation and 
retention, the words were randomly assigned to each 
cell of the experimental design for each S. Order of 
presentation was randomized by shuffling the pack of 
word cards between Ss. If a word was found to follow 
itself, this word was randomly placed into another 
position in the pack. This randomization procedure, 
while not as acceptable as a systematic factorial de- 
sign, was more practical because it imposed no mini-
mum number of Ss to complete the design. The ran-
domization procedure prevented study of order effects
and effects due to inequivalence of the nonsense words. 
The design allowed study of only the effects of fre- 
quency and consequence. 
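
The construction of the pack described above can be sketched in code. This is a minimal illustration under stated assumptions, not the author's procedure: it uses the word-to-cell assignment printed in Table 1 (which was in fact re-randomized for each S), and the function names and the particular way a repeated card is moved are hypothetical.

```python
import random
from collections import Counter

# Words from Table 1 grouped by frequency of exposure. This is the
# illustrative assignment shown in the table; the study varied it per S.
WORDS_BY_FREQUENCY = {
    1: ["jasonup", "gimadaz", "hebosam", "nevucog", "rujowet", "batiwos"],
    5: ["tecugaw", "dofukal", "kuvalog", "wudizom", "zakebif", "gewirac"],
    10: ["mehiroj", "fonaped", "labeduf", "silucer", "zupajic", "noguvis"],
    25: ["puhasej", "rimakeb", "pikoven", "vozimet", "sujefiv", "howanil"],
}


def has_adjacent_repeat(deck):
    """True if any card is immediately followed by the same word."""
    return any(a == b for a, b in zip(deck, deck[1:]))


def build_deck(rng):
    """Shuffle the 246 cards, then move any card that follows itself."""
    deck = [word
            for frequency, words in WORDS_BY_FREQUENCY.items()
            for word in words
            for _ in range(frequency)]
    rng.shuffle(deck)
    while True:
        repeats = [i for i in range(1, len(deck)) if deck[i] == deck[i - 1]]
        if not repeats:
            return deck
        # Assumed rule: take the offending card out and reinsert it at a
        # randomly chosen position, then scan the pack again.
        card = deck.pop(repeats[0])
        deck.insert(rng.randrange(len(deck) + 1), card)


if __name__ == "__main__":
    deck = build_deck(random.Random(0))
    print(len(deck), sorted(set(Counter(deck).values())))
```

Because the pack is reshuffled for every S, order of presentation is not a controlled factor, which is why order effects cannot be studied in this design.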


The Test Situation 


Immediately after the 246 word presentations and 
pronunciations, each S, without prior warning, was
exposed to one test situation. Learning appears to take 
place when an S seems to be strongly motivated to 
accomplish some task other than the memorizing of 
the material in question; thus, these test situations may 
be considered to measure incidental learning or its 
analogue in animal experimentation, “latent learning.” 
(In this experiment the existence of the incidental or 
latent-learning phenomenon is not at question.) 

The Ss were divided into four groups for testing 
purposes. (a) Ten Ss were asked to recall all the words 
they could remember from the exposure situation. This
is the free recall test, and the score was the number of
the 24 possible words correctly recalled. 

(b) Ten Ss were asked to recognize words from the 
exposure situation contained in, or buried in, a list 
containing 24 unfamiliar, but similarly constructed 
nonsense words. This is the recognition test, and the 
score was the number of words correctly identified, 
corrected for guessing. 

(c) Ten Ss were asked to recognize words exposed 
tachistoscopically. This is the tachistoscopic recognition 
test. Among these words were the 24 from the exposure 
situation; 24 new, unfamiliar, but similarly con- 
structed nonsense words; and 12 seven-letter English 
words. They were presented for threshold determina- 
tion in random order. The score for each word was the 
visual duration threshold (in .01 sec.) for correct recog- 
nition, using the ascending method of limits, a recog- 
nition criterion of two successive correct responses, 
and exposure-increase steps of .01 sec. 

(d) Ten Ss were given anagrams of the 24 words from 
the exposure situation. The anagrams contained the 
seven letters needed to compose a word from the ex- 
posure situation, but letter order was scrambled. This 
was the anagram-reconstruction test, and the score was 
the number of anagrams correctly solved, with a 3-min. 
time limit for each anagram. Each S was tested indi- 
vidually in both the learning situation and test situa-
tion.

The following limitations of the procedure should be
kept in mind: The anagram test usually took a little 
over an hour to complete, while the recall test took 
approximately 10 min.; the tachistoscopic recognition 
test took over 2 hrs. in some cases. Thus, there could 
well be differential forgetting confounded with the dif- 
ferent testing conditions. Such differences are not, 
however, confounded with the independent variables. 
Secondly, recency effects were left partly uncontrolled 
since the pack of word cards was shuffled between Ss. 
Other conditions being equal, the high frequency words 
should be more often among the most recent words.
This will be true of any random series of stimuli where
frequency of stimulus classes is varied. A prohibitive 
number of Ss would be required in order to control 
for the confounding of frequency and recency in this 
experiment. However, three “dummy” words were 
placed at the end of the pack of cards to help minimize 
recency effects.
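
The size of this confounding can be gauged from the composition of the pack alone. The sketch below is illustrative arithmetic only: it ignores the rule against a word following itself and the three dummy cards, and all names in it are hypothetical.

```python
from fractions import Fraction

FREQUENCIES = (1, 5, 10, 25)   # exposures per word in each frequency class
WORDS_PER_CLASS = 6            # six words at each frequency (Table 1)

TOTAL_CARDS = sum(f * WORDS_PER_CLASS for f in FREQUENCIES)


def class_share(frequency):
    """Chance that the card at any given position (for example, the last
    one in a shuffled pack) belongs to the given frequency class."""
    return Fraction(frequency * WORDS_PER_CLASS, TOTAL_CARDS)


if __name__ == "__main__":
    for f in FREQUENCIES:
        print(f, round(float(class_share(f)), 3))
```

Under these assumptions a card near the end of the pack is far more likely to carry a frequency-25 word than a frequency-1 word, which is the recency advantage the text describes.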


RESULTS 
The role of frequency. The mean learning
scores as a function of frequency are given in
Table 2 and are plotted in Figs. 1 and 2.
The tachistoscopic data were transformed 
into logarithms to reduce the distortion in- 
duced by deviant high thresholds. Clearly, 
duration thresholds vary inversely with fre- 
quency of prior exposure, while recall, ana- 
gram, and recognition scores vary positively 
with frequency. Of the 48 cells, the score in 
only one fails to go in an orderly direction 
(recall, frequency of 10, consequence, neutral). 
TABLE 2
MEAN SCORES IN THE FOUR TEST SITUATIONS

                              FREQUENCY
CONSEQUENCE       1         5         10        25        MEAN

RECALL
Rewarded          0*        .40       .70†      1.0       .58
Punished          0*        .40       .50**     1.4       .58†
Neutral           0         .30       .10       1.0       .35
Mean              0*        .37       .43**     1.13      .48

ANAGRAMS
Rewarded          0         .30       .80       1.30      .60
Punished          0*        .50       .90       1.30      .68
Neutral           0         .30*      .90       1.10      .58
Mean              0*        .37       .87*      1.23      .62

TACHISTOSCOPIC RECOGNITION (IN HUNDREDTHS OF SECONDS)
Rewarded          .27       .22*      .14       .12       .19
Punished          .33       .23       …         .12       .22
Neutral           .38       …         …         …         .23
Mean              .33       …         …         .12       .21

RECOGNITION (CORRECTED FOR GUESSING)
Rewarded          .35**     1.44      1.58      1.88      1.31
Punished          .18**     1.51      1.78      1.88      1.34
Neutral           .35**     1.38      1.68      1.88      1.32
Mean              .29**     1.44**    1.68      1.88      1.32

Note. The percentage of correct words to total possible cor-
rect words may be easily computed for the recall, anagram, and
recognition tests: Dividing any cell by 2 will give the mean per-
centage of correct words for that cell.

* t tests on individual scores indicate that this frequency
score is significantly (.05 level) lower than the score obtained
by the adjacent ascending frequency.

** t tests on individual scores indicate that this frequency
score is significantly (.01 level) lower than the score obtained by
the adjacent ascending frequency.

† t tests on individual scores indicate that this consequence
(punished or rewarded) is significantly (.05 level) higher than the
score obtained by neutral.


The role of consequence. The mean scores
as a function of consequence are also given in
Table 2. Means for consequence for all fre-
quency conditions are plotted in Fig. 3.
The results for consequence are not as clear
as were the results for frequency. The results
of t tests indicate that the effects of conse-
quence are significant only in the recall test:
for the difference between rewarded and neutral
words at a frequency of 10, and for the difference
between punished and neutral words for
all frequencies. There were too many empty
cells to do an analysis of variance in the recall
and anagram tests, but an analysis of variance
on individual scores in the recognition and
tachistoscopic tests demonstrated that fre-
quency was highly significant and that conse-
quence was not significant.


FIG. 1. MEAN SCORES ON THE TESTS AS A FUNCTION
OF FREQUENCY OF PRIOR EXPOSURE
TO THE VERBAL MATERIALS
[Abscissa: frequency. Curves: recognition, anagrams, recall.]
The smooth curves are theoretical.


FIG. 2. MEAN LOGARITHM OF VISUAL DURATION
THRESHOLDS FOR VERBAL MATERIALS AS A FUNC-
TION OF FREQUENCY OF PRIOR EXPOSURE TO THE
MATERIALS
[Ordinate: mean log duration threshold. Abscissa: frequency.]
The smooth curve is theoretical.


FIG. 3. MEAN SCORES ON ALL TESTS AS A
FUNCTION OF THE THREE CONSEQUENCE
CONDITIONS
[Ordinate: mean score. Abscissa: consequence. Curves: recall, anagrams, recognition, T. (short), T. (long).]

The tachistoscopic test scores have been divided
into scores for long-flash durations and short-flash
durations. The function for short-flash durations falls
directly on that for free recall, and so two separate
curves could not be plotted.


Since significant differences for consequence 
are found only in the recall test, a more 
sensitive test is probably needed to show the 
relative effects of consequence for the four 
measuring situations, however slight the 
effects may be. The total of rewarded- and
punished-consequence mean scores as a
percentage of total mean score is such an
index.² If consequence has no effect, one
would expect the percentage to be 66⅔%,
since two-thirds of all words presented are
either rewarded or punished. The obtained 
percentages were 75.9% for the recall test, 
68.9% for anagrams, 67.7% for the tachisto- 
scopic test, and 66.7% for the recognition 
test. 
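
The index can be reproduced from the cell means of Table 2. The sketch below does so for the anagram and recognition tests, whose cells are fully legible in the table; whether the author worked from cell means or from rounded row means is an assumption here, and the function and variable names are illustrative only.

```python
# Cell means from Table 2 at frequencies 1, 5, 10, and 25.
ANAGRAMS = {
    "rewarded": [0.00, 0.30, 0.80, 1.30],
    "punished": [0.00, 0.50, 0.90, 1.30],
    "neutral": [0.00, 0.30, 0.90, 1.10],
}
RECOGNITION = {
    "rewarded": [0.35, 1.44, 1.58, 1.88],
    "punished": [0.18, 1.51, 1.78, 1.88],
    "neutral": [0.35, 1.38, 1.68, 1.88],
}


def consequence_index(cells):
    """Rewarded plus punished score as a percentage of the total score."""
    totals = {name: sum(values) for name, values in cells.items()}
    followed = totals["rewarded"] + totals["punished"]
    return 100.0 * followed / sum(totals.values())


# With no effect of consequence, two of the three classes would
# contribute two-thirds of the total score.
NULL_EXPECTATION = 100.0 * 2 / 3

if __name__ == "__main__":
    print(round(consequence_index(ANAGRAMS), 1))
    print(round(consequence_index(RECOGNITION), 1))
    print(round(NULL_EXPECTATION, 1))
```

Values above the two-thirds baseline indicate that rewarded and punished words contributed more than their share of the score.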


DISCUSSION


Replies to the two questions posed earlier 
in this article may now be attempted: 

1. Are some methods of measuring verbal 
learning more sensitive than others in re- 
vealing the effects of antecedent conditions? 
Figure 1 demonstrated that, regardless of the 
test used to measure memory, frequency of 
prior exposure has a homologous effect on 
what is manifestly remembered. The shapes 


294 


TABLE 3 

K CONSTANTS FOR THE Four Test SITUATIONS 
FREQUENCY 
Test 

1 5 10 25 MEAN 
Recall — .039 .024 .033  .032 
Anagrams — .039 .056 .038  .044 
Tachistoscopic .084 .089 .098 .085  .089 
Recognition .147 .223 .169 .107  .162 


of the curves relating the frequency variable 
to test score are surprisingly similar, although 
their intercepts and asymptotes depend on 
the specific characteristics of each test. All 
these curves are fitted reasonably well by the 
=n log i-e 
function H = M — Me . This repre- 
sents a growth curve: H is the mean score for 
any given frequency; M is the physiological 
maximum (in all the tests but the tachisto- 
Scopic test this equals 2 and each S could 
achieve a perfect score of 2 at any given 
frequency); e is 10; n is the frequency; and K 
is the measuring constant for each test K is 
a constant fraction of the difference between 
M and the score of the preceding frequency. 
Thus, the function rises to the physiological 
M by increments H, — H, which are a 
constant fraction K of the remaining growth 
potential M — H,;. K may be thought of as 
an index of associative strength needed for 
performance. 
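
As a rough check on the K constants, the growth function can be inverted for K. The sketch assumes the function has the form H = M − M·10^(−n log₁₀[1/(1−K)]), which reduces to H = M[1 − (1 − K)^n]; the helper names are hypothetical, and small discrepancies from the tabled constants are to be expected because the published means are rounded.

```python
def growth_score(n, k, maximum=2.0):
    """Score predicted after n exposures: M * (1 - (1 - K)**n)."""
    return maximum * (1.0 - (1.0 - k) ** n)


def solve_k(score, n, maximum=2.0):
    """Invert the growth function for K, given a mean score at frequency n."""
    return 1.0 - (1.0 - score / maximum) ** (1.0 / n)


# Mean recognition scores from Table 2, keyed by frequency.
RECOGNITION_MEANS = {1: 0.29, 5: 1.44, 10: 1.68, 25: 1.88}

if __name__ == "__main__":
    for n, mean in RECOGNITION_MEANS.items():
        print(n, round(solve_k(mean, n), 3))
```

Run on the recognition means, the recovered constants fall within a few thousandths of the recognition row of Table 3, which supports this reading of the equation.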

Table 3 gives K constants for the four test
situations. Although the tachistoscopic score,
recorded in terms of time, requires some
modification of the equation,⁴ the equation
does allow a comparison of this situation with
the other three test situations. Thus, we find
that the different test situations do not appear
to distort the effects of frequency of prior
exposure. All functions of the frequency
variable are approximately fitted by the same
equation with different constants for the
various tests.

Since no continuum exists for consequence
in the present design, no curves can be com-
pared. But consequence, unlike frequency,
does indicate that some methods of measuring
verbal learning are more sensitive than other
methods in revealing its effects. Recall is
most sensitive and recognition least sensitive.

2. Are there common dimensions among
methods of measuring verbal retention which
might account for differing sensitivities as to
the effects of consequence? One relationship
between the different methods of measuring
verbal retention and the effects of consequence
is clear: As the mean score for the measure-
ments increases, the effects of consequence
become harder to demonstrate. Recall, with
the lowest mean score, varies significantly
with the consequence variable; while recogni-
tion, with the highest mean score, does not
do so. This indicates that the "easier" the
measuring test is for S, the less will the effects
of the consequence variable have an oppor-
tunity to become manifest. Other conditions
being equal, two dimensions appear to deter-
mine what makes a test simpler rather than
more difficult: (a) The amount of stimulus
pattern common to the learning and test
situations, and (b) the sample size of possible
responses available to S in the test situation
(in information terminology, the “ensemble”).
The measured effect of consequence varies
inversely with the former, and directly with
the latter.⁶

² Since a high score in the tachistoscopic test indi-
cates an absence of learning, it was necessary to trans-
form these scores. The method used was complex, and
cannot be described here.

³ See Hull, C. (3, ch. VIII), for a detailed explana-
tion of this equation. The present analysis of frequency
is similar to his analysis of the relationship between
habit strength and number of reinforcements.

⁴ The equation for the tachistoscopic situation is
$Y = (Y_0 - M')e^{-n \log \frac{1}{1-K}} + M'$; Y = score in terms
of mean logarithms of time; $Y_0$ is the intercept of the
function on the Y axis; M′ = an approximation of the
point at which the function becomes asymptotic to
some value of Y greater than zero (the value of 1.33
was found to be a good approximation here). The
remaining symbols have identical meanings to the
equation used for the other test situations. This equa-
tion is identical to the one used above. The range of
H alone is altered. The author will be glad to send to
anyone who is interested a detailed explanation of how
it was worked out.

⁵ Bearing in mind likelihood of error in points found
by extrapolation, we can use the equation for the growth
curve and the measuring constant for each test (K) to
make some estimate of the frequencies at which the
curves for the other conditions would reach 1.88/2.00
of the maximum (Ss in the recognition test, the easiest
of the tests used, made this score at a frequency of 25).
Solving this equation for n, and substituting the proper
K constants, gives the following estimates of the fre-
quencies theoretically needed in the other test situa-
tions to reach the score attained in the recognition
test. Recall would need a frequency of 87, anagrams
63, and the tachistoscopic test 31. These predictions
are not very solid. Massing, intraserial effects, degree
of similarity between words, etc. would certainly cause
the data to deviate from the theoretical curve. But
these computations do suggest how much more diffi-
cult are some test situations than others.
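
The extrapolation described in footnote 5 can be checked by solving the growth function for n. This is a sketch under assumptions: the function is taken as H = M[1 − (1 − K)^n], the mean K constants of Table 3 are used, and the result is rounded up to the next whole exposure; the names are illustrative.

```python
import math

# Mean K constants from Table 3 for the three harder tests.
K_MEAN = {"recall": 0.032, "anagrams": 0.044, "tachistoscopic": 0.089}


def exposures_needed(k, target=1.88, maximum=2.0):
    """Frequency n at which M * (1 - (1 - K)**n) reaches the target score."""
    return math.log(1.0 - target / maximum) / math.log(1.0 - k)


if __name__ == "__main__":
    for test, k in K_MEAN.items():
        print(test, math.ceil(exposures_needed(k)))
```

Rounded up in this way, the estimates agree with the frequencies of 87, 63, and 31 given in the footnote.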

The implications of these findings for 
projective techniques are interesting. The 
projective test can be seen as an extreme 
example of a lack of specific stimulus pattern 
common to the test situation and any previous 
learning situation, and the sample size of 
possible responses available to S is almost 
completely uncontrolled. Thus, one would 
expect material of this type to elicit verbal 
responses which are closely related to the 
motivations of S. Frequency or familiarity, 
on the other hand, would not have much 
influence in revealing effects of S’s motiva- 
tions. Words of low frequency were not re- 
called any more easily when they had been 
followed by reward and punishment than 
were words of high frequency. As found experi- 
mentally, the important factors are (a) the 
lack of external aid, and (b) the freedom to 
give any response in the test situation, rather 
than the familiarity or unfamiliarity of the 
test materials themselves. This finding helps 
us to understand why the TAT and word- 
association tests may be successful projective 
techniques in spite of the high familiarity of 
the material with which they deal. 
In addition, there is a good check on this
interpretation to be found in the tachistoscopic
test. This situation covers the entire
range of conditions from recall to recognition
in the sense that very short exposure durations
are similar to the recall test (very little of the
stimulus pattern from the exposure situation


5 It should be noted that these dimensions might be
[illegible] in any particular test situation by stacking
cards deliberately against them. For instance, in
a recognition test one could ask S to choose the
“correct” words from a group of words so similar to
the original that the amount of stimulus pattern in
common between the exposure and test situations
would be relatively unimportant in determining
[illegible]. In the present experiment the predictions
from these two dimensions are confounded, but an
experiment could be designed to isolate them from one
another. For example, one could enlarge the sample
size of possible responses by placing the word to be
recognized in a list of 1,000 different words. Such
possibilities do not, however, erase the relevance of these
two dimensions in interpreting the results of our
experiment.




is present in the test situation), and very 
long exposure durations are similar to the 
recognition situation (most of the stimulus 
pattern from the exposure situation is present 
in the test situation). But how should we 
decide what is a low threshold and what is a 
high threshold in the tachistoscopic situation? 
This decision can be made in a somewhat 
arbitrary, but not unreasonable, way. The 
mean score for recall was .48, which represents 
correct recall of 24% of the words; therefore, 
a visual duration threshold to represent recall 
was chosen to include the lowest 24% of the 
individual tachistoscopic scores. This thresh- 
old was .05 sec. The threshold for recog- 
nition was chosen in the same way. The mean 
score for recognition was 1.32, or correct 
recognition of 66% of the words. The cor- 
responding visual duration threshold was .10 
sec, a value which included 66% of the
threshold scores. These scores were then
studied for the effects of consequence. The
equivalent tachistoscopic recognition scores 
showed no effects of consequence, and the 
percentage of rewarded and punished mean 
scores to total mean score was 67.9, as com- 
pared with 66.7 for pure recognition. But, the 
equivalent recall scores showed a significant 
difference between punished and neutral 
words (t = 2.6; df = 9; p = .05), and the 
percentage of rewarded and punished mean 
scores to total mean score was 75.9% which 
is identical to the percentage found for pure 
recall. Therefore, the tachistoscopic pro- 
cedure produced data which behaved as 
though it were proceeding from recall to 
recognition memory. It is interesting that 
some writers have labeled the procedure a 
perceptual test, whereas here it appears 
entirely reasonable to talk about it as a 
learning measure. 
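
The choice of equivalent thresholds described above can be sketched as a simple percentile rule. The following is a minimal illustration only: the function name and the list of fifty threshold scores are hypothetical and are not the experimental data; only the mean scores (.48 and 1.32 out of a possible 2.0) and the resulting thresholds (.05 and .10 sec) come from the text.

```python
import math

def threshold_for_percent(durations, percent):
    """Smallest exposure duration that includes at least `percent` % of the scores."""
    ordered = sorted(durations)
    k = math.ceil(percent * len(ordered) / 100)  # number of scores to include
    k = max(1, min(k, len(ordered)))
    return ordered[k - 1]

MAX_SCORE = 2.0
recall_percent = round(100 * 0.48 / MAX_SCORE)       # mean recall score .48
recognition_percent = round(100 * 1.32 / MAX_SCORE)  # mean recognition score 1.32

# Hypothetical duration thresholds in seconds (NOT the original data),
# arranged so that the cut points fall where the text reports them.
durations = [0.03] * 5 + [0.05] * 7 + [0.08] * 10 + [0.10] * 11 + [0.15] * 17

recall_threshold = threshold_for_percent(durations, recall_percent)
recognition_threshold = threshold_for_percent(durations, recognition_percent)
print(recall_percent, recall_threshold)
print(recognition_percent, recognition_threshold)
```

The rule is admittedly arbitrary, as the text says; its only purpose is to match the proportion of successes on the tachistoscopic scale to the proportion obtained on each comparison test.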

These data indicate that there may be a 
middle ground of agreement in the controversy 
over the relative contributions of word frequency
and motivational factors in determining
ease of perception of stimuli presented
under difficult viewing conditions. As the 
tachistoscopic situation approaches a pro- 
jective test, in terms of allowing a wide range 
of possible responses in the presence of little 
external aid from the stimulus material, 
verbal responses tend to reflect motivational 
factors; but as it becomes more restricting, 
frequency tends to be the overruling influence. 




The consistency of these findings would 
indicate that the different methods used 
here to measure verbal learning do differ in 
their sensitivity in revealing the effects of the 
antecedent condition of consequence. The 
tests can be ordered according to two dimen- 
sions which help to explain these differences. 

One may wonder why consequence does 
not “show up” more in this experiment. It is 
possible that the test situation employed did 
not enable S to use the words in such a way 
as to make the consequences from the learning 
situation “meaningful.” A final test was run 
which required that ten Ss recall the words, 
but recall them in such a way that the conse- 
quences of a word would be important. In 
place of free recall, S was asked to recall 
those words which he would “feel most con- 
fident in using if he visited Arabia"; those he 
would "feel least confident in using"; and 
those he would not “know about." The S 
was told to use all words he had formerly 
pronounced, and the three classifications were 
requested in different orders from S to S. 
As a precautionary measure, S was told the 
three classes before he was allowed to give 
any response. As expected, consequence had 
fairly large effects in this case: between 
punished and neutral words, t = 3.9 which
at 9 df has a p value between .02 and .01;
between rewarded and neutral words, t = 2.1
which at 9 df is just short of the .05 level.
The percentage of rewarded and punished 
mean score to total mean score was 77.6 in 
comparison to 75.9 for the recall test. Clearly, 
this test was more difficult for S, as indicated 
by the lowest mean score yet attained, .41 
out of a possible 2.0. 

The methods used here to measure verbal
learning indicate that, as far as the effects of 
consequence are concerned, tests differentially 
measure the learning (or memory) process. 
It may be that Thorndike (9, 10) found con- 
sequence to be of such great importance 
because he used the free recall test almost 
exclusively to measure verbal learning. 
However, Tolman (12) found that consequence 
may not be very important in determining
what is learned, but his rats were placed in a 
"recognition" situation. It may be that the 
kind of measurement used is even more funda- 


BARBARA NORFLEET COHN 


mental: When performance is measured (S's 
particular pronunciation of a word), conse- 
quence may be a strong determinant; but 
when learning, or memory, is measured 
(scores in later test situations) consequences— 
or the law of effect—may be a fairly weak 
determinant. Such would be the case in this 
experiment; Ss did repeat correct pronuncia- 
tions and change incorrect pronunciations 
during the learning situation, but they learned, 
and remembered, a large number of both 
punished and neutral words. Thorndike fre- 
quently measured "performance" rather than 
learning or memory. This interpretation 
might be accepted by proponents of latent 
learning, but it should be remembered that, 
as the test situations became more difficult, 
consequence did tend to have manifest effects 
on what was tested. 


SUMMARY 


This experiment investigated the question 
of whether some methods of measuring verbal 
learning were more sensitive than others in 
revealing the effects of frequency and con- 
sequence. It was found that the effects of 
frequency were homologous, regardless of the 
test used to measure verbal learning. The 
methods were not equivalent in revealing the 
effects of consequence (reward and punish- 
ment). The more difficult the particular test 
used to measure verbal learning, the lower 
was S's absolute score and the more likely 
to appear were the effects of reward and 
punishment. The implications of these findings
for students of projective techniques and
investigations of social perception were indicated.


REFERENCES 


1. Achilles, E. M. Experimental studies in recall and
recognition. Arch. Psychol., 1920, 6, No. 44.

2. Eisenson, J. Confirmation and information in
reward and punishment. Arch. Psychol., 1935,
27, No. 181.

3. Hull, C. L. Principles of behavior. New York: D.
Appleton-Century, 1943.

4. Jenkins, W. O., Postman, L., & Postman, Dorothy
L. An experimental comparison of active
recall and recognition. Amer. J. Psychol., 1948,
61, 511-519.

5. Krueger, W. C. F. The effect of overlearning on
retention. J. exp. Psychol., 1929, 12, 71-78.




6. Luh, C. W. The conditions of retention. Psychol.
Monogr., 1922, 31, No. 3 (Whole No. 142).

7. Robinson, E. S. Association theory today. New
York: D. Appleton-Century, 1932.

8. Rock, R. T. The influence upon learning of the
quantitative variation of after-effects. Teach.
Coll. Contr. Educ., 1935, No. 650.

9. Thorndike, E. L. Human learning. New York:
Appleton-Century-Crofts, 1931.

10. Thorndike, E. L. The fundamentals of learning.
New York: D. Appleton-Century, 1932.




11. Tolman, E. C. Purposive behavior in animals and
men. New York: D. Appleton-Century, 1932.

12. Tolman, E. C. The law of effect. Psychol. Rev.,
1938, 45, 200-203.

13. Watson, J. B. Behavior: an introduction to comparative
psychology. New York: Holt, 1914.

14. White, R. W. Interpretation of imaginative productions.
In J. McV. Hunt (Ed.), Personality
and the behavior disorders. New York: Ronald
Press, 1944. Pp. 214-246.


Received March 4, 1953. 


A TRANSVALUATION OF PSYCHOTHERAPY 
A REPLY TO HANS EYSENCK¹


SAUL ROSENZWEIG 


Departments of Psychology and Neuropsychiatry 
Washington University, St. Louis, Missouri 


IT HAS been said that men will believe the
impossible but not the improbable. The
former demand evokes the sense of fancy,
of wonder, and of faith; the latter, a set to
judge on the basis of everyday, ponderable 
experience. With this thought in mind we read 
the actuarial summary of Eysenck's (2, p. 322) 
recent evaluation of the effects of psycho- 
therapy: "Patients treated by means of psy- 
choanalysis improve to the extent of 44 per 
cent; patients treated eclectically improve 
to the extent of 64 per cent; patients treated 
only custodially or by general practitioners 
improve to the extent of 72 per cent. There 
thus appears to be an inverse correlation be- 
tween recovery and psychotherapy; the more 
psychotherapy, the smaller the recovery rate." 
We are therefore to infer that psychotherapy 
is less effective than no psychotherapy. Are 
we expected to believe the improbable—and 
on the basis of statistics? 

One might be content to rest the matter 
there but for the insightful though militant 
statement of Isaac Ray (5, p. 67), pioneer of 
American psychiatry, who seventy-five years 
ago on a similar issue (the "cult of curability”) 
declared: “Statistics which are not really 
statistics are worse than useless; and the reason 
is that they beguile the student with a show 
of knowledge and thus take away the main 
inducement to further inquiry. Why should 
he look further for truth when it already lies 
before him? Some of the prevalent errors re- 
specting insanity and the insane are fairly at- 
tributable to these vicious statistics, for figures 
make a deeper impression on the mind than 
the most cogent arguments." 

Eysenck's survey could, moreover, be abused 
by the biased and the uninformed with im- 
portant ill effects socially: if psychotherapy 


1 This reply to Eysenck does not appear in the same 
journal in which his paper was published because, ac- 
cording to the editor, a policy decision not to print re- 
joinders was arrived at in accepting Eysenck’s contro- 
versial piece. 


does more harm than good, why should it be 
supported financially or by public confidence? 

It is therefore the intent of this communica- 
tion to re-examine critically the data and ar- 
guments set forth by Eysenck in his attempt 
to prove that “the figures fail to support the 
hypothesis that psychotherapy facilitates re- 
covery from neurotic disorder" (2, p. 323), 
and briefly to suggest grounds upon which it 
is possible to undertake a veridical evaluation 
of the effects of psychotherapy. In view of the 
strongly supportive role played in Eysenck’s 
analysis by the comparable surveys of Landis 
(4) and Denker (1), these earlier appraisals 
will also be incidentally examined.

To be noted at the outset is the difference 
between Landis and Eysenck in appreciating 
the difficulties involved in making any ap- 
praisal of the effects of psychotherapy. Landis 
(4, p. 156) summarizes the sources of these 
difficulties—ignorance of the nature or cause 
of mental disease, disagreement among experts 
concerning even such broad differentiations 
as somatogenic or psychogenic, lack of uniformity
with respect to categories of improvement—and
concludes: “Because of these difficulties,
it is apparent that statistical figures,
rates of recovery, etc., have to be evaluated 
cautiously, precisely, and with a minimum of 
generalization.” The caution thus advised is 
in contrast with Eysenck’s approach. Not only 
does Eysenck generalize freely, but in his paper 
one repeatedly encounters a polite bow of 
recognition to the sources of difficulty outlined
by Landis which are then lightly dismissed.
Confronted with the problems involved
in assembling data from 24 separate
studies, each of which employed its own
methods and criteria, Eysenck indicates several 
reconciliatory devices he adopted, then adds 
(p. 322): “The total number of cases involved 
in all these adjustments is quite small. Another 
investigator making all decisions in exactly 
the opposite direction to the present writer's 
would hardly alter the final percentage figures 
by more than 1 or 2 per cent." Again, considering
the definition of neurosis and offering
his redefinition, he recognizes (p. 321) that a 
“degree of subjectivity is probably implied 
in the writer's judgment as to which disorders 
and diagnoses should be considered to fall 
under the heading ‘neurosis,’ ” but, as before, 
he concludes with facility: “The number of 
cases where there was genuine doubt is prob- 
ably too small to make much change in the 
final figures, regardless of how they are allo- 
cated.” When he presents his procedure for 
overcoming the diversity in gradations of im- 
provement as reported in the various studies, 
he remarks (pp. 320-321): “A slight degree of 
subjectivity inevitably enters into this pro- 
cedure, but it is doubtful if it has caused much 
distortion.” In taking up the serious questions 
as to whether the patients in his control (no 
treatment) groups were as seriously ill as those 
in the experimental groups and whether 
standards of recovery possibly differed in the 
two divisions, he offers some brief speculations 
and then concludes (p. 322): “In the absence 
of agreed standards of severity of illness, or of 
extent of recovery, it is not possible to go 
further.” The meaning here would seem to be 
that it is not possible to go beyond speculation 
but this admission in no wise deters the author 
from the sweeping conclusions he then pro- 
ceeds to draw. 

The foregoing resumé, largely in the author’s 
own words, will be appraised in the ensuing 
re-evaluation. The argument will take the 
following course: 

A. What is psychoneurosis? (It should be 
noted that Eysenck selects the psychoneuroses 
in making his evaluation.) Is Eysenck’s re- 
definition of neurosis consistent with a true 
evaluation of the effects of psychotherapy in 
this type of disorder? Is the severity of illness 
comparable in his contrasted groups? 

B. What is psychotherapy? Did the two
control subgroups instanced by the author
actually receive no psychotherapy? In other
words, does the control group control, and is
the so-called base line basic?

C. What is improvement or recovery? Were
the criteria for successful outcome as applied
in the control and experimental groups identical
or even comparable?


A. WHAT IS PSYCHONEUROSIS?


As has already been noted, Eysenck acknowledges
a degree of subjectivity in his
judgment as to what disorders should fall
under this diagnostic category—the critical 
one for his study. His decision (and redefini- 
tion) is as follows (p. 321): “Schizophrenic, 
manic-depressive, and paranoid states have 
been excluded; organ neuroses, psychopathic 
states, and character disturbances have been 
included." On this basis, which many would 
question (particularly because of the inclusion 
of psychopathic states), he then proceeds to 
redistribute the data in the various reports 
included in his survey. It is his opinion, as 
quoted above, that these reallocations are too 
unimportant "to make much change in the
final figures," but it can be shown by a single 
example that this sanguinity on his part is 
not justified. The illustration is afforded by 
the data of Fenichel's (3) report from the 
Berlin Psychoanalytic Institute—a report 
utilized by both Eysenck and Landis. If one 
compares the number of psychoneurotics 
treated at that Institute as re-reported by 
Landis and by Eysenck, one finds that this 
figure is given by the former author as 312 
and by the latter as 484. Since the total number 
of cases accepted for treatment at the Berlin 
Institute in the ten years covered by the re- 
port was 604, it can be readily appreciated 
that Eysenck's redefinition has caused a shift 
in diagnosis of 172 cases or 28 per cent. The 
effect on the figures for improvement and re- 
covery is even more germane and the difference 
between the estimates given by Landis and 
by Eysenck for this same Berlin group is there- 
fore noteworthy: Landis reports 58 per cent 
improved or recovered, and Eysenck, under 
this same heading (his Table 1), lists 39 per 
cent—a difference of 19 per cent. These dif- 
ferences of 28 per cent and 19 per cent are 
scarcely the negligible quantities that Eysenck 
has disarmingly led us to expect would be the 
result of his modified definition. 
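
The arithmetic behind the Berlin example can be set out in a few lines. This is only a check on the figures quoted above; the variable names are introduced here for illustration, and the percentages for improvement are taken as reported rather than recomputed from case counts.

```python
# Figures quoted in the text for the Berlin Psychoanalytic Institute report.
total_accepted = 604        # cases accepted for treatment over ten years
neurotics_landis = 312      # psychoneurotics as re-reported by Landis
neurotics_eysenck = 484     # psychoneurotics as re-reported by Eysenck

shifted_cases = neurotics_eysenck - neurotics_landis
shifted_percent = round(100 * shifted_cases / total_accepted)

improved_landis = 58        # per cent improved or recovered (Landis)
improved_eysenck = 39       # per cent improved or recovered (Eysenck, Table 1)
improvement_gap = improved_landis - improved_eysenck

print(shifted_cases, shifted_percent, improvement_gap)
```

Both differences are far larger than the "1 or 2 per cent" that Eysenck allows for the total effect of his adjustments.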

Of equal significance is the question as to 
whether the severity of the illness in Eysenck's
control and experimental groups can be con- 
sidered to be comparable. To establish his 
base line for successful outcome without psy- 
chotherapy, Eysenck employed the figure of 
72 per cent given by Landis as the consolidated 
amelioration rate of New York state hospitals 
(1917-1934), and the identical figure of 72 
per cent (for successful treatment within two 
years) obtained from the report of Denker 
(1) for a group of 500 psychoneurotic disability
claims treated by general practitioners
throughout the United States. These control 
subgroups are compared with 24 experimental 
groups subdivided by Eysenck, according to 
the method of psychotherapy, as either psycho- 
analytic or eclectic. 

If one now returns to the question raised 
as to the severity of illness in the several 
groups, the following interpretations seem 
warranted. The insurance disability cases 
were, as a whole, in all likelihood less severely 
ill than any of the others. Denker points out 
(1, p. 2165) that in these cases where dis- 
ability income was a factor the illness may 
have been prolonged by this tangible secondary 
gain. By the same token the illness may very 
well have been initiated, or at least partially 
instigated, by conscious or unconscious pros- 
pects of such gains. To compare psycho- 
neuroses of long standing, dating in many 
instances from early childhood (the typical 
case treated by psychoanalysis), with such 
disability neuroses is highly dubious, and the 
fact that the latter would have cleared up 
quickly after brief treatment by a general 
practitioner is thus not surprising. At the other 
extreme from these disability patients are 
those cases, cited after Landis, which were 
institutionalized in the various New York 
state hospitals. Here one could reasonably ex- 
pect that the neuroses must have been extra- 
ordinarily severe in order for these patients 
to have become eligible for admission to these 
crowded institutions. In these instances the 
outcome of treatment would be expected to 
be far less favorable than for either the Denker 
control group or the experimental groups. 
But at this point a question arises as to the 
standards of recovery which would apply for 
discharge from a state hospital as compared 
with the criteria of recovery utilized by a 
psychoanalyst or psychiatrist in private prac- 
tice. To this problem we shall return later. 
For the present it may be concluded that, in 
general, the Denker base-line group was prob- 
ably less seriously ill, the Landis control group 
more seriously ill than the various experimental 
groups instanced by Eysenck. To the degree 
that this conclusion is sound it may be further 
inferred that the control and experimental 
groups fail to meet an essential criterion of 
comparability—illness severity. When this consideration
is added to the previous one, concerning
Eysenck's redefinition of neurosis as
applied to the various studies, the basis for his
generalizations is seriously called into question.


B. WHAT IS PSYCHOTHERAPY?


A second point of difference between the 
control and experimental groups in Eysenck's 
survey concerns treatment: the control groups 
of Landis and Denker are presented as not 
having received psychotherapy in contrast to
the experimental groups which did. The treat- 
ment given the latter groups is merely desig- 
nated as psychoanalytic or eclectic. Since 
Eysenck at no point defines what he means by 
psychotherapy, it becomes necessary to ex- 
amine the several specific mentions of psycho- 
logical treatment or lack of it in his survey, 
the main problem here being whether his as- 
sertion is supported that the control groups 
did not receive psychotherapy. 

If one turns first to the Denker group, it is 
discovered that in Eysenck's words (p. 320) 
these patients were "regularly seen and treated 
by their own physicians with sedatives, tonics, 
suggestion, and reassurance, but in no case 
was any attempt made at anything but this 
most superficial type of ‘psychotherapy’ which 
has always been the stock-in-trade of the 
general practitioner.” On the basis of this
description it is hardly possible to agree that 
these patients were not psychotherapeutically 
treated. The various presumably nonpsycho- 
therapeutic techniques mentioned include sug- 
gestion and reassurance —well-known methods 
of psychotherapy; and psychiatrists regularly 
use sedatives and tonics as adjuncts to their 
practice. That one is actually dealing here 
with psychotherapy—if not, to be sure, with 
psychoanalysis—becomes eminently clear when 
it is noted that among the experimental groups 
cited by Eysenck 80 per cent were treated by 
eclectic methods (see Table 1, B). What would 
these eclectic methods be if they did not in- 
clude the very techniques attributed to the 
general practitioner? The only difference be- 
tween the work of the general practitioner and 
of the eclectic psychiatrist that could be as- 
sumed, in the absence of detailed and specific 
knowledge, would be a difference in thorough- 
ness or expertness, not a difference in kind. 
And being aware that some general practitioners
working with patients well known to
them are excellent eclectic psychotherapists,
one would have to make even this qualification
very guardedly. A reading of Denker’s paper
makes it evident that he himself does not regard
his 500 disability cases as untreated by
psychotherapy; he assumes only that the 
general practitioner is less expert than the 
psychotherapeutic specialist. 

A similar inference is warranted with respect 
to the Landis control group. To maintain that 
neurotic patients admitted to state hospitals 
receive no psychotherapy is seriously open to
doubt. These institutions, despite their no- 
torious shortage of staff, usually make a special 
effort to treat their neurotic admissions, be- 
cause these cases have a better prognosis, and 
because they are far more accessible to treat- 
ment. The methods employed would pre- 
sumably be eclectic, like those of the Denker 
study and of the various eclectic experimental 
groups. 

It must then be concluded that the control 
subgroups cited by Eysenck do not sharply 
differ from the experimental groups in respect 
to the important variable of having received 
psychotherapy. As before with regard to ill- 
ness severity, the necessary contrast between 
the base line and the experimental groups be- 
comes markedly attenuated. 


C. WHAT IS RECOVERY?


The crucial question in the present re- 
evaluation is, finally, whether the degree of 
improvement or recovery in the control and
experimental groups can be regarded as equal. 
In other words, the control base line for per- 
centage of cases cured can attain true sig- 
nificance only if the degree of improvement 
for these cases is identical with, or, at least, 
closely similar to that which the experimental 
cases may be estimated to have achieved after 
intensive psychotherapy. To determine, on 
the available evidence, the answer to the question
thus posed is the last step in this re-evaluation.

Needless to say, degree of improvement is
extremely difficult to assess and the difficulty
is increased when one is dealing at second hand
with cases treated by diverse methods and by
various therapists. The most significant obstacle
to the evaluation of degree of recovery
lies, however, in the differences in improvement
standards. In the present instance it is
this particular difficulty which looms large.
In view of the fact that the Denker group has
already been shown to represent, in all probability,
a less severe degree of illness, this
group will not here be further discussed. For
the control group, attention will be focused 
on the far more representative and more gener- 
ally important base-line figure from Landis, 
derived from the consolidated amelioration 
rate of New York state hospitals for 1917-34; 
for the experimental groups both the eclectic 
and the psychoanalytic will be considered. 

As has been already stated, Eysenck reports 
that patients treated by psychoanalysis im- 
prove to the extent of 44 per cent, those treated 
eclectically, 64 per cent, and those treated 
custodially, 72 per cent. It was these figures 
which at the outset challenged us to believe 
the improbable. It will be the burden of the 
present section to show that the improbability 
resolves itself largely into differences in the 
presumed standards of improvement invoked 
in these three treatment pools. 

We may begin with two brief statements 
found in Landis (4). In discussing his Table 
II, which deals with the percentage of patients 
discharged from mental hospitals as recovered 
or improved, he characterizes (p. 159) the 
presented figures as indicating to what extent 
"hospitalization of one year or less yields 
sufficient improvement for favorable discharge" 
(my italics). By contrast, in describing the 
criterion employed at the Berlin Psycho- 
analytic Institute for recovery Landis (p. 162) 
paraphrases Fenichel (3) thus: “Only those 
cases were classified as recovered in which 
the success consisted in the disappearance of 
symptoms, and which also underwent an es- 
sential change which was completely explicable 
from the rational, analytical viewpoint." (A 
less rigorous standard was, of course, employed 
by Fenichel for improvement, etc.) In other 
words, while patients residentially treated are 
generally considered in terms of hospital dis- 
charge and return to the community, the cri- 
terion of social recovery being highly relevant, 
patients nonresidentially treated, as by psy- 
choanalysis, live continuously in the com- 
munity and are worked with in terms of radical 
therapy which, if successful, permits them to 
live not only with others but with themselves. 
This difference in therapeutic goal is so great 
that percentage figures for residential and non- 
resident treatment are dubiously commensur- 
able. 

The fact that improvement or recovery as 
defined in relation to hospital discharge may 
reflect an extremely low standard becomes 


302 


patent if one notes, for example, that in 
Table I of Landis, which presents the number 
of patients discharged annually as recovered 
or improved per one hundred admitted to state 
mental hospitals, the figure given for psychopathic
personality is 75. Anyone having the
slightest familiarity with this type of case is 
aware that such patients are in only the 
vaguest sense cured or improved at discharge. 
If 75 per cent of them are returned to the com- 
munity, this figure can only mean that they 
have temporarily made a social recovery. The 
majority of them will doubtless be back again 
if they do not in the meantime gain admission 
to some other correctional institution. One is 
dealing here with a striking illustration of the 
comparatively low recovery standards inherent 
in state-hospital discharges.

It would, however, be incorrect to lump to- 
gether state-hospital discharges and discharges 
from intensive-treatment institutions like the 
New York Psychiatric Institute or Maudsley 
Hospital as if the same standard of recovery 
prevailed in both. We are here presumably 
dealing with a gradation not only as between 
nonresidential and residential improvement 
standards but as between standards in inten- 
sive-treatment hospitals and state hospitals. 
It must therefore be recognized that resi- 
dentiality of treatment is only an approximate, 
and by no means a perfect, criterion of the 
rigorousness of improvement standards. 

The figures for residential and nonresidential 
treatment in the control and experimental 
groups cited by Eysenck were accordingly de- 
termined and will now be presented as an ap- 
proximate index of recovery standard. As indi- 
cated already, only the Landis control group 
was considered; it may be stated forthwith 
that the state-hospital patients included in it 
were residentially treated. For the experi- 
mental groups surveyed by Eysenck, it was 
found by a study of the available reports in- 
cluded in his survey that approximately 4,040 
of 5,262 eclectic cases, or 77 per cent, were 
residentially treated; of the 760 psychoanalytic 
cases, none were so treated.²
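
The proportions just given follow directly from the case totals. The sketch below merely restates that arithmetic; the variable names are illustrative, and the nonresidential eclectic total of 1,222 is the one given in the accompanying footnote.

```python
# Case totals as given in the text and its footnote.
eclectic_residential = 4040      # eclectic cases treated residentially
eclectic_nonresidential = 1222   # eclectic cases treated nonresidentially
eclectic_classified = eclectic_residential + eclectic_nonresidential

psychoanalytic_total = 760
psychoanalytic_residential = 0   # none were treated residentially

eclectic_percent = round(100 * eclectic_residential / eclectic_classified)
psychoanalytic_percent = round(100 * psychoanalytic_residential / psychoanalytic_total)

print(eclectic_classified, eclectic_percent, psychoanalytic_percent)
```

Note that the base of 5,262 covers only those eclectic cases whose residential status could be determined.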


2 The figures here presented were derived from the 
reports cited by Eysenck in Table 1. (See his list of 
References.) The following proved to have been treated 
nonresidentially: Psychoanalytic—all five groups (total 
N = 760); Eclectic—Huddleson, Neustatter, Luff and 
Garrod (two groups), Yaskin, Carmichael and Masser- 
man, Schilder, Wilder (total N = 1,222). The following 




We arrive thus at a resolution of Eysenck’s 
paradox. If 44 per cent of neurotics treated by 
psychoanalysis, 64 per cent of those treated 
eclectically, and 72 per cent of those treated 
custodially improved or recovered within two 
years, this sequence of figures does not prove 
the improbability that the more intensive the 
psychotherapy, the less benefit to the patient; 
rather it reflects the probability that the more 
intensive the therapy, the higher the standard 
of recovery. This interpretation is borne out 
by the fact that the three patient pools in the 
order listed above vary in this same order with 
respect to the frequency with which treatment 
was given nonresidentially rather than resi- 
dentially. The figures 44, 64, and 72 correlate 
perfectly with the above-given figures for 
frequency of residential treatment in the three 
patient pools: Psychoanalytic—0 per cent, 
Eclectic—77 per cent, Custodial—100 per cent. 
The implication is: the higher the standard of 
recovery, the smaller the degree of reported 
success.³
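
The agreement between the two sets of figures can be illustrated with a rank-order coefficient. Reading "correlate perfectly" as agreement in rank order is an assumption made here, and the helper functions are written only for this sketch (they assume no tied values).

```python
def ranks(values):
    """Rank values from 1 (smallest) upward; assumes no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result

def spearman_rho(x, y):
    """Spearman rank-order coefficient for untied data."""
    n = len(x)
    d_squared = sum((a - b) ** 2 for a, b in zip(ranks(x), ranks(y)))
    return 1 - 6 * d_squared / (n * (n ** 2 - 1))

# Order of pools: psychoanalytic, eclectic, custodial.
percent_improved = [44, 64, 72]
percent_residential = [0, 77, 100]

rho = spearman_rho(percent_improved, percent_residential)
print(rho)
```

With only three patient pools the coefficient describes the ordering and nothing more; it should be taken as an illustration of the pattern, not as a test of it.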

If this conclusion is thought to depend too 
much on inference, it is sufficient for the present 
purpose to rest this part of the discussion with 
the more modest statement that the standards 
of improvement and recovery in Eysenck’s 
various patient groups, control and experi- 


were treated residentially: Psychoanalytic—none;
Eclectic—Matz, Maudsley (1931), Ross, Curran,
Hamilton & Wall, Hamilton, et al., Landis, Miles, et al.
(total N = 4,040). Several qualifications should be
noted. The Maudsley reports (Nos. 3 and 4 under B
in Eysenck's Table 1) were not available to the writer.
However, the survey by Wilder includes the 1927-1931
Maudsley report, and from this source it is inferred that
these patients were treated residentially. The later
Maudsley group had to be omitted. The necessary
figures for the Institute of Medical Psychology group
(Eysenck's No. B, 17) could not be found in the source
cited by Eysenck. The 50 cases reported by Masserman
and Carmichael are ambiguous as regards residentiality
and have therefore not been included. (Excluded
total N = 2,031.)

3 Eysenck has made a slip on p. 320 where he indicates
that his survey of treated groups includes “the
results of nineteen studies reported in the litera- 
ture..." Inspection of Table 1 on p. 321 makes it 
evident that his survey actually included 24 studies. 
The total series in the table is divided in two—subseries 
(A) including five groups who received psychoanalytic 
therapy, and subseries (B) consisting of nineteen groups 
who received eclectic psychotherapy. Has the author in 
the error on page 320 tacitly acknowledged the nineteen 
eclectic groups and “repressed” the five psychoanalytic 
ones? 


A TRANSVALUATION OF PSYCHOTHERAPY—A REPLY TO HANS EYSENCK 


mental, bear so little resemblance to each 
other that, once again, the basis of his com- 
parisons has little demonstrable validity. 

The foregoing re-evaluation from the stand- 
points of the definition and severity of neurosis, 
amount of psychotherapy accorded, and 
standards of recovery in the several patient 
groups thus leads to the general conclusion 
that Eysenck's data and arguments fail to 
support his thesis that psychotherapy cannot 
be shown to facilitate recovery. The need for 
a more circumspect use of statistics in this
highly complex area of evaluation is under- 
scored. It is not, however, maintained that a 
conclusion in the opposite direction is war- 
ranted. The only safe deduction on the basis 
of currently available data is that, in view of 
the diversity of methods and standards in the 
field of psychotherapy, broad generalizations 
as to the effectiveness of treatment are to be 
avoided. 
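
The residential-treatment percentages used in this reanalysis can be rechecked from the totals given in footnote 2. The following is a minimal sketch of that arithmetic and is not part of the original article: the variable and function names are illustrative assumptions, the patient totals are those stated in the footnote, and the custodial figure of 100 per cent is simply taken as stated in the text.

```python
# Patient totals from footnote 2 (groups whose setting could be determined).
nonresidential = {"psychoanalytic": 760, "eclectic": 1222}
residential = {"psychoanalytic": 0, "eclectic": 4040}


def residential_pct(pool):
    """Percentage of a pool's classifiable patients treated residentially."""
    total = nonresidential[pool] + residential[pool]
    return 100.0 * residential[pool] / total


# Improvement rates quoted in the text and residential percentages,
# both in the order psychoanalytic, eclectic, custodial.
improved = [44, 64, 72]
resid = [
    round(residential_pct("psychoanalytic")),
    round(residential_pct("eclectic")),
    100,  # custodial pool: taken as stated in the text, not computed
]


def ranks(values):
    """Rank positions (0 = smallest); assumes there are no ties."""
    order = sorted(values)
    return [order.index(v) for v in values]


# True when the two series place the three pools in the same order.
same_order = ranks(improved) == ranks(resid)
print(resid, same_order)
```

The check covers only the ordering of the three pools, which is all that the "perfect" correlation in the text refers to.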


One may seek to evaluate the effects of
psychotherapy either by counting reported
outcomes or by considering dynamic change
in the process of treatment, or both. In the
former instance each patient serves as a poll,
for or against, as indicated by his subjective
report or the therapist’s clinical judgment; in
the latter, one examines each treated person-
ality as a system of structures and forces
which, in the course of therapy, is altered in
a definable direction. To undertake an evalu-
ation of the effects of psychotherapy by tally-
ing outcomes at second hand, without even
introducing the problem of dynamic change
in various forms of illness and in differing
therapeutic procedures, and, in default of such
considerations, to reassign diagnoses and prog-
noses is to invite the inconsistencies and non
sequiturs that have been demonstrated in the
foregoing reanalysis. But it is not to be con-
cluded that statistical evaluation is totally to
be eschewed because such pitfalls exist; the
implication is, rather, that the use of statistics
in the evaluation of psychotherapy demands
special precautions and, in addition, the
guarantee of a concurrent or prior dynamic
evaluation of each case—an evaluation in
which the complexity of the individual patient
and of the therapeutic situation has been fully
considered. Such a dynamic appraisal must
take into account the organization of the
patient’s personality and life space before,
during, and after treatment. It must, for ex-
ample, even allow for such statistically equivo-
cal outcomes as those in which therapy 
produces no computable decrease in former 
symptoms but an increased capacity to ac-
cept them and to accept oneself. It must, in
other words, reflect an understanding by the
therapist and, at least in part, by the patient,
of the reason or reasons for the alleged out- 
come of treatment. 

In internal medicine and surgery the neces- 
sity for empirical adjuncts to any clinical eval- 
uation of therapeutic change has long been 
recognized—as, for example, when X rays, 
blood tests, biological assays, etc. are invoked 
to determine, quite apart from the patient’s 
subjective report or the physician’s clinical 
judgment, what progress the patient is making 
in healing a fracture or in overcoming an infec- 
tion. Psychology and psychiatry may not yet 
be ready for great precision in the making of 
such independent evaluations, but the last 
decade has seen rapid advances toward this 
goal. One reason for opposing such naive evalu- 
ations of the effects of psychotherapy as the 
one here re-examined is to keep open the 
intrinsically difficult road which such in- 
vestigation is destined to follow. 

What good is psychotherapy? As good as 
man’s faith in his humanity. Men have always 
believed in their ability to change for the better 
and to help each other so to change—through 
mutual assistance, love, religion, and art. Con- 
ceived in the broadest terms psychotherapy 
derives from the same faith and, employing of 
necessity some of the same means, attempts to 
formulate these more precisely. The question 
is not, then, whether psychotherapy does any 
good—one might as reasonably ask, “Is life 
worth living?"; the question is how does ther- 
apy accomplish its ends in those fortunate 
instances where, despite the adverse odds, it 
manages to succeed. It is to the process, not
the superficially appraised end result—to the
disorganization or organization of forces which 
may spell illness and partial death or health 
and growth—that attention should be directed 
if we are to learn anything about psychother- 
apy. If, in aiming at this admittedly more 
difficult kind of assessment, we shall have to 
postpone rigorous quantification or use it 
guardedly, to ensure genuine precision, such 
caution may well constitute a measure of the 


maturity—the “tolerance for ambiguity”—
which keeps us from hasty generalization.


REFERENCES


1. Denker, P. G. Results of treatment of psycho-
neuroses by the general practitioner: a follow-up
study of 500 cases. N. Y. State J. Med., 1946, 46,
2164-2166.

2. Eysenck, H. J. The effects of psychotherapy: an
evaluation. J. consult. Psychol., 1952, 16, 319-324.

3. Fenichel, O. Ten years of the Berlin Psychoanalytic
Institute, 1920-1930.

4. Landis, C. A statistical evaluation of psychothera-
peutic methods. In L. E. Hinsie (Ed.), Ch. V in
Concepts and problems of psychotherapy. New
York: Columbia Univer. Press, 1937.

5. Ray, I. Statistics of insanity, in Contributions to
mental pathology. Boston: Little, Brown, 1873.


Received October 8, 1953. Prior Publication.


CASE REPORT


A CASE STUDY IN A BEHAVIORAL ANALYSIS OF
PSYCHOTHERAPY¹


EDWARD J. MURRAY
Yale University


¹ This paper was presented at the 1952 meeting of
the Eastern Psychological Association, Atlantic City.


Psychotherapy is of considerable interest
to many psychologists today. This is
partly because psychotherapy is the
only rational approach to the treatment of
neuroses and psychoses. It is also because
psychotherapy is a unique source of data
about some of the most important and elusive
processes of human behavior. Yet, there is
much we need to learn about psychotherapy.
It is not clear why only some patients improve.
Current research using measures before and
after therapy, or comparing therapy with no-
therapy, yields little useful information;
such an approach establishes no relationship
between what actually occurs during therapy
and the outcome. Thus, there is no rationale
for gradually improving tactics or understand-
ing the changes in various kinds of patients.
In spite of the general descriptions of therapy
that are available, it is not clear just what
happens between a therapist and his patient.
To some extent this is due to the complexity
and the number of the events in therapy. In
addition, we have to contend with the sub-
jectivity of the therapist’s report. What is
needed is an objective behavioral description
of psychotherapy. Such a description must be
comprehensive enough to capture the impor-
tant events and yet simple enough to clarify
the complexity of the events. With an adequate
description of the events in psychotherapy,
studies on the prediction of therapeutic
progress from psychological tests, as well as
studies on the evaluation of therapy using
outside criteria, will take on new meaning and
exert more influence on therapeutic conduct.
The purpose of this paper is to describe a
first step which was taken in the direction of
an adequate description of psychotherapy.

Many events occur during psychotherapy.
Those which are readily observable may be
grouped as physiological, gross behavioral,
and verbal. All three groups have been studied
(e.g., 3, 10) and should be studied further.
However, verbal behavior seems to be most
critical from many points of view. There
have been studies on the grammatical and 
formal properties of verbal behavior in therapy 
(e.g., 2), which are interesting in many ways 
but do not seem to be related to the major
theories of personality in any determinate
way. The content or meaning functions of 
verbalization seem much more relevant. The 
general method for studying such material 
is called content analysis. The research on 
psychotherapy done by the Rogers group (10) 
uses content analysis. However, in content 
analysis the categories which one selects are 
determined by theory. The theory guiding 
the Rogerian content analysis appears to be 
a vague perceptual schema. The content
analysis which we are developing is guided 
by two other points of view: psychoanalytic 
theory and learning theory. In this context 
the most relevant categories are those con- 
cerned with motivation and defense. Thus, 
we propose to study the content of verbal 
behavior in psychotherapy with respect to 
underlying motives and defenses. This study 
illustrates this approach with one psycho- 
therapy patient. 
A CASE STUDY

The patient was seen for 17 hours in an
outpatient clinic.² All hours except the thir-
teenth were phonographically recorded. The
first hour was omitted because it consisted 
of history taking. The patient was referred 
by the medical clinic. His complaint was that
“he has trouble getting to sleep at night—
feels that if he falls asleep he may die. He is
tremendously threatened but can't say what 
he is threatened by." At the time of the first 
interview the patient was described by the 
therapist as follows: 

The patient is a well built, good looking young man 
of 24. His family was once well-to-do, he's a college 
graduate, and he's now working in a real estate firm. . . . 
His parents were divorced when he was eight after a 
protracted period of arguments which the patient re-
members as painful to him. Apparently his mother was
unfaithful and a good deal of the dispute was about this.
The patient also remembers being taunted about his
mother's behavior by one of his companions. Before
the divorce the entire family moved about and finally
lived with the maternal grandmother. He was sur-
rounded by dominating and pampering females. He now
relates this to a “complex” of going from person to
person for aid. After the divorce he lived with his aunt.
His mother remarried, had a child, who is now nine,
and moved in with the patient and his aunt, which is
the present home situation. His mother was harsh to
him and he developed a distaste for her. He wanted to
be different from her and rejected her “emotional”
kind of living in favor of a “logical and rational ap-
proach." He adjusted to school and social relations
with boys very easily. He tended, though, to be a
teacher's pet in grammar school. He did well in science
and math in high school and started out in biology at
college. Upon his return to college after the war he
changed to the social sciences taking his degree in
history. His relations with girls were extremely in-
nocent until he went in the army. He masturbated
during adolescence in spite of guilt and fear about it.
The army's attitude about masturbation enabled him
to overcome a good deal of this guilt. While in the army
he went out with a college girl without ever having
intercourse, avoiding the kind of girls with whom he
might have had sex and avoiding one situation with
this girl which might have led to sex. While overseas
he did have some sexual relationships. Generally he
seems very cooperative and well motivated.

² The author wishes to thank Dr. Larry Hemmen-
dinger of the Veterans Administration Regional Office,
Bridgeport, Connecticut, for providing this case ma-
terial.


Therapy was mainly supportive but in- 
cluded interpretations about the defensive 
nature of his physical complaints and the 
hostility which arose when he became de- 
pendent. A permissive attitude toward the 
expression of hostility was maintained. 

After a considerable period of trial and 
error, two main categories were selected for 
content analysis: hostility and defense state- 
ments. The hostility category was divided 
into six subcategories of hostility: (a) mother, 
(b) aunt, (c) other people, (d) general situa- 
tions and groups, (e) the therapist, and (f) 
a vague “at home" (referring to one or more 
persons in his home). The defense category 
was composed of: (a) intellectual defensive
statements which included the patient's
views on philosophy, science, current events,
etc., and (b) complaints about a wide variety 
of physical symptoms and discomforts. Every- 
thing else was considered irrelevant. 
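
The category scheme above lends itself to a simple tally of judged statements for each therapy hour. The sketch below is a hypothetical illustration and not the author's procedure: the category labels, the `main_category` and `tally` functions, and the sample statements are all assumptions introduced here.

```python
from collections import Counter, defaultdict

# Subcategory labels are illustrative shorthand for the eight categories.
HOSTILITY = {"mother", "aunt", "others", "general", "therapist", "home"}
DEFENSE = {"intellectual", "physical"}


def main_category(label):
    """Map a subcategory label to hostility, defense, or irrelevant."""
    if label in HOSTILITY:
        return "hostility"
    if label in DEFENSE:
        return "defense"
    return "irrelevant"


def tally(scored):
    """Count statements per main category for each therapy hour.

    `scored` is an iterable of (hour, subcategory_label) pairs.
    """
    counts = defaultdict(Counter)
    for hour, label in scored:
        counts[hour][main_category(label)] += 1
    return counts


# Hypothetical judged statements, invented purely for illustration.
sample = [
    (2, "intellectual"), (2, "physical"), (2, "mother"), (2, "weather"),
    (6, "mother"), (6, "mother"), (6, "aunt"),
]
counts = tally(sample)
print(dict(counts[2]), dict(counts[6]))
```

Any label outside the two defined sets falls into the irrelevant category, which mirrors the rule that everything else was considered irrelevant.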

The unit scored was called a statement.
This was either a simple sentence or the mean-
ing phrases of a more complicated sentence.
Each statement was judged as belonging to
one of the several categories. The main measure
was the number of statements in a given
category for each hour. Relationships from
hour to hour between categories, and between
categories and the behavior of the therapist,
were determined.

The record of each hour was played very
slowly, and each statement was judged either
as belonging to one of the eight categories or
as irrelevant. A reliability study with three
other judges scoring the same hour showed
that the categories were defined in a way which
was precise enough for teaching other people
the method. The reliability was high when
the eight categories were compared (r = .86,
.89, and .91; p < .01 in all cases), and even
higher when the irrelevant category was added
(r = .94, .95, and .98; p < .01 in all cases).


HOSTILITY AND DEFENSES


Expressing hostility was a major problem
for the patient. In the summary of the treat-
ment, the therapist says, “his problems today
seem centered around dependency and con-
comitant resentment.” However, the therapist
did feel that some progress was made in his
ability to express his hostility as well as to
see its relationship with his dependency. If
the patient had trouble expressing hostility,
then his hostility must have aroused anxiety.
From the theory of conflict (1, 4) we would
predict that if hostility is inhibited by anxiety,
then, if anxiety is reduced by the permissive
attitude of the therapist, the overt manifesta-
tions of hostility should increase while the
overt manifestations of anxiety should de-
crease. In this study it is assumed that the
hostility statements are the overt manifesta-
tion of hostility, and the defense statements
the overt manifestation of anxiety. Figure 1
shows the hostility and defense statements
throughout the course of psychotherapy.
It can be seen that hostility increases and
defenses decrease. Moreover, the hour-to-
hour fluctuations show a true reciprocal rela-
tionship which strengthens the conflict analy-
sis. The crisscrossing from Hour 4 to Hour 8
takes place in between the part of the therapy
when defenses were high and the later part
when the expression of hostility was high. The
correlation is negative and highly reliable (r =
-.73, p < .01).
FIG. 1. FREQUENCY OF HOSTILITY AND DEFENSE
STATEMENTS THROUGHOUT THERAPY
[Figure: number of statements plotted against psychotherapy hours, with curves for hostility and defenses.]

However, the objection might be raised
that this is not a dynamic relationship between
the two categories because the total number
of statements is limited and when one category
goes up the other must come down. This is 
not the case, because, with the exception of 
one hour, pooled hostility and defense state- 
ments never constituted more than 60 per 
cent of the total statements of one hour. An 
objection which can be made to this is that 
there may be nothing unique about the hos- 
tility-defense relationship if the residual or 
irrelevant statements also correlate with
either hostility or defenses. This objection is
ruled out by the fact that the residual is not
correlated with hostility (r = -.20) or with
defenses (r = .19). Another objection which
might be raised is that, since the total number
of statements varied from hour to hour be-
cause of differences in the patient’s rate of
speech, the length of pauses, and the number
of therapist’s remarks, the relationship is an
artifact of the different totals. That this is
not the case is shown in Fig. 2, where the
percentage of the total number of hostility
and defense statements is plotted.³ In this
form, hostility is still negatively and highly
reliably correlated with defenses (r = -.75,
p < .01). Hostility in percentage form cor-
relates very highly with hostility in the fre-
quency form in Fig. 1 (r = .96, p < .01).
This is also true of defenses (r = .94, p < .01).
We feel that this result strongly suggests
that the patient's anxiety about expressing
hostility was decreased as a result of the thera-
pist’s activity and/or inactivity. However,
this may be limited only to the therapeutic
situation; outside observations are needed to
demonstrate any more general conclusion.

³ The results presented in Figs. 3, 4, and 5 also show
little change when plotted in percentage form.


FIG. 2. PERCENTAGE OF HOSTILITY AND DEFENSE
STATEMENTS THROUGHOUT THERAPY
[Figure: per cent of total statements plotted against psychotherapy hours, with curves for hostility and defenses.]


The verbal changes shown here may or may 
not be indicative of fundamental emotional 
change. Here again, other evidence is needed. 
It is also conceivable that this hostility was a 
defense against a more anxiety-arousing 
sexual problem. But in any event, the lawful- 
ness of the change is encouraging for the use- 
fulness of the method. 
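
The percentage-form check described above can be sketched in a few lines: convert each hour's counts to a share of that hour's total, then correlate the two series in both forms. This is an illustrative sketch only. The helper names `pearson_r` and `to_percent` are assumptions, and the numbers are invented placeholders, not the counts from this case.

```python
import math


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def to_percent(counts, totals):
    """Express each hour's count as a percentage of that hour's total."""
    return [100.0 * c / t for c, t in zip(counts, totals)]


# Hypothetical per-hour counts (NOT the study's data).
hostility = [5, 10, 20, 30, 45]
defenses = [50, 40, 25, 20, 10]
totals = [100, 110, 90, 120, 115]  # all statements, including irrelevant

host_pct = to_percent(hostility, totals)
def_pct = to_percent(defenses, totals)

r_freq = pearson_r(hostility, defenses)  # frequency form
r_pct = pearson_r(host_pct, def_pct)     # percentage form

# Largest share of any single hour taken up by the two categories together;
# a share well below 100 means one category need not fall when the other rises.
max_pooled_share = max(h + d for h, d in zip(host_pct, def_pct))
print(round(r_freq, 2), round(r_pct, 2), round(max_pooled_share, 1))
```

The same helper could, under the same caveats, be applied to two judges' category counts for one hour to estimate scoring reliability.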


THE DISPLACEMENT OF HOSTILITY 


An examination of the occurrence of state- 
ments in the subcategories of hostility from 
hour to hour reveals a sequence of persons 
toward whom hostility is directed rather than 
a global display on each hour. Figure 3 shows 
hostility to mother, aunt, and a combined 
“others” and “general.” The sequence strongly 
suggests displacement. From a psychoanalytic 
point of view we would expect the mother to 
be the recipient of the most basic hostility. 
Mother, aunt, and others form a meaning- 
ful gradient of generalization. The patient’s 
hostility to his mother gradually increased 
up to Hour 6. The therapist’s summary of 
Hour 6 says, “he revealed that his mother had 
punished him for masturbating. She also forced 
food upon him and punished him for not eating 
promptly. He was ‘too little to fight back’ 
but did act in a spiteful way several times. He 
described other hostilities and retaliations 
with respect to his mother." The following 
hour showed little hostility. Then for several 
hours he expressed hostility to his aunt. This 
is viewed as primarily a displacement from 
the mother, although the aunt elicited her 
own share of hostility. Following this there 
was a displacement to other people. We ex-
pected the sequence to reverse itself, i.e.,
after the hostility to others, hostility would
be expressed first to the aunt again and finally
to the mother again. However, a dramatic
external event took place between Hours 15
and 16: his aunt was suddenly taken to the
hospital for an emergency cancer operation.
The therapist says that during Hour 16,
"after considerable struggle he finally stated
an ambivalency about the event." Further-
more, his hostility to his mother was increased
because she had hinted that if the aunt died,
she would take over the home.

FIG. 3. HOSTILITY STATEMENTS REFERRING TO
MOTHER, AUNT, AND OTHERS-GENERAL
THROUGHOUT THERAPY
[Figure: number of statements plotted against psychotherapy hours, with separate curves for hostility to mother, hostility to aunt, and hostility to others and general.]

It should be noted that the learning analysis
of displacement (5), which has been confirmed 
in several animal experiments (6, 7, 9), does 
not predict sequences of hostility to various 
objects. Since it is desirable to make such 
predictions, we embarked on an extension of 
displacement theory (8) and an experimental 
investigation, in collaboration with Mr. 
Mitchell Berkun, of displacement with animals 
in a free-choice situation. Suffice it to say that 
if rats are both rewarded and punished at a 
given goal so as to establish a conflict, they 
will still approach that goal part way before 
displacing to another goal. This is an example 
of how precise data from psychotherapy can 
influence theories of behavior based on animal 
work. 
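
One simple way to make a sequence of hostility targets explicit is to find the hour at which each target's count peaks and then order the targets by that hour. The sketch below is an assumption-laden illustration and not a method used in the study: the `peak_hour` helper and all of the counts are hypothetical and were not read off Fig. 3.

```python
# Hypothetical counts of hostility statements per hour, by target
# (invented for illustration; NOT the study's data).
hours = [2, 3, 4, 5, 6, 7, 8, 9, 10]
by_target = {
    "mother": [1, 3, 5, 8, 20, 2, 1, 0, 1],
    "aunt":   [0, 0, 1, 1, 2, 3, 15, 12, 4],
    "others": [0, 1, 0, 2, 1, 1, 3, 6, 18],
}


def peak_hour(counts):
    """Hour at which a target's count is highest (the first one if tied)."""
    best = max(range(len(counts)), key=lambda i: counts[i])
    return hours[best]


peaks = {target: peak_hour(c) for target, c in by_target.items()}

# Targets ordered by the hour of their peak, earliest first.
sequence = sorted(peaks, key=peaks.get)
print(peaks, sequence)
```

A peak-hour ordering is a coarse summary; it says nothing about why the sequence occurs, which is the question the displacement theory addresses.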

The fact that hostility was directed toward 
people who were further and further displaced 
provides an alternative explanation for the 
general increase in hostility throughout 
therapy. If expressing hostility to these people 
aroused less anxiety than expressing hostility 
to his mother, we would expect more and more 
hostility as therapy progressed. Probably
both displacement and a general reduction of
anxiety because of the treatment operated to 
permit hostility to increase. The hostility 
expressed to the patient’s mother in Hour 16 
was much stronger and was concerned with 
much more recent events than was the hos- 
tility in Hour 6. Thus, there was a therapeutic 
effect. Indeed, expressing hostility to relatively 
unimportant people may be therapeutically 
valuable because it is less fearful. In learning 
theory terms, fear is extinguished in the dis- 
placed situation and these extinction effects 
are generalized back to the fear in the primary 
conflict situation. In the experiment mentioned 
above, the rats returned to the original goal 
after making unpunished goal responses in the 
displaced situation. 


DEFENSES 


The two defense categories seemed to 
operate as alternate members of a defensive 
armamentarium. In Fig. 4 it can be seen that 
the intellectual defense was high at the outset 
and decreased throughout the first part of 
therapy. This intellectual defense was never 
interpreted. The physical complaint defense 
increased as the intellectual defense decreased. 
After the major interpretation in Hour 5, 
the physical complaint defense dropped off 
sharply. In Hour 6 both defenses were low, 
and this hour proved to be especially fruitful, 
as was noted above. This expression of hos- 
tility also increased his anxiety, and the 
following hour was a dull one. Moreover, with 
the physical complaint defense interpreted, 
the noninterpreted intellectual defense in- 
creased in a compensatory way in this subse- 
quent hour. Both defenses decreased during 
the rest of psychotherapy and show no dis- 
tinguishing features as far as the number of 
responses is concerned. The interpretation of 
the physical complaints may have functioned 
as a punishment, or may have established 
insight. Further indices are needed to dis- 
tinguish between the two possibilities. 

There is also a possibility that the two 
defenses had different functions. The intel- 
lectual defense was more assertive and self- 
aggrandizing. It may have been a way of 
telling the therapist that he was masculine 
or mature. On the other hand, the physical 
complaints had a pleading and ingratiating 
tone. This defense may have been motivated 
by feminine or dependent needs. However,
both may be viewed as motivated by anxiety
and both served the function of avoiding talk
about conflict areas.

FIG. 4. THE INTERACTION OF INTELLECTUAL AND
PHYSICAL COMPLAINT DEFENSES AND THE EFFECTS
OF INTERPRETATIONS DURING THE FIRST PORTION
OF THERAPY
[Figure: number of statements plotted against psychotherapy hours, with curves for the intellectual defense and the physical complaint defense; the key marks interpretations and minor interpretations of the physical complaint defense.]

Fortunately, we examined another case
which showed some striking similarities to
this one. The general personality picture was
the same as the present case. In addition, the
two chief defenses were similar to the physical
complaints and intellectualizations charac-
terizing our present case. The second patient
had physical complaints, although he had
more insight into their psychogenesis. A cate-
gory comprising physical complaints and more
psychologically phrased feelings of tension,
conflict, and blocking was defined. The in-
tellectual defense was quite similar to the
first case. Figure 5 shows the course of physical,
etc. defensive statements and intellectual
defensive statements throughout the eight
hours of therapy. This second patient began
with physical complaints which decreased
from Hour 1 to Hour 3 without interpretation.
As the physical complaints decreased, the
intellectual defense increased. This was in-
terpreted in a punitive way in Hour 2 and,
less severely, in Hours 3 and 4. Both defenses
were low in Hours 3 and 4 and it was in these
hours, both in our opinion and in the unso-
licited opinion of the therapist, that the most
important hostile material came out. In Hour
5 it was the uninterpreted defense—physical
complaints—that showed the greatest increase.
In this case, the therapist then proceeded to
interpret the physical complaint defense on
Hours 5 and 6. Both defenses were low for
the last two hours.

FIG. 5. THE INTERACTION OF INTELLECTUAL AND
PHYSICAL COMPLAINT DEFENSES AND THE EFFECTS
OF INTERPRETATIONS THROUGHOUT THE THERAPY
OF A SECOND PATIENT
[Figure: number of statements plotted against psychotherapy hours, with interpretations marked.]

This second case in a sense provides a natural
experimental control for the first case. In the
first case intellectual defenses were highest at
first and decreased without interpretation; 
in the second case this was true of physical 
complaints. In the first case physical com- 
plaints supplanted intellectual defenses; in 
the second case intellectual defenses sup- 
planted physical complaints. In the first 
case the physical complaints met with the 
disapproval of the therapist; in the second 
case this was true of intellectualizations. In 
both cases the punitively interpreted defenses 
decreased. In both cases important hostile 
material emerged when both physical com- 
plaints and intellectualizations were low. In 
both cases the expression of hostility was 
followed by an upswing in the uninterpreted 
defense. These results strengthen our belief 
that the physical complaints and intellectuali-
zations are alternate defenses against anxiety.
They also tend to confirm the general conflict 
analysis we have made. The increase in de- 
fenses after the expression of hostility may 
be related to the "negative therapeutic effect" 
(1). We may also tentatively formulate the
hypothesis that, with this kind of patient,
uninterpreted (or unpunished) defenses have a
greater probability of occurrence when anxiety 
is increased than interpreted (or punished) 
defenses. It is obvious that much more evi- 
dence is needed before making any definite 
conclusions. 


SUMMARY AND CONCLUSIONS


In our present state of knowledge about
psychotherapy, final conclusions should be
avoided. Therefore, it is with trepidation that
we approach the task of making a comprehen- 
sive summary statement about our illustrative 
case. This is true even though the therapy was 
brief and the changes were probably not 
fundamental. We might ask about the success 
of this case. The therapist does not sound 
optimistic in his summary: “The patient now 
has insight into his problems but can't extri- 
cate himself from them. Because of a general 
character weakness it is improbable that short 
term therapy will be of much more value." 
But our feeling is that it is important to
understand what has happened during these 
interviews, whether or not the therapy was 
eminently successful. We feel that our ob- 
jective description aids in this. Here is a 
tentative recapitulation of the case in terms 
of the data presented in this study. 

The patient began treatment with a good 
deal of anxiety and subsequent defensiveness. 
His defensiveness decreased as a result of a 
combination of permissiveness about hostility 
and punitiveness about defenses on the part 
of the therapist. As this occurred he expressed 
strong hostility to his mother. This expression 
of hostility led to an increase in anxiety and 
defensiveness. The defense which increased 
was the one not previously punished by the 
therapist. Subsequently, hostility was dis- 
placed further and further away from his 
mother. Hostility to displaced objects was 
stronger because it aroused less anxiety. It is 
possible that because of the unpunished ex- 
pression of hostility to the displaced objects, 
the patient was able later in therapy, when 
environmental factors precipitated it, to 
express hostility about his mother much more 
strongly, at least in the therapeutic situation. 

Although this integration involves several 
assumptions, we feel it is strongly supported 
by the data. It will be noted that the state-
ments made in the summary concern motiva-
tional and defensive shifts. This is what the
categories were set up to measure. Other
events, such as the establishment of insight, 
require other kinds of categories. Thus, we 
offer no evidence to support the therapist's 
opinion that the patient understood the rela- 
tionship between his hostility and his de- 
pendency at the end of therapy. 

This preliminary study has been presented 
to illustrate the kinds of results which can
be obtained by studying psychotherapy care-
fully and quantitatively. We feel that it also
demonstrates the applicability of principles 
derived from animal and human experimenta- 
tion to complex human behavior. On the 
other hand, it is only data as clear as these 
that will stimulate experimental work and 
modify existing theories. We also hope that 
this study has indicated some of the difficulties 
involved in a behavioral analysis of psycho- 
therapy. Cases do not miraculously become 
comparable when studied quantitatively. New 
cases are expected to present as many problems 
as they solve. Nor do we feel that all of the 
important events in psychotherapy can be 
described by this method. Indeed, many im- 
portant things in this case have been ignored. 
But, as this method becomes more refined, 
and as equally objective measures are added 
to it, we feel that our understanding of psycho- 
therapy, and human behavior in general, will 
be furthered. 


REFERENCES 


1. Dorragp, J., & Muer, N. E. Personality and 
psychotherapy. New York: McGraw-Hill, 1950. 

2. Grummon, D. L. An investigation into the use of 
grammatical and psycho-grammatical categories 
of language for the study of personality and 
psychotherapy. Unpublished Ph.D. dissertation, 
Univer. of Chicago, 1950. 

3. LASSWELL, H. D. Certain prognostic changes dur- 
ing trial (psychoanalytic) interviews. Psychoanal. 
Rev., 1936, 28, 241-247. 

4. Miller, N. E. Experimental studies of conflict.
In J. McV. Hunt (Ed.), Personality and the be-
havior disorders. New York: Ronald, 1944. Pp.
431-465.

5. Miller, N. E. Theory and experiment relating
psychoanalytic displacement to stimulus-response
generalization. J. abnorm. soc. Psychol.,
1948, 43, 155-178.

6. Miller, N. E., & Kraeling, Doris. Displacement:
greater generalization of approach than avoidance 
in a generalized approach-avoidance conflict. 
J. exp. Psychol., 1952, 48, 217-221. 

7. Miller, N. E., & Murray, E. J. Displacement and
conflict: learnable drive as a basis for the steeper
gradient of avoidance than of approach. J. exp.
Psychol., 1952, 48, 227-231.

8. Murray, E. J. Displacement in psychotherapy.
Paper read at East. Psychol. Ass., Boston, 1953. 

9. Murray, E. J., & Miller, N. E. Displacement:
steeper gradient of generalization of avoidance 
than of approach with age of habit controlled. 
J. exp. Psychol., 1952, 48, 222-226. 

10. Rogers, C. R. Client-centered therapy. Boston:
Houghton Mifflin, 1951. 


Received December 12, 1952. 


CRITIQUES AND NOTES 


THE USE OF TAPE RECORDING TO SIMULATE A GROUP ATMOSPHERE¹


ROBERT R. BLAKE²
The University of Texas

AND

JACK W. BREHM
The University of Minnesota


THE present study explores the possibility that
tape recordings can be used to communicate
to a test subject the experience that he is a participating
member of a social group. If this method
can be employed to investigate the impact of group
membership on individual behavior, its research
applications should prove to be numerous.


THE EXPERIMENT

Application of the Method

To test the feasibility of this approach for a
standard problem, a brief description of a study
based on the autokinetic phenomenon will be
given (1). The situation used is comparable in
many respects to that employed by Sherif (2) in
his investigation of autokinetic judgments in the
group context. The important differences are that
(a) in our situation the subjects (Ss) understand
that each is experiencing the same amount of
movement, but that they are looking from different
observation posts, as would be the case, for example,
on shipboard (thus accounting for the fact
that they are not physically together); and (b) our
Ss use a millimeter scale, rather than inches, as
the unit through which they express their judgments.

As Sherif originally reported, judgments of the
movement experienced in the autokinetic situation
are responsive to group pressures, the rule being
that Ss’ responses tend to converge on a norm
established by the group. Given that finding, it is
predicted that the uninstructed S experiencing the
present situation will tend to locate his judgments
within the range of reports given by
the “taped” Ss. If he does give responses within
this range and if the range used is a most improbable
one, as inferred from the frequency with which
responses are given within it by Ss who judge in a
situation without social stimulation, this would
constitute evidence for the conformance-producing
power of the recorded group situation.


¹ The study was conducted in the Laboratory of
Social Relations, Harvard University, and was supported
in part by the United States Air Force under
Contract AF 33(038)-12782 monitored by the
Human Resources Research Institute. Permission is
granted for reproduction, translation, publication, and
disposal in whole and in part by or for the
United States Government. The cooperation of the
United States Navy in facilitating the conduct of the
study is also gratefully acknowledged. The authors
are indebted to Dr. Henry W. Riecken
for his help with design and administrative problems.

² On leave of absence from The University of Texas.


Explanation to the Subjects 


After taking him to the darkroom and adjusting 
his head-set and power microphone, the experimenter 
(E) left, informing S that he and the rest of the group 
who were located in adjoining rooms would be given 
further instructions over headphones. The S was also 
told that his code letter would be “G.” The E then
returned to the control room, started the tape recorder, 
and turned on a switch completing the circuit to S. 
The explanation that followed was addressed to the 
“taped” participants as well as to the uninstructed, 
critical S. The recording ran as follows: 

Experimenter: All right, here we go. I'm talking to 
you from our control room. Now I'll tell you all what 
I want you to do. But first I want to know if you can 
hear me all right. If you can, please answer okay when 
I call your code letter. Subject A (okay); subject B 
(okay); .. . subject E (okay); subject F (okay); subject 
G (four-second space). All right, that's fine. 

Now here's what the experiment is all about. Navy 
men on board ship or flying occasionally have to judge 
the distance which a light moves on the horizon. The 
Navy needs to know how men make such judgments 
when the conditions are poor—such as when it's dark. 

In this experiment each of you will estimate how far
a light moves in the dark. You will then report your
estimates. As on board ship, you will be able to hear
each other's reports. The answers you all give will then 
be used by our control room to make a single estimate 
of the distance the light moved. Since a person cannot 
be very accurate under these conditions, the combined 
judgments of the group will be much better than the 
individual reports. It's the group performance which 
makes the difference between a hit and a miss in this 
kind of work. 

Now here are your instructions. Please listen care- 
fully. 

When the experiment begins, a light will appear 
in front of you. Shortly after it appears it will begin 
to move. It will only be visible for a few seconds. Your 
task is to estimate the total distance the light moves. 
As soon as it stops, you will each be called by your code 
letter to give your estimate of how far it moved. Give 
your report in millimeters. (The Ss were shown a 
millimeter scale before the beginning of the experi- 
ment.) 

Are there any questions?

Subject D: Yeh, I've got a question. Does this light
move the same amount for all of us each time it 
comes on? 

Experimenter: Yes, we're doing this just like we did
before. Okay? Ready for trial one. (Ten-second space
in the tape.) Now give me your reports to the nearest
millimeter; first, subject A—(69); subject B . . . subject
G—(five-second space). Ready for trial two. (Eight-
second space.) Etc.

At the end of trial 10, E said, “That’s the end of 
this part of the experiment. Please remain in your 
seats until I bring you a questionnaire to fill out." 

The E then asked the uninstructed S to fill out a
questionnaire. After discussion and explanation of what-
ever part of the experiment he was interested in, S
was released. As far as could be determined, no one
diagnosed the concealed features of the study.


Subjects

The Ss participating in this study, 84 in number,
were drawn from Naval enlistees and Harvard Uni-
versity undergraduates.



Experimental Design

Three sets of conditions using simulated groups and
one condition involving judgment in the “alone” situa- 
tion were employed. The three synthetic conditions 
involved 10 trials, each consisting of six numerical 
values spoken by the “participants” as their judgments 
before the uninstructed S was called to give his report.
The 10 trials varied in two ways: they had five different 
ranges and they were built on two different scale cen- 
ters. In all cases the numerical distances between 
individual judgments were of equal amounts, and no 
numerical value was repeated in the same trial. Thus 
the median and mean coincided at the center for any 
given scale. 

For the diverging condition, scale centers alternated 
from 46 to 57, starting with the low center. The diverg- 
ing conditions started with a small range, with the 
range size increasing as the trials progressed. Trial 
one had a range of 15, trial two of 15, trial three of 20, 
and so on up to trials nine and ten, each of which had a
range of 45. The converging condition represents an 
exact duplication of the diverging one but in reverse 


TABLE 1

RANGES, SCALE CENTERS, AND NUMERICAL VALUES SPOKEN BY RECORDED SUBJECTS
UNDER THE SYNTHETIC GROUP CONDITIONS

        CONDITION I*: DIVERGING RANGE          CONDITION III: NONOVERLAPPING RANGE
TRIAL   RANGE  CENTER  STIMULUS VALUES         RANGE  CENTER  STIMULUS VALUES
  1      15     45.5   38-41-44-47-50-53        35     51.5   34-41-48-55-62-69
  2      15     56.5   49-52-55-58-61-64         5     30.5   28-29-30-31-32-33
  3      20     46.0   36-40-44-48-52-56        30     52.0   37-43-49-55-61-67
  4      20     57.0   47-51-55-59-63-67        10     31.0   26-28-30-32-34-36
  5      30     46.0   31-37-43-49-55-61        20     52.0   42-46-50-54-58-62
  6      30     57.0   42-48-54-60-66-72        20     31.0   21-25-29-33-37-41
  7      40     46.0   26-34-42-50-58-66        10     52.0   47-49-51-53-55-57
  8      40     57.0   37-45-53-61-69-77        30     31.0   16-22-28-34-40-46
  9      45     45.5   23-32-41-50-59-68         5     51.5   49-50-51-52-53-54
 10      45     56.5   34-43-52-61-70-79        35     30.5   13-20-27-34-41-48

* Condition II is the same as condition I but in reverse sequence.
TABLE 2

THE FREQUENCY OF RESPONSES LOCATED WITHIN THE SOCIAL RANGE CREATED BY THE
SIMULATED (RECORDED) GROUP, CONTRASTED WITH THE COMPARABLE FREQUENCY FOR
THE NONSOCIAL SITUATION

         CONDITION I                  CONDITION II                 CONDITION III
         CONTROL   EXP.               CONTROL   EXP.               CONTROL   EXP.
         FREQ.     FREQ.              FREQ.     FREQ.              FREQ.     FREQ.
TRIAL    (N = 27)  (N = 20)    p      (N = 27)  (N = 20)    p      (N = 27)  (N = 17)    p
  1         1         6      <.05        2        16      <.01        2        17      <.01
  2         1         5      <.10        4        12      <.01        2         9      <.01
  3         1        13      <.01        1        16      <.01        1        17      <.01
  4         0        12      <.01        2        11      <.01        2        13      <.01
  5         1        16      <.01        0        13      <.01        0        14      <.01
  6         1        12      <.01        2        11      <.01        4        14      <.01
  7         4        18      <.01        1        11      <.01        1         9
  8         4        17      <.01        4        10      <.20        4        17
  9         6        13      <.01        2         8      <.05        2         2
 10         3        15      <.01        0         6      <.01        4        11




since it began with wide ranges and ended with narrow 
ones. The nonoverlapping condition alternated between 
scale centers of 31 and 52, starting with the high value. 
However, in this condition ranges alternated from large 
to small. These ranges were designed in such a way 
(varying from 5 to 35 in width) that adjacent ranges 
never overlapped. The composition of each condition 
is reproduced in Table 1. In the actual experiment, the 
recorded order of numerical values within each trial 
was randomized. 
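
The construction rule stated above (six equally spaced values, none repeated, centered on the scale center) can be sketched in a few lines. This is only an illustration: the function name `stimulus_values` and its parameters are hypothetical, and the inputs are the range and center entries of Table 1.

```python
def stimulus_values(center, value_range, n_speakers=6):
    """Return n_speakers equally spaced values spanning value_range around center.

    Hypothetical helper: it reproduces the rows of Table 1 from the RANGE and
    CENTER columns alone, under the equal-spacing rule described in the text.
    """
    step = value_range / (n_speakers - 1)   # distance between adjacent reports
    low = center - value_range / 2          # lowest spoken value
    return [round(low + i * step) for i in range(n_speakers)]


# Trial 1 of the diverging condition: range 15, center 45.5
print(stimulus_values(45.5, 15))
# Trial 1 of the nonoverlapping condition: range 35, center 51.5
print(stimulus_values(51.5, 35))
```

Because the spacing is symmetric about the center, the mean and the median of each generated set coincide with the scale center, as the design requires.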

The fourth condition, introduced to furnish evidence 
for judgmental tendencies in the alone situation, 
provides the control data. Operating under conventional 
autokinetic conditions, but using exposure periods of 
the same length as those employed for the simulated 
group conditions, 27 Ss simply judged the distance 
of movement for 10 consecutive trials. 


RESULTS 


Table 2 shows the frequency with which control 
Ss gave responses within the range of numbers 
spoken by the recorded Ss under each of the three 
conditions outlined above, as contrasted with the 
frequency with which uninstructed Ss located 
their responses within the same range. It shows 
that under each of the synthetic group conditions 
more than 50 per cent of the responses fell within 
the spoken range of the recorded Ss, while only 
about 4 per cent of the responses given under the 
alone condition were located within the same 
region. The differences, which are of high statistical
significance, point to the conclusion that under
each of the three conditions the responses of the
uninstructed Ss operating as members of “synthetic”
groups were changed in the predicted
direction.
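
The report does not name the test behind the p values in Table 2. As a hedged illustration only, the sketch below applies a one-sided Fisher exact test (an assumption, not necessarily the authors' procedure) to a single row of the table: trial 1 of Condition I, where 1 of 27 control Ss and 6 of 20 experimental Ss responded within the spoken range. The function name is hypothetical.

```python
from math import comb


def fisher_one_sided(in_exp, n_exp, in_ctl, n_ctl):
    """One-sided Fisher exact p value for a 2 x 2 table.

    Gives the chance probability that the experimental group contains at
    least in_exp of the in-range responses, with all margins held fixed.
    Assumed test; the source does not say which test was used.
    """
    total_in = in_exp + in_ctl
    total = n_exp + n_ctl
    upper = min(total_in, n_exp)
    favorable = sum(
        comb(n_exp, k) * comb(n_ctl, total_in - k)
        for k in range(in_exp, upper + 1)
    )
    return favorable / comb(total, total_in)


# Trial 1, Condition I of Table 2: 6 of 20 experimental vs. 1 of 27 control
p = fisher_one_sided(6, 20, 1, 27)
print(round(p, 3))
```

Under this assumed test the probability comes out a little under .02, which is compatible with the tabled entry of <.05 for that trial.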


DISCUSSION AND CONCLUSIONS 


In this study a group frame of reference was 
created for the autokinetic effect solely by auditory 
stimulation. The reports of all “other” Ss were
recorded and communicated to the critical S over 
headphones. Eighty-four Ss distributed among 
three experimental and one control condition were 
asked to give 10 autokinetic judgments. For the
experimental conditions, judgment was made after
S listened to recorded reports from six other people. 
A control was provided by 27 Ss, each of whom 
made 10 judgments under the nonsocial condition. 
Owing to effective concealment of the critical 
feature in the study, the test Ss were, as far as 
could be determined, unaware that the judgments 
they heard were given by persons not actually 
present in other rooms. 

It was hypothesized that under these conditions 
an S would locate his judgments within the range 
of reports given by the group as a whole. Statistical 
evaluation indicated that the frequencies of judg- 
ments falling within the recorded social ranges were 
significantly greater for Ss who judged with others 
than they were for those who judged in the alone 
situation. It may be concluded that pressures to 
change can be created by skillfully prepared record- 
ings which simulate the conditions of a “live” 
experimental situation. 

These results suggest that the tape method of 
inducing the experience of group membership may 
facilitate the conduct of group research in several 
ways. For example, the conventional procedure of 
using paid participants entails at least three diffi- 
culties: changes within the experimental procedure 
due to variations in the behavior of participants 
from time to time, problems of scheduling, and the 
expense of employing instructed Ss. The use of 
magnetic tape recording eliminates the need for 
the repeated presence of paid participants, thereby 
reducing expense and problems of scheduling, and, 
at the same time, it automatically standardizes the 
major part of the experimental procedure. It should 
be clear, of course, that such a procedure imposes 
constraints on the possibilities of interaction, thus 
having its own intrinsic limitations. 

REFERENCES 
1. Brehm, J. W. A quantitative approach to the
measurement of social influence. Unpublished
bachelor’s thesis, Harvard Univer., 1952.
2. Sherif, M. A study of some social factors in per-
ception. Arch. Psychol., 1935, No. 187.


Received January 19, 1953. 


A CASE OF ESP: CRITIQUE OF “PERSONAL VALUES AND ESP SCORES”
BY GERTRUDE R. SCHMEIDLER 


E. I. BURDOCK 
Carnegie Corporation of New York 


JUST as the lodestone depends on compensation,
so the statistical test rests on certain governing 
assumptions which both limit its applicability and 
determine its meaning. Ignorance or disregard of 
the underlying assumptions frequently leads to 
false conclusions as to the significance or insignifi- 
cance of the results under assessment. 

In particular, one of the most commonly used 
statistical tests of significance, the t test, rests on
the assumptions that: (a) the trait whose meas- 
urement is being tested is normally distributed; 
(b) the variance of the samples is homogeneous; 
(c) the samples are a random selection. 

If any of these assumptions do not hold in a 
case under investigation, then the conclusions as to 
significance of findings may be held in doubt. 

An article in the October 1952 issue of this 
journal, entitled “Personal Values and ESP Scores” 
(3), presents an instance where use of the t test
may be questioned. In this investigation the ex- 
perimental subjects (Ss) were required to guess the 
order of items in concealed lists. The lists each 
consisted of 25 items chosen from five possibilities 
arranged in random order. The investigator ad- 
vises the reader that “with lists prepared in this 
way, the average number guessed correctly, if 
choice [sic] alone determined the guesses, would 
approximate five...." The binomial model invoked
here is thus: $(\tfrac{1}{5} + \tfrac{4}{5})^{25}$. Each “run” of 25
guesses by a single S is treated as a discrete 
sequence of independent events. A total of 959 
runs was secured from 122 experimental Ss, ap- 
proximately eight runs per S. In the statistical 
evaluation of the results, the investigator combined 
the runs extracted from various Ss into a single 
composite for each of two groups, a group of co- 
operative Ss appropriately termed “sheep” and a 
contrasting group of recalcitrant “goats.” 

Taking the above-listed assumptions in order, 
we note the following: 

1. The universe postulated has a range of dis- 
crete values from 0 to 25 and a mean of 5. It is 
thus neither normal nor continuously distributed. 

2. No test was made for homogeneity of 
variance. 

3. The randomness of the sample is in doubt, 
since the 959 runs are not drawn at random from 
the postulated universe, but only from 122 Ss. 
Eight runs drawn from one S are not necessarily 
the same as eight runs by eight different Ss. Even 
if all Ss are responding without benefit of ESP, 
they may be expected to differ from one another in 


their attack upon the problem. At the roulette 
wheel gamblers who employ different "systems" 
have different sequences of wins and losses, al- 
though all lose in the long run. 
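
Point 1 can be made concrete with a short computation. The sketch below is illustrative only; it assumes the binomial model quoted above (25 independent guesses, each correct with probability one-fifth) and tabulates the chance distribution of hits per run to show that it is discrete and skewed rather than normal.

```python
from math import comb

N, P = 25, 1 / 5          # guesses per run and chance hit probability (from the text)
Q = 1 - P

# Probability of exactly k hits in one run under the binomial model
pmf = [comb(N, k) * P**k * Q**(N - k) for k in range(N + 1)]

mean = sum(k * pk for k, pk in enumerate(pmf))
var = sum((k - mean) ** 2 * pk for k, pk in enumerate(pmf))
skew = sum((k - mean) ** 3 * pk for k, pk in enumerate(pmf)) / var ** 1.5

print(f"mean = {mean:.2f}, variance = {var:.2f}, skewness = {skew:.2f}")
```

The mean is Np = 5 and the variance Npq = 4, while the skewness is positive; a normal distribution would have zero skewness and would not be confined to the integers 0 through 25.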

A more appropriate device for the assay of a 
discrete nonnormal distribution of this type would 
be a nonparametric test such as chi square. Un- 
fortunately, when we seek to recompute the data 
in the published paper, the confounding of the 
results from different Ss makes it impossible to 
allow for the variance within Ss. What we need to 
know for the proper evaluation of this experiment 
is the number of hits and misses per S for each of 
the groups considered. However, because of the 
form in which the data are presented, this critical 
information is effectively obscured. 

Although we are unable to reanalyze the in- 
vestigator’s data as presented in her Table 1 for 
the reasons adduced above, the data presented in 
her Table 2 in terms of numbers of Ss lend them- 
selves to a chi-square analysis. Chi square for all 
Ss (sheep vs. goats) with respect to the numbers of 
Ss with averages “above,” “at,” and “below” 
mean chance expectancy, a 3 X 2 classification 
with 2 df, equals 2.1757. The probability of ob- 
taining such a value by chance is .35, which does 
not support the author’s contention that sheep 
score significantly better than goats. 

A similar analysis for Ss below the 90th per- 
centile on the Allport-Vernon Theoretical scale 
gives a chi square of .1387, and corresponding 
probability of .93, suggesting that sheep and goats 
here, too, can be considered samples of a common 
population.

While the subsample of Ss at or above the 90th 
percentile on the Study-of-Values Theoretical 
scale yields expected values too small to justify
computation of chi square, it is possible to compare 
all Ss above the 90th percentile with those below 
the 90th percentile by pooling sheep and goats. 


Chi square so computed, despite one weak fre-
quency of 3.41, yields a value of 3.5385 for a
probability of .18 that the observed differences are
fortuitous. This result contradicts the author's 
conclusion that dividing the Ss according to their 
scores on the Allport-Vernon Theoretical scale is
meaningful. 

One further possibility remains for the evalu- 
ation of the number of Ss at or above the 90th 
percentile on the Allport-Vernon Theoretical scale. 
That is to combine the categories for those at, and 
those below, mean chance expectancy. The resulting
2 X 2 table has one expected frequency of 3.81
with the other three between 5 and 10. Although 
this violates Lewis and Burke's (2) criterion that 
expected cell frequencies be at least 10, the chi 
square of 1.9753, with a probability of .17 by 
chance, is sufficiently far from the borderline of 
significance to obviate use of Fisher's direct method 
(1, pp. 96-97). The results here again support the 
hypothesis that there is only a random difference 
between sheep and goats, even theoretical sheep 
and goats. 
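
The probabilities attached to the four chi-square values above can be rechecked from closed forms, since the upper-tail probability is exp(-x/2) for 2 df and erfc(sqrt(x/2)) for 1 df. The sketch below is illustrative; the helper name `chi2_upper_tail` is hypothetical, and the 2 df assigned to the pooled comparison (3.5385) is an assumption, since the text does not state it.

```python
from math import erfc, exp, sqrt


def chi2_upper_tail(x, df):
    """Upper-tail probability of chi square for 1 or 2 degrees of freedom."""
    if df == 1:
        return erfc(sqrt(x / 2))
    if df == 2:
        return exp(-x / 2)
    raise ValueError("only df = 1 or df = 2 are handled in this sketch")


# (chi square, df, probability reported in the text)
reported = [
    (2.1757, 2, 0.35),   # all Ss, sheep vs. goats, 3 x 2 table
    (0.1387, 2, 0.93),   # Ss below the 90th percentile
    (3.5385, 2, 0.18),   # above vs. below 90th percentile, pooled (df assumed)
    (1.9753, 1, 0.17),   # collapsed 2 x 2 table
]

for x, df, p_text in reported:
    print(f"chi2 = {x:.4f}, df = {df}: p = {chi2_upper_tail(x, df):.3f} (text: {p_text})")
```

The recomputed probabilities lie within about .01 to .02 of the printed ones, a gap that may reflect rounding or interpolation in printed tables, and none of them approaches conventional significance.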

On the basis of these considerations the investi- 
gator's assertion that "one group of our Ss scored 
above mean chance expectancy while the other 
group scored below” is not sustained. The apparent 
difference between sheep and goats and between 
Ss with high and Ss with low theoretical values in 
the investigator’s Table 1 is merely an artifact of 
the way in which the data are tabulated. 

Having found no significant differences in the 
population with respect either to attitude toward 
the experiment or with respect to Allport-Vernon 
Theoretical scale scores, we may inquire whether 
the averages above, at, and below mean chance 
expectancy differ significantly from one another 
for the population as a whole. Upon further reflection,
however, we realize that the three categories
refer not to crudely grouped attributes but
to a frequency distribution. The data as presented
have the further peculiarity that the central category,
“averages at mean chance expectancy,”
represents a point on the scale, while the two other
intervals, “averages above mean chance expectancy”
and “averages below mean chance expectancy,”
are broad categories. To average at
mean chance expectancy, an S must have guessed
correctly exactly one-fifth of the approximately 200
items presented to him; a single item difference
casts him into one of the other categories together
with Ss departing widely from chance. Included in
the totals for each of the three categories, however,
are the results for “some latecomers” who “missed
the beginning of the session.” Presumably, on the
evidence of figures presented elsewhere in the
article, these numbered at most nine, if we assume
each latecomer missed only one run. However, we
are unable to make any adjustment for unequal
numbers in the absence of specific information, and
cannot say to what extent differences in
the tabled results reflect this discrepancy.

Now the probability of a point value in a discrete
distribution is not very large. Even though the
probability for the mean frequency is a maximum,
it can be shown to approach zero as N increases,
since

$$p_{\max} \approx \frac{1}{\sqrt{2\pi N p q}}$$

For N = 200, $p_{\max}$ = .07. If we follow the investigator’s
binomial model, $122\,(\tfrac{1}{5} + \tfrac{4}{5})^{200}$, we
should expect 7 per cent of the 122 Ss, or 8.54, to 
average at the mean by chance. Actually 16 Ss, 
twice the number expected by chance, lie at the 
mean. Although the data in the form tabulated 
are insufficient for computation of the dispersion, 
the piling up of cases at the mean suggests that 
the sample may have subnormal dispersion. A
sequence with subnormal dispersion is character- 
ized by probability compensation. In such a 
sequence there is a negative aftereffect from an 
item to its immediate successor shown in a tend- 
ency toward an exaggerated alternation of re- 
sponses and an avoidance of repetitions. Such 
sequences are often produced by individuals who 
naively try to construct a random sequence, but 
who fail to realize that a certain amount of cluster- 
ing is necessary in a normal frequency distribution. 
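
The point-probability figure used in this argument can be checked numerically. The sketch below is an illustration under the text's own assumptions (N = 200 guesses per S, chance probability one-fifth, 122 Ss); it compares the usual normal approximation to the modal binomial probability, 1/sqrt(2 pi N p q), with the exact binomial probability of exactly 40 hits.

```python
from math import comb, pi, sqrt

N = 200                   # approximate number of items per S (from the text)
P = 1 / 5                 # chance probability of a hit
Q = 1 - P
N_SUBJECTS = 122          # experimental Ss (from the text)

# Normal approximation to the probability of landing exactly on the mean
approx = 1 / sqrt(2 * pi * N * P * Q)

# Exact binomial probability of exactly N/5 = 40 hits
k = N // 5
exact = comb(N, k) * P**k * Q**(N - k)

# Expected number of Ss averaging exactly at the mean, using the rounded .07
expected_at_mean = N_SUBJECTS * round(approx, 2)

print(f"approximation = {approx:.4f}, exact = {exact:.4f}")
print(f"expected Ss at the mean = {expected_at_mean:.2f}")
```

Both figures round to .07, which gives the 8.54 Ss cited in the text against the 16 actually observed. The approximation also shrinks in proportion to 1/sqrt(N), which is the sense in which the modal probability approaches zero as N increases.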

Two possible causes for this phenomenon are: 
(a) The individual choices do not add up to a 
random sample of the Ss' behavior because 
grouping them into runs induces an interaction 
effect. (b) The behavioral universe itself is charac- 
terized by probability transfer with negative 
aftereffect. Such sequences, known as Markoff 
chains, may nevertheless converge toward a specific 
probability value, such as that of one-fifth postu- 
lated by this investigator. However, such sequences 
will not have as many runs as a normal sequence, 
but will show an excess of alternation over a genu- 
inely random distribution. 

In conclusion we are led inevitably to the per- 
suasion that the findings presented in the above 
article are due not so much to telepathy as to 
numerology. 


REFERENCES 


1. Fisher, R. A. Statistical methods for research workers.
(11th Ed. Rev.) New York: Hafner, 1950. 

2. Lewis, D., & Burke, C. J. The use and misuse of
the chi-square test. Psychol. Bull., 1949, 46, 433-
489.

3. SCHMEIDLER, GERTRUDE R. Personal values and 
ESP scores. J. abnorm. soc. Psychol., 1952, 47,
757-761. 


Received January 21, 1953. 


HOW NEUROTIC IS THE AUTHORITARIAN?¹


JOSEPH M. MASLING²
Institute for Research in Human Relations 


VERY few social scientists would deny that
social science research can be carried on inde-
pendently of prevailing social and political pres-
sures. The social scientist, because of his training
and interests, is constantly under the strain of 
examining objectively phenomena about which he 
privately has many strong, emotionally-tinted 
attitudes. An interesting example of one way in
which a certain set of these political pressures—in 
this case the current flood of loyalty investigations 
and the threat posed by current and past dictator- 
ships—has influenced social science research can be 
seen in the work done on authoritarianism. 

It is evident that the concept of the authori- 
tarian personality has been a popular area of in- 
vestigation, and that some extremely interesting 
and valuable knowledge has been made available 
by this research. It is not the purpose of this paper 
either to evaluate or to summarize this research. 
It will suffice to say that for the most part it is 
characterized by rich, insightful conceptions of 
personality and by sound experimental design. 
What will be done is to point out how in some in- 
stances otherwise sound research has unfortunately 
been rendered less effective by the descriptions 
painted of authoritarian individuals. 

To illustrate this point two of the major works 
in this field are quoted extensively—The Authori- 
tarian Personality, by T. W. Adorno, Else Frenkel- 
Brunswik, Daniel J. Levinson, and R. Nevitt San- 
ford (1), and Authority and Leadership: A Study of 
the Follower’s Orientation of Authority, by Fillmore 
H. Sanford (10). These two works were chosen be- 
cause of their unquestioned importance in the 
field and because they are in many ways superior 
to other research done in the area. 

It is evident to anyone reading the literature on 
authoritarianism that authoritarians are indeed 
nasty fellows. “They worry about egocentric and 
material things. They think in terms of blame and 
they appear to express aggression against the 
weak” (10, p. 43). Not only this, but authoritarians 
are conventional, submit uncritically in the face of 
authority, are anti-intraceptive, superstitious, and 
stereotypic in their thinking, are preoccupied with 
the “dominance-submission, strong-weak, leader- 
follower dimension,” overemphasize the conven- 
tionalized attributes of the ego, have exaggerated 
assertions of strength and toughness, are cynical 
and destructive, tend to believe that “wild and 


1 This paper was read at the 1953 meeting of the 
American Psychological Association in Cleveland, Ohio. 
2 Now at Syracuse University. 



dangerous things go on in the world,” and have 
“exaggerated concern with sexual ‘goings on’” (10, 
p. 7). In addition, authoritarian men are overly 
masculine and women overly feminine (10, p. 12).

Elsewhere, in a discussion of the “subsyndromes”
to be found among highly ethnocentric individuals, 
the authoritarian is characterized as achieving: 


social adjustment only by taking pleasure in obedi-
ence and subordination. This brings into play the
sadomasochistic impulse structure both as a condition
and as a result of social adjustment . . . . In the psycho-
dynamics of the “authoritarian character,” part of the
preceding aggressiveness is absorbed and turned into
masochism, while another part is left over as sadism
. . . . Ambivalence is all-pervasive, being evidenced
mainly by the simultaneity of blind belief in authority
and readiness to attack those who are deemed weak and
who are socially acceptable as “victims.” . . . He de-
velops deep “compulsive” character traits, partly by
retrogression to the anal-sadistic phase of development.
. . . [His religious belief is compulsive and punitive; he
has overt rigidity of conscience with] strong traces of
ambivalence . . . . The over-rigid superego is not really
integrated, but remains external (1, pp. 759-760).


There is not quite as much said about the au-
thoritarian’s opposite number, the equalitarian,
but it is evident at once that he is a fairly well-ad-
justed individual, albeit a trifle dull. He has no
interesting psychiatric syndromes and is, in gen-
eral, a more amiable, social creature than the
authoritarian. According to F. H. Sanford, equali- 
tarians "seem inclined to take a rational and im- 
punitive position in a situation of social stress and 
they demonstrate a tendency to be relaxed and 
mild in the face of personal inconvenience at the 
hands of an inferior" (10, p. 47). In relating to 
authority figures, equalitarians “talk in terms of
‘fairness’ and ‘kindness’ and ‘warmth,’ ” instead of
using the “cold bromides” of the authoritarian. 

Equalitarians again demonstrate a relatively ra-
tional and relaxed feeling for leaders [in contrast to the
authoritarian who regards authority with ambivalence].
. . . In the face of strong authority, they are more in-
clined to observe calmly what will happen than to fly off
immediately into either acceptance or rejection. . . . In
the face of strongly directive leadership they adopt a
rational, group-centered position instead of coming
down with petulance and maladaptive resentment
(10, pp. 94-95).

The “genuine liberal” is only one of five different 
syndromes to be found among individuals low in
ethnocentrism, according to Adorno et al. (1). Their
description of the “genuine liberal” is consistent
with the characterization of the equalitarian given
by F. H. Sanford: 



His ego is quite developed but not libidinized—he 
is rarely “narcissistic.” At the same time, he is willing 
to admit id tendencies, and to take the consequences— 
as is the case with Freud's “erotic type.” One of his 
conspicuous features is moral courage, often far beyond 
his rational evaluation of a situation. He cannot “keep 
silent” if something wrong is being done, even if he 
seriously endangers himself. Just as he is strongly 
“individualized” himself, he sees the others, above all,
as individuals, not as specimens of a general concept 
(1, p. 781).

These descriptions are summarized in Table 1.
It is readily apparent that the authoritarian comes 
out second best. If this characterization is accurate, 
then it is reasonable to expect statistically signifi- 
cant differences between authoritarians and 
equalitarians on a test of mental health. After all, 
the authoritarian is compulsive, punitive, rigid, 
anal-sadistic, etc., while the equalitarian is more
flexible, erotic, rational, relaxed, etc. 

Four separate studies have been conducted 
which present data on precisely this problem. In 
each of these studies a different measure of authori- 
tarianism and a different criterion of mental health 
have been used. Yet in none of them has a statisti- 
cally significant relationship been found between 
authoritarianism and mental health. It may be easy 
to discount any one study which fails to confirm 
the experimental hypothesis on the grounds that 
the operational measures of the independent and 
dependent variables are inappropriate or weak. 
However, when the inference that authoritarians 
are more neurotic than equalitarians cannot be 
confirmed in any one of four studies using eight 
different measures of the dependent and indepen- 
dent variables, it begins to appear that the infer- 
ence and its underlying theory are in error. These 
four studies are presented below; only a brief account
of each is given, since more detailed descriptions
are available elsewhere.

1. [Ethnocentrism] in relation to the Minnesota
Multiphasic Personality Inventory. Curiously
enough, the very first study in this area was reported
in the same volume that presented a neurotic
picture of the authoritarian. In this study,
scores on the California Ethnocentrism (E) Scale
were compared with scores on the MMPI. The subjects
were 34 men and 48 women, all of whom were
[treated] in the Langley Porter Clinic in San Francisco,
an institution for the diagnosis and treatment
of [psychiatric] disorders. “Comparisons of average
[scores] on the various MMPI scales for the four
E(thnocentrism) quartiles and preliminary inspection
[of individual] and group test-profiles failed to
[reveal] relationships between E [and] psychiatric
syndromes as measured [by the] Inventory” (1, p. 910).

“As far as statistical significance of most of the
results is concerned much is left to be desired. The
scope of the investigation did not permit the use of
many more than 120 subjects. For many of our
comparisons this group had to be divided into
many small subgroups. Taken one by one, most of
the numerical results therefore are not statistically
significant, nor otherwise impressive” (1, p. 961).

TABLE 1
COMPARISON OF PERSONALITY CHARACTERISTICS
ASCRIBED TO AUTHORITARIAN AND
EQUALITARIAN INDIVIDUALS

AUTHORITARIAN                           EQUALITARIAN
 1. Conventional (1, pp. 229-232;        1. Strongly individualized
    10, p. 7)                               (1, p. 781)
 2. Aggressive to unconventional         2. Relaxed and mild (10, p. 47)
    individuals (1, pp. 232-234;
    10, p. 7)
 3. Anti-intraceptive (1, pp.            3. Readiness toward intraception
    234-235; 10, p. 7)                      (1, p. 466)
 4. Superstitious (1, pp. 235-236;       4. Scientific-naturalistic attitude
    10, p. 7)                               (1, p. 464)
 5. Stereotyped in thinking (10, p. 7)   5.
 6. Cynical (1, pp. 238-239;             6. Moral courage (1, p. 781)
    10, p. 7)
 7. Destructive (1, pp. 238-239;         7.
    10, p. 7)
 8. Concern with sexual                  8. Willing to admit id tendencies
    identifications (10, p. 121)            and take the consequences
                                            (1, p. 781)
 9. Sadomasochistic (1, p. 759)          9.
10. Ambivalent toward authority         10. Rational and relaxed feelings
    (1, p. 759; 10, p. 94)                  for leaders (10, p. 94)
11. Compulsive (1, p. 759)              11. Flexible (1, p. 463)
12. Analsadistic (1, p. 759)            12. Erotic (1, p. 781)
13. Punitive (1, p. 409)                13. Impunitive (10, p. 43)
14. Cold (10, p. 77)                    14. Warm (10, p. 77)
15. Directive (10, p. 79)               15. Nondirective, group-centered
                                            (10, p. 168)
16. Maladaptive (10, p. 95)             16. Rational (10, p. 95)

2. Authoritarianism and Personal Security Scale
scores.³ In 1949, 963 residents of Philadelphia,
selected from 24 random census tract areas, were
interviewed in a study of the leader-follower
relationship. In the course of this interview the
subjects were administered the
Authoritarianism-Equalitarianism (A-E) Scale and the
Personal Security (P.S.) Scale. Both these scales have
demonstrated that they can be used validly for specific
[purposes], the A-E scale in predicting responses on
questions regarding leadership (4, 10) and the P.S.
Scale in successfully differentiating between
hospitalized psychoneurotics and interns (6).

³ This and the following two studies are part of a
larger series of investigations of leadership being
conducted by the Institute for Research in Human
Relations. This research is sponsored by the Office of
Naval Research (Contract No. N8onr-69401). Mr. F. Loyal
Greer, of the Institute staff, was responsible for the
analysis of the data concerning personal security scale
scores and authoritarianism.

A Pearson correlation coefficient computed between
the scores on these two scales was -.03, with
a standard error of .071. A t value for the difference
in mean scores was .48. It is evident that the
relationship between personal security and A-E is not
statistically significant (4, p. 103).

3. Relationship between scores on the California
F test and the Rotter Incomplete Sentences Test.⁴
Sixty-four students attending summer school
classes in introductory history and educational
psychology at Temple University were administered
the California F(ascism) test (forms 45 and
40) and the Rotter Incomplete Sentences Test (8).
This particular version of the sentence completion
method has been found useful by several investigators
(2, 7, 9); the F test is widely used and is
described by Adorno et al. (1).

A Pearson correlation coefficient computed between
these two measures was not statistically
significant (r = -.03). To maximize whatever differences
there might have been, the 14 most equalitarian
and the 14 most authoritarian subjects were
selected, and a comparison of their incomplete
sentences scores made. This t test showed that the
differences were not significant. Bearing in mind
that the authoritarian is theoretically ambivalent
toward parents and authority figures, the scores
made on the sentence stems “back home,” “a
mother,” and “my father” were averaged and related
to the F-test score. The Pearson correlation
coefficient again was not significant (r = -.04)
(4, p. 138).
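
As a rough check on the correlational result reported above, a Pearson r can be converted to a t statistic with n - 2 degrees of freedom. The sketch below assumes n = 64 (the number of students tested) and r = -.03; the function name and the critical value of about 2.00 (two-tailed, .05 level, 62 degrees of freedom) are illustrative assumptions and do not come from the original report.

```python
import math


def t_from_r(r: float, n: int) -> float:
    """Convert a Pearson r based on n paired scores into a t statistic (df = n - 2)."""
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)


# r and n are taken from the study described in the text.
R_OBSERVED = -0.03
N_SUBJECTS = 64
# Approximate table value for df = 62, two-tailed, .05 level (an assumption).
T_CRITICAL = 2.00

t_value = t_from_r(R_OBSERVED, N_SUBJECTS)
significant = abs(t_value) >= T_CRITICAL
print(f"t = {t_value:.3f}, df = {N_SUBJECTS - 2}, significant: {significant}")
```

Under these assumptions the t value is far smaller in absolute size than the critical value, which agrees with the statement that the correlation was not significant.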

4. A comparison of the authoritarianism of 
nonhospitalized recruits and recruits under observa- 
tion in a recruit special evaluation hospital. The 
measure of authoritarianism used in this study, 
called the Authority Acceptance Scale (AAS), was 
constructed to aid prediction about the recruit’s 
adjustment to a military situation (5). The meas- 
ure of mental health used was confinement, or lack 
of it, in a Navy hospital used for the observation 
of those recruits who have been unable to adjust 
to the Navy situation. The hospitalized recruits are 
described in an official Navy publication as follows: 
“The prevalent pattern of these recruits seems to 
include pronounced evidence of immature behav- 
ior, feelings of dependency in conflict with hostile 
actions against the environment and sufficient dis- 
play of emotional instability to produce malad- 
justment” (3). 

The subjects of this study consisted of the entire
population (N = 49) of recruit patients in the
Bainbridge, Maryland, Naval Training Center
Special Evaluation Unit Hospital.⁵ Their scores on
the AAS were compared with the scores of 1,000
nonhospitalized recruits. A t test computed for the
difference between these means equaled 1.89, a
figure which does not reach statistical significance.

⁴ Drs. Clarence Smeltzer, Richard Martin, and Harry
Woehr of Temple University were instrumental in
providing Ss for this study.

⁵ The author is indebted to Lt. W. B. Lyon of the
Special Evaluation Hospital at Bainbridge who was
responsible for the administration of the AAS.
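
The reported t of 1.89 can be placed against a conventional criterion with a short calculation. The sketch below is only an approximation under stated assumptions: it treats the statistic as a pooled two-sample t with 49 + 1,000 - 2 degrees of freedom, uses the normal distribution in place of the t distribution (reasonable for so many degrees of freedom), and adopts a two-tailed .05 level. None of these choices is stated in the original report, and the function name is hypothetical.

```python
from statistics import NormalDist


def two_tailed_p_normal(t: float) -> float:
    """Two-tailed p value for t, using the normal approximation (large df)."""
    return 2.0 * (1.0 - NormalDist().cdf(abs(t)))


T_REPORTED = 1.89        # value reported in the text
DF = 49 + 1000 - 2       # assumes a pooled two-sample t test
ALPHA = 0.05             # conventional level, assumed here

p_value = two_tailed_p_normal(T_REPORTED)
print(f"df = {DF}, p is about {p_value:.3f}, significant at {ALPHA}: {p_value < ALPHA}")
```

On these assumptions the two-tailed p value falls just above .05. A one-tailed test would put the same value below .05, so the conclusion of nonsignificance depends on the two-tailed convention assumed here.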

In the light of these data it seems evident that 
the characterization of the authoritarian has been 
overdrawn. All things evil have been posited in one 
end of the distribution; all things healthy and 
democratic have been attributed to the other. It 
has been natural to slip from a rigorous, scientific, 
“show-me” frame of reference, to one which has 
been heavily influenced by the researcher’s concern 
about world conditions. The concept of the author- 
itarian personality may be a valuable, heuristic 
tool, but only if it can be divorced as much as pos- 
sible from value judgments. There seems to have 
been a tendency to use the term "authoritarian" as 
a mild profanity which one could use to describe 
other people (never oneself). It is suggested that 
the concept be re-examined in the light of the data, 
purified of value judgments, and put to use again 
as a research instrument. 


REFERENCES 


1. Adorno, T. W., Frenkel-Brunswik, Else,
Levinson, D. J., & Sanford, R. N. The authoritarian
personality. New York: Harper, 1950.

2. Brown, S. W. The use of an incomplete sentences
test for the study of attitudes toward Negroes.
Unpublished doctor’s dissertation, Ohio State
Univer., in progress.

3. Bureau of Medicine and Surgery. Psychiatric
unit operational procedures. Office of Naval
Research, Contract No. N6-ori-158.

4. Courtney, P. D., Greer, F. L., & Masling, J.
Leadership identification and acceptance. Series
1952, No. 1, Philadelphia: Institute for Research
in Human Relations.

5. Courtney, P. D., Greer, F. L., Masling, J., &
Orlans, H. Leadership identification and acceptance.
Series 1953, No. 1, Philadelphia: Institute
for Research in Human Relations.

6. Knutson, A. L. The nature of personal security.
Unpublished doctor’s dissertation, Princeton
Univer., 1949.

7. Morton, R. B. A controlled experiment in
psychotherapy based on Rotter’s social learning theory
of personality. Unpublished doctor’s dissertation,
Ohio State Univer., 1949.

8. Rotter, J. B., & Rafferty, Janet E. The Rotter
Incomplete Sentences Blank. New York: The
Psychological Corp., 1950.

9. Rotter, J. B., Rafferty, Janet E., & Schachtitz,
Eva. Validation of the Rotter Incomplete Sentences
Blank for college screening. J. consult.
Psychol., 1949, 13, 348-356.

10. Sanford, F. H. Authoritarianism and leadership:
a study of the follower's orientation to authority.
Philadelphia: Institute for Research in Human
Relations, 1950.


Received March 4, 1953. 


REVIEWS 


THE LIFE AND WORK OF SIGMUND FREUD. Volume
I. THE FORMATIVE YEARS AND THE GREAT
DISCOVERIES, 1856-1900. By Ernest Jones.
New York: Basic Books, 1953. Pp. xiv +
428. Price $6.75.

The first volume of the trilogy Dr. Jones promises
is a book of unparalleled interest and importance
for psychologists of all schools and theoretical
persuasions. It presents an absorbing story which
will never be more fully nor better told. The
historical importance of Freud and his ideas hardly
needs to be labored, and it is perhaps enough to 
say that this book is, in the reviewer’s opinion, the 
best available introduction to an understanding of 
the man and of psychoanalysis as he developed it. 
For it presents the work as well as the life of Freud, 
and carefully traces the development of psycho- 
analytic ideas up to their first great climax in The 
Interpretation of Dreams. 

The organization of this first volume, and
presumably of the whole work, is basically chronological.
At the same time, however, important topics
are treated as a whole within the general framework,
so that all the facts about episodes in
Freud's life, such as his extraordinary friendship
with Fliess, are told together. The result is a
satisfactory compromise, although there is inevitably
some overlap and repetition, and the reader
does not get an integrated picture of all that was
going on in Freud's complicated life stream during
certain crucial periods of his mature years.

We begin with an interesting chapter on Origins,
which tells a little about Freud’s family and,
surprisingly, quite a lot about his early childhood.
This chapter becomes even more meaningful after
one has read the rest of the book. The story of
Freud's school years is less meaty, but many
interesting facts begin to emerge when his higher
education is discussed. Hardly “progressive” by many
modern standards, Freud's university let him go
[through] medical school at his own pace, give himself
a liberal education as he went along, and carry
[on] anatomical research, the first report of
which was published before he was 21! True, he
was almost 25 when he got his M.D., but he had
[taken part] in half a dozen pieces of research,
[several] of them published, had translated a volume
of [J. S.] Mill, and had had a year of compulsory
[military] service. Not bad for one's first quarter
[century, by any] standards.

It [is tempting] at this point to go into a more
[extended] description of Freud's personality as it
[emerges] in Jones’s account, but space does not
[permit]. The fascination of this case goes far beyond
[the] interest inherent in any thoroughly
[documented life], and beyond what is implied in the
[fact that] we can at last turn the tables on the
[great] analyzer of others. Even if he had been an
[unknown], we should have been intrigued by the
human interest of his story. But the effect is all
the greater because the grave and dignified figure
of the later photographs is suddenly revealed to
have been an intensely passionate, vivid, fallible,
and human person, torn with inner conflicts and
neurotic troubles of his own. Titan though he was
intellectually, he made his historic discoveries with
pain, self-doubts, and tremendous labor. The
achievement seems the more remarkable when we
learn it was done against a background of suffering
from illnesses that would have incapacitated
a man without his tremendous self-control: neuritis,
sinusitis, migraine, “neurasthenia,” and others.

The fact that this biography has made the best- 
seller lists should not be surprising, but may lead to 
some misconceptions. Jones has not abated a whit 
of his customary dignity and urbanity in writing 
this book. He has sought no sensationalism; most 
of his writing is clear, occasionally it is witty and 
graceful, and it makes effortless reading in the 
narrative sections. But he has made no effort to 
spare the reader. When it comes to theoretical or 
technical issues, Jones certainly makes good his 
expressed intention to write for his psychoanalytic 
colleagues. The great merit of the book lies in its 
material—painstakingly gathered, worked over 
with loving care, presented readably—rather than 
in what Jones does with it. How often does it hap- 
pen that an intimate history can be written about 
one of the great movers and shakers of the civi- 
lized world, so that we can follow day by day the 
development both of a tempestuous romantic love 
affair and of the painful wresting of basic discover- 
ies from that most stubborn of nature’s recesses, 
the human mind? 

It is particularly ironic that the materials exist 
that enable Jones to tell us this story about a man 
who had such a strong sense of privacy, and who 
took pains to prevent any future public knowledge 
of his private life even when he was thirsting for 
fame. Yet even though he several times destroyed 
personal records and diaries, he was betrayed by 
his own magnificent articulateness and by his need 
to express himself to someone from whom he got 
emotional support. Hence the two series of letters, 
to his fiancée and to his friend Fliess, which have 
fortunately been preserved. Good use has been 
made of them, but the reader of this book will not 
only look forward with impatience to the forth- 
coming translation of the Fliess correspondence, 
but will clamor for the publication of the love 
letters. May I add a speculation that it is not 
wholly accidental that Freud did express himself 
so fully and openly to people who were not likely 
to destroy his letters? It seems not too much to 
hazard a guess that where so much need for privacy 
is coupled with a longing for fame as great as 
Freud’s, there exists some unconscious exhibition- 
ism that may find obscure expression. 




Yet to Dr. Jones such speculations apparently 
would seem to go too far. The book is remarkable 
for the lack of psychoanalyzing in it. Its author has 
limited himself primarily to a painstaking recon- 
struction of the facts; indeed, sometimes he gets a 
little lost in a scholarly enthusiasm for exact dating 
or placing or for unearthing facts that are not given 
much significance. On the whole, however, most 
psychological readers will find this approach 
preferable to one of overselectiveness and luxuriant 
interpretation. The facts receive a modicum of
basic, rather conventional psychoanalytic interpre- 
tation and are at times related to specific character 
traits, but without any attempt to integrate them 
into an intelligible portrait. 

There is some evidence that Jones deliberately 
eschewed interpretation at a number of points. For 
example, on page 85, he raises the question of what 
the basis of Freud’s self-punishing trends was, but 
says: “The answer to this question we must leave 
to the psychoanalysts.” In large part, then, his
intention may have been to furnish all the facts 
to his colleagues, knowing that they would make 
their own interpretations anyway! Readers who 
can take such unexplained terms as “empyema of
the antrums” in stride can certainly be expected
to be able to supply the hypothesis that Freud’s 
irrational antagonism toward Breuer after 1895 
had in it elements of “transference from earlier 
figures in his life—ultimately his father" (p. 308). 
Yet Jones does give us just this kind of psycho- 
analytic interpretation. 

Even more puzzling, the author’s interpreta- 
tions occasionally seem surprisingly superficial and 
unconvincing to anyone analytically informed. 
Perhaps the most amusing is Jones’s labored at- 
tempt to explain why Freud, who was profes- 
sionally so interested in sex, nevertheless “dis- 
played less than the average personal interest in 
what is often an absorbing topic.” He suggests 
“that Freud’s interest in sexual activities, like that 
in the aphasic disturbances of speech, came from 
the fact that sexuality has so obviously both 
physical and mental components” (p. 272). Then, 
too, he uncritically accepts Freud’s own “interpretation”
of his alleged lack of interest in the therapeutic
aspects of medicine: “My innate sadistic
disposition was not a very strong one, so that I 
had no need to develop this one of its derivatives” 
(p. 28). This in spite of voluminous evidence, 
faithfully set forth, about fairly overt sadism in 
Freud’s childhood, unusually intense hatreds in 
his adult life, and countless indirect manifestations 
such as depressions, somatizations, etc. In general, 
Jones makes very little use of aggression as an
explanatory principle, nor does he refer often to
specific defenses other than repression. Such omissions
may be due to the fact that psychoanalysis
as he learned and practiced it for many years was
essentially a single-instinct (libido) theory and
an id psychology. Like Freud himself, many of the
oldest generation of analysts never fully assimilated
the implications of the dual instinct theory nor of
ego psychology into their thinking.

Yet in many other contexts that do not directly 
involve hostility, Jones shies away from obvious 
interpretations. I have not counted the references 
to fame that Freud made, quoted in this book, 
but there are scores of them, alternating between
rather straightforward wish-fulfilling fantasies of
world renown and frequent denials (technically,
negations) that he had any interest in leaving his
name carved on a rock for future generations. Yet
Jones accepts the protestation, apparently without 
question or considering the possibility that the wish 
was strong but the defense almost equally so. 

Jones’s exposition of Freud’s early ideas is clear 
and helpful, though not particularly critical.
Throughout, he tries manfully to be unbiased, and 
does indeed make many statements that are any- 
thing but praising; on the whole, he achieves a 
surprising degree of objectivity, considering how 
hard it is to be objective about so controversial a 
subject. But I am reluctantly forced to conclude 
that the interpretative weaknesses mentioned de- 
rive from a limitation in Jones’s ability to conceive 
his hero in unflattering terms. 

A couple of other criticisms might be made.
Though Jones does a good job of picturing the
intellectual tradition and context out of which
Freud’s ideas came, the book is weak on the social
and cultural influences on his life and thought.
For example, he does not fully convey to the reader
the extraordinary prestige that university appointments,
particularly professorships, enjoyed in
Vienna. Nor does he develop much the part that
Freud's Jewish cultural heritage played in his life
and thought.

A final minor point: psychoanalytic technique, 
as Jones consistently depicts it, seems extremely 
passive, nonmanipulative, anything but directive 
or interventive. Certainly the new method that 
Freud developed was all of this, in contrast to con- 
ventional and prevailing psychiatric treatments, 
but today the emphasis gives a distorted impression.
For example, Jones hardly mentions interpretation
in his description of psychoanalytic therapy.

Some of this criticism may be premature. After
all, two more volumes are to appear, and in them
Jones may plan to do much that he has left undone
in this first volume. It stands alone to such a
great extent, with a real dramatic unity, that one
tends to forget that most of the work has not yet
been done. At any rate, a great wealth of fact
about Freud’s first 44 years has been made available.
If Jones plans to end the project by a summary
portrait, it would give us further reason to
look forward even more eagerly to volumes 2 and 3.

ROBERT R. HOLT
Research Center for Mental Health,
New York University.



NEW BOOKS RECEIVED 


Abrahams, J., & Varon, Edith. Maternal dependency
and schizophrenia. New York: International
Universities Press, 1953. Pp. 240. Price $4.00.

Ackoff, R. L. The design of social research.
Chicago: Univer. of Chicago Press, 1953. Pp. 420.
Price $7.50.

Aronson, M. J. The juridical evolutionism of
James Coolidge Carter. Toronto: Univer. of
Toronto Press, 1953. Pp. 53. Price $2.25.

Baum, Vicki. The mustard seed. New York: Dial
Press, 1953. Pp. 437. Price $3.95.

Berg, L., & Street, R. Sex: methods and manners.
New York: The McBride Co., 1953. Pp. 250.
Price $3.50.

Blitsten, Dorothy R. The social theories of
Harry Stack Sullivan. New York: William-Frederick
Press, 1953. Pp. 186. Price $3.50.

Bonaparte, Marie. Female sexuality. New York:
International Universities Press, 1953. Pp. 225.
Price $4.50.

Bosselman, Beulah C. The troubled mind. New
York: Ronald, 1953. Pp. 206. Price $3.50.

Brown, J. S., et al. Current theory and research in
motivation: a symposium. Lincoln, Nebraska:
Univer. of Nebraska Press, 1953. Pp. 193.
Price $2.00.

Buchanan, W., & Cantril, H. How nations see
each other. Urbana: Univer. of Illinois Press,
1953. Pp. 220. Price $5.00.

Campbell, J. D. Manic-depressive disease.
Philadelphia: Lippincott, 1953. Pp. 403. Price $6.75.

Cantor, N. The teaching-learning process. New
York: Dryden, 1953. Pp. 350. Price $2.90.

Carothers, J. C. The African mind in health and
disease, a study in ethnopsychiatry. New York:
Columbia Univer. Press, 1953. Pp. 177.
Price $2.00.

Carroll, J. B. The study of language. Cambridge:
Harvard Univer. Press, 1953. Pp. 289. Price
[illegible].

[Cohen, E. A. Human] behavior in the concentration
camp. New York: Norton, 1953. Pp. 295.
[Price illegible.]

Deutsch, F. (Ed.) The psychosomatic concept in
psychoanalysis. New York: International
Universities Press, 1953. Pp. 182. Price $4.00.

Dollard, J., Auld, F., Jr., & White, Alice M.
Steps in psychotherapy. New York: Macmillan,
1953. Pp. 222. Price $3.50.

Eysenck, H. J. The structure of human personality.
[New York:] Wiley, 1953. Pp. 348. Price $5.75.

Feigl, H., & Brodbeck, May. Readings in the
philosophy of science. New York: Appleton-Century-Crofts,
1953. Pp. 811. Price $6.00.

Fenichel, Hanna, & Rapaport, D. (Eds.) The
collected papers of Otto Fenichel. New York:
Norton, 1953. Pp. 408. Price $6.50.

Gerth, H., & Mills, C. W. Character and social
structure. New York: Harcourt, Brace, 1953.
Pp. 490. Price $6.00.

Ginzberg, E., Herma, J. L., & Ginsburg, S.
Psychiatry and military manpower policy.
New York: King’s Crown Press, 1953. Pp. 66.
Price $2.00.

Greenacre, Phyllis. Affective disorders:
psychoanalytic contributions to their study. New York:
International Universities Press, 1953. Pp. 212.
Price $3.00.

Greenblatt, M., & Solomon, H. C. Frontal lobes
and schizophrenia. New York: Springer Publishing
Co., 1953. Pp. 425. Price $12.50.

Harms, E. Essentials of abnormal child psychology.
New York: Julian Press, 1953. Pp. 265.
Price $5.00.

Hiltner, S. Sex ethics and the Kinsey reports.
New York: Association Press, 1953. Pp. 238.
Price $3.00.

Hovland, C. I., Janis, I. L., & Kelley, H. H.
Communication and persuasion. New Haven:
Yale Univer. Press, 1953. Pp. 315. Price $4.50.

Hsu, F. L. K. Religion, science, and human crises.
London: Routledge & Kegan Paul, 1952.
Pp. 142. Price $3.50.

Hurlock, Elizabeth B. Developmental psychology.
New York: McGraw-Hill, 1953. Pp. 556.
Price $6.00.

Jacobi, Jolande. (Ed.) Psychological reflections: an
anthology from the writings of C. G. Jung.
New York: Pantheon Books, 1953. Pp. 342.
Price $4.50.

Kahn, A. J. A court for children: a study of the New
York City children’s court. New York: Columbia
Univer. Press, 1953. Pp. 359. Price $4.50.

Kogan, L. S., Hunt, J. McV., & Bartelme,
Phyllis F. A follow-up study of the results of
social casework. New York: Family Service
Ass. of America, 1953. Pp. 115. Price $2.50.

Laucks, I. F. A speculation in reality. New York:
Philosophical Library, 1953. Pp. 154. Price
$3.75.

Leitner, K. Hypnotism for professionals. New
York: Stravon Publishers, 1953. Pp. 127.
Price $4.00.

Lindgren, H. C. Psychology of personal and social
adjustment. New York: American Book Co.,
1953. Pp. 481. Price $4.50.

Maves, P. B. The church and mental health. New
York: Scribner’s, 1953. Pp. 303. Price $4.00.

Macgregor, Frances C., Abel, Theodora M.,
Bryt, A., Lauer, Edith, & Weissmann,
Serena. Facial deformities and plastic surgery:
a psychosocial study. Springfield, Ill.:
Charles C Thomas, 1953. Pp. 230. Price $5.75.

McClelland, D. C., Atkinson, J. W., Clark,
R. A., & Lowell, E. L. The achievement
motive. New York: Appleton-Century-Crofts,
1953. Pp. 348. Price $6.00.

Mead, Margaret. (Ed.) Cultural patterns and
technical change. Paris: UNESCO, 1953. Pp. 348.
Price not supplied.

Milbank Memorial Fund, 1952 Annual Conference.
Interrelations between the social environment
and psychiatric disorders. New York:
Milbank Memorial Fund, 1953. Pp. 265.
Price $1.50.

Nichtenhauser, A., Coleman, Marie L., &
Ruhe, D. S. Films in psychiatry, psychology
and mental health. New York: Health Education
Council, 1953. Pp. 269. Price $6.00.

Notcutt, B. The psychology of personality. New
York: Philosophical Library, 1953. Pp. 259.
Price $4.75.

Nunn, H. L. The whole man goes to work. New York:
Harper, 1953. Pp. 214. Price $3.00.

Perry, J. W. The self in psychotic process: its
symbolisation in schizophrenia. Berkeley:
Univer. of California Press, 1953. Pp. 184.
Price $5.00.

Pennsylvania State College, Psychotherapy
Research Group. Group report of a program
of research in psychotherapy. State College,
Pa.: Pennsylvania State Coll. Press, 1953.
Pp. 179. Price $2.25.

Redfield, C. E. Communication in management.
Chicago: Univer. of Chicago Press, 1953. Pp. 290.
Price $3.75.

Riese, W. The conception of disease, its history, its
versions and its nature. New York: Philosophical
Library, 1953. Pp. 120. Price $3.75.

Sartre, J-P. Existential psychoanalysis. New
York: Philosophical Library, 1953. Pp. 275.
Price $6.00.

Schneider, Elisabeth. Coleridge, opium and
Kubla Khan. Chicago: Univer. of Chicago
Press, 1953. Pp. 378. Price $5.00.

Seidman, J. M. (Ed.) The adolescent: a book of
readings. New York: Dryden, 1953. Pp. 789.
Price $4.50.

Serog, M. New light on dreams. Boston: The
House of Edinboro, 1953. Pp. 159. Price $3.00.

Shaw, F. J., & Ort, R. S. Personal adjustment in
the American culture. New York: Harper,
1953. Pp. 388. Price $4.00.

Sherif, M., & Wilson, M. O. (Eds.) Group
relations at the crossroads. New York: Harper,
1953. Pp. 379. Price $4.00.

Smul, J. S. Respiratory diseases and allergy: new
method of approach. New York: Medical
Library Co., 1953. Pp. 80. Price $2.75.

Stephens, E. A. Lawless youth. New York:
Pageant Press, 1953. Pp. 315. Price $3.50.

Stephenson, W. The study of behavior: Q-technique
and its methodology. Chicago: Univer. of Chicago
Press, 1953. Pp. 376. Price $7.00.

Travaux de la Section de Psychologie Clinique
du XIe Congrès de l'Association Internationale
de Psychotechnique. Revue de
Psychologie appliquée. Paris: Centre de Psychologie
Appliquée, 1953. Pp. 366. No price
supplied.

Tyler, Leona E. The work of the counselor. New
York: Appleton-Century-Crofts, 1953. Pp. 323.
Price $3.00.

Weitzenhoffer, A. M. Hypnotism: an objective
study in suggestibility. New York: Wiley, 1953.
Pp. 380. Price $6.00.

Wittenberg, R. M. How to help people. New
York: Association Press, 1953. Pp. 64.
Price $1.00.

Yost, O. R. What you should know about mental
illness. New York: Exposition Press, 1953.
Pp. 165. Price $3.50.


THREE SITUATIONAL DETERMINANTS OF CONFORMITY 
TO SOCIAL NORMS 


SOLOMON C. GOLDBERG 
Army Medical Service Graduate School, Walter Reed Army Medical Center¹


Group psychology has been variously
depicted as the establishment of
social norms by the group and the
adoption of these norms by the members of
the group. The present study is concerned
with certain of the conditions under which
group members will conform to the social
norms of their groups to one degree or another.
We shall consider primarily three variables
which, as yet, have received little systematic
attention. These are: (a) the number of occasions
a subject (S) has been exposed to the
social norm (to be known as “exposures”),
(b) the extent of S's agreement or disagreement
with the social norm when he is initially
exposed to it (to be known as “distance”),
and (c) the size of the group to which the
norm is attributed.

The importance of exposures seems to be
quite well established in advertising practices;
it is assumed that the consumer tends to
adopt a more favorable attitude toward the
advertised product the more often he is told of
the vast number of people and prestige
individuals who already use the product.
The distance variable in relation to conformity
has been observed in a number of
experiments but has not been systematically
[studied]. The best example is Sherif’s autokinetic
experiment (4), which showed that the
individuals who initially held the more extreme
opinions were those who exhibited the
[greatest] opinion change. In another setting
[author name illegible] (3) found that the group as a
whole [directed] more communications toward the
member whose opinion deviated most
[from the] group norm opinion in an attempt to
[bring him] in line with the group. From
[these results] it would follow that it will be
[the extreme deviate] who will conform most
[to the] group, i.e., change his attitude most
[in] favor of the group attitude.

¹ This [paper] represents part of a doctoral dissertation
completed in 1952 at the Catholic University of
America under the supervision of Drs. P. R. Hofstaetter,
J. W. Stafford, and M. Lorr.


The effect of group size on conformity has 
been explored by Asch (1). He found that 
larger groups, up to a given size, exert greater 
influence upon the group member whose 
opinion differs from the group's. The present 
experiment presents the opportunity to test 
the differential effects on conformity of groups 
of two and groups of four. 


HYPOTHESES 


The preceding introductory discussion may 
be summarized in terms of the hypotheses to 
be tested in the present experiment. These 
are that conformity is greater (a) when the 
number of exposures to the group norm is 
greater, (b) when the group member’s initial 
distance from the group norm is greater, and 
(c) when the size of the group is larger. 


METHOD 

Subjects 

The Ss were 79 white males who were either college
or high school students or young businessmen. The
sample was subdivided into nine experimental groups
of four each, nine experimental groups of two each,
two experimental groups of three each, and one control
group of 19 Ss.


Materials 


The experimental stimuli were nine bust-size photographs of male
Negroes taken from a high school yearbook. The nine photographs
(on the basis of previous judgments by 24 judges) were selected
such that the variabilities of their adjudged intelligence were
high, and the means and variabilities of their adjudged
intelligence did not differ significantly from each other. This
procedure allowed the selection of nine equated stimuli which were
relatively unstereotyped with regard to adjudged intelligence.


Procedure

Control group. The following instructions were given to the group:

"The purpose of this experiment is to settle once and for all
whether people have accuracy in making snap judgments of
intelligence from photographs. You all have seen some people who
seem to be able to size up other people rather quickly and rather
well; so probably some of you will be more accurate than others. I
am going to show you nine different pictures. You will see each
picture more than once, but you will see some pictures more than
others. The reason for this is that we are trying to see if you
become more accurate on those pictures which you have had more
chances to see. I will show you each picture for three seconds.
During that time look at it carefully and judge the intelligence
of that person on a scale from 50, low, to 150, high, depending on
how high or how low you think he is. Write down your judgment for
each picture as it comes up. It is only fair to tell you that, in
order to simplify the experiment, we chose to photograph mainly
rather smart and rather dumb people. Although most of these people
were either smart or dumb, we could not avoid a very few who were
average in intelligence. Try to improve your accuracy every time
you see each picture. Any questions?"

The first three photographs were judged 20 times 
each; the second three, 10 times each; and the last three, 
5 times each; thus the total number of judgments was 
105. The presentation of the photographs was random- 
ized, and the control group underwent two such judg- 
ment sessions separated by a 15-minute rest period. 
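
The judgment schedule described above can be sketched in a few lines. This is only an illustration: the paper says the presentation was randomized but does not say how, so the plain shuffle, the function name `presentation_order`, and the seed are assumptions. The photograph letters A to I follow the outline of conditions given later in the Procedure.

```python
import random
from collections import Counter

# Number of times each photograph is judged in one session
# (three photographs at 20, three at 10, three at 5).
EXPOSURES = {"A": 20, "B": 20, "C": 20,
             "D": 10, "E": 10, "F": 10,
             "G": 5, "H": 5, "I": 5}


def presentation_order(exposures, seed=None):
    """Return one randomized sequence of photograph presentations.

    Assumption: a simple shuffle of all presentations; the paper
    does not describe its randomization method.
    """
    rng = random.Random(seed)
    order = [photo for photo, n in exposures.items() for _ in range(n)]
    rng.shuffle(order)
    return order


order = presentation_order(EXPOSURES, seed=1954)  # seed is arbitrary
print(len(order))            # total judgments in one session
print(Counter(order)["E"])   # presentations of Photograph E
```

The total of 105 follows directly from 3 x 20 + 3 x 10 + 3 x 5.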

Experimental groups. Each experimental group was 
at first given the same instructions as the control group 
and then completed one session of 105 judgments. A 
second session of judgments was then administered to 
each group member individually. This session was pre- 
ceded by the following instructions: 

"In this session you are asked to judge the same 
pictures you did before. I have looked over your first 
judgments and have made a note as to what the group 
as a whole thought of each picture. Now every time I 
show you a picture, as I did before, I will tell you what 
judgment the group as a whole made for that picture. 
Then, as before, look at the picture carefully, make 
your own judgment on it, and write it down on the sheet. 
The reason for this is that we want to see if your judg- 
ment accuracy increases or decreases when you know 
what the group judgment is. In other words, will your 
knowledge of the group judgment aid or hinder your 
accuracy? Remember, try to improve your judgment 
accuracy every time you see each picture. Any ques- 
tions?" 

The group estimate of each photograph reported to 
Ss in this session was a fictitious one; it was manipulated 
to be a desired distance from S's estimate of that 
photograph in the first judgment session. For example, 
on the first of the nine pictures it was desired that all 
Ss’ estimates be 50 points from the group estimate; to 
achieve this the group estimate reported to each S for 

that photograph was 50 points from S’s judgment of 
that photograph in the first session of judgments. More 
specifically, if in the first judgment session subject A 
judged the first photograph to be 60 and subject B 
judged it to be 120, then the fictitious group opinions 
given to them were 110 and 70 respectively. Whenever 
possible, the fictitious group opinion was reported in the 
direction opposed to the midpoint value of 100. For ex- 
ample, with a distance of 10, if S’s opinion were 75, the 
possible fictitious group opinions could be 65 or 85. In 
this case, 65 would be preferred in order to counteract 
any chance regression to the midpoint. An outline of 
the experimental conditions may be given as follows: 


Exposures:    20  20  20  10  10  10   5   5   5
Photograph:    A   B   C   D   E   F   G   H   I
Distance:     50  30  10  50  30  10  50  30  10


This means that on Photograph E, for example, all Ss were 30
points from the group, and that the fictitious group opinion was
exposed to each S each of the 10 times he was asked to judge this
photograph. These 10 exposures were randomly dispersed with the
presentation of the other eight photographs.
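
The rule for constructing the fictitious group opinion can be written out as a small function. This is a sketch of one reading of the Procedure: the report is placed away from the midpoint of 100 whenever that value stays on the 50 to 150 scale, and toward the midpoint otherwise. The function name, the constants, and the handling of a judgment of exactly 100 (reported upward here) are assumptions; the paper does not cover that case.

```python
SCALE_LOW, SCALE_HIGH, MIDPOINT = 50, 150, 100


def fictitious_group_opinion(judgment, distance):
    """Group estimate reported to S for one photograph.

    Preferred direction: away from the midpoint, to counteract
    chance regression toward 100. If that value falls off the
    scale, the opposite direction is used instead.
    """
    # Assumption: a judgment of exactly 100 is treated as "above".
    away = -1 if judgment < MIDPOINT else 1
    preferred = judgment + away * distance
    if SCALE_LOW <= preferred <= SCALE_HIGH:
        return preferred
    fallback = judgment - away * distance
    if SCALE_LOW <= fallback <= SCALE_HIGH:
        return fallback
    raise ValueError("no on-scale opinion at this distance")


# The three cases given in the text.
print(fictitious_group_opinion(60, 50))   # subject A
print(fictitious_group_opinion(120, 50))  # subject B
print(fictitious_group_opinion(75, 10))   # distance-10 example
```

For subjects A and B the away-from-midpoint values (10 and 170) fall off the scale, which is why the text reports 110 and 70.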


RESULTS 


For the experiment to be successful it was 
necessary that Ss be unaware of its true pur- 
pose lest they become resistant to its efforts. 
At the end of the experimental procedure each 
S was questioned in order to ascertain whether 
he had “caught on” to our true purpose. It 
was found that only 25 per cent of the experi- 
mental Ss thought that attempts were being 
made to influence their opinions. This split of 
percentages yields a highly significant critical 
ratio of 6.00, signifying that the procedure had 
achieved its desired end. 

The conformity score. The amount of change 
in rating points on any given photograph made 
by an S, from his last judgment in the first 
session to his last judgment in the second 
session, was taken as his conformity score. 
For the experimental Ss the direction of 
change had to be taken into account. If the 
change was in the direction of the fictitious 
group opinion, it was given a positive sign; 
if away from the fictitious group opinion, it 
was denoted as negative change. No change in 
opinion was denoted as zero conformity. 
Direction of change for control Ss from the 
first to the second judgment session was 
randomly assigned since no fictitious group 
opinion was reported to them. 
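
As a minimal sketch of the scoring rule for experimental Ss, the signed conformity score could be computed as follows. The function and argument names are hypothetical; the rule itself (last judgment of the first session compared with last judgment of the second, signed by direction relative to the fictitious group opinion) is taken from the paragraph above.

```python
def conformity_score(first_session_last, second_session_last, group_opinion):
    """Signed change in rating points for one S on one photograph.

    Positive: change toward the fictitious group opinion.
    Negative: change away from it.
    Zero: no change.
    """
    change = second_session_last - first_session_last
    if change == 0:
        return 0
    toward_group = group_opinion - first_session_last
    # Same sign means S moved in the direction of the group opinion.
    moved_toward = (change > 0) == (toward_group > 0)
    return abs(change) if moved_toward else -abs(change)


print(conformity_score(60, 75, 110))    # moved 15 points toward the group
print(conformity_score(120, 130, 70))   # moved 10 points away from the group
print(conformity_score(75, 75, 65))     # no change
```

Control Ss have no group opinion to serve as a reference, which is why the paper assigns their direction of change at random.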

Comparison of experimental and control 
groups. To test for a significant difference 
between conformity in the experimental 
groups and opinion change in the control 
group a t test was computed. Its value of 3.59 was significant
beyond the .01 level of confidence, showing that the experimental
manipulations produced a real conforming effect. (See Table 1.)
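
The t values in Table 1 can be approximately recovered from the summary figures alone. The sketch below rests on two assumptions that the paper does not state: that the dispersion column of Table 1 holds variances computed with N in the denominator, and that the two groups were pooled in the usual way with N1 + N2 - 2 degrees of freedom. The function name is hypothetical.

```python
import math


def pooled_t(mean1, var1, n1, mean2, var2, n2):
    """t for two independent groups from means, variances, and Ns.

    Assumption: each variance was computed with N in the
    denominator, so N * variance is the sum of squared deviations.
    """
    sum_squares = n1 * var1 + n2 * var2
    pooled_variance = sum_squares / (n1 + n2 - 2)
    standard_error = math.sqrt(pooled_variance * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / standard_error


# Experimental vs. control, then groups of four vs. groups of two.
t_exp_ctrl = pooled_t(9.5, 74.5, 60, 1.6, 47.7, 19)
t_four_two = pooled_t(10.2, 73.7, 39, 8.3, 73.8, 21)
print(round(t_exp_ctrl, 2), round(t_four_two, 2))
```

Under these assumptions the first comparison comes out at about 3.59, matching the reported value, and the second at about 0.80 against the reported 0.78; a gap of that size could be due to rounding of the tabled means.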

Conformity to “groups” of four and “groups” of two. The question
asked of this analysis is whether an S conforms more to a group
opinion when he knows the group is composed of a greater number of
members even though he is told of the group opinion by a single
informant. Certainly in extreme cases of group size this
relationship may be expected to hold; however, it appears that in
the present limited case it does not. Table 1 (second row) shows a
t of .78, which is not significant to the .05 level of confidence.


TABLE 1
CONFORMITY DIFFERENCES: EXPERIMENTAL VS. CONTROL GROUPS;
GROUPS OF FOUR VS. GROUPS OF TWO

COMPARISONS        Mean     σ²      N      t       p
Experimental        9.5    74.5    60
  vs.                                     3.59   <.01
Control             1.6    47.7    19
Groups of four     10.2    73.7    39
  vs.                                     0.78   >.05
Groups of two       8.3    73.8    21

Note: The data on the two groups of three were not meaningfully
analyzable. These two groups were divided, the data of one
included in the groups of four and the data of the other in the
groups of two.


Conformity as a function of distance and exposures. The conformity
scores for each experimental condition were arranged in a
three-by-three factorial design so that an analysis of variance
could test for significant differences in conformity between the
three conditions of distance from the group norm and between the
three conditions of exposures to the group norm; in addition, this
analysis would reveal any significant interaction between distance
and exposures.


TABLE 2
CONFORMITY AS A FUNCTION OF DISTANCE AND EXPOSURES
(N = 540; n per cell = 60)

No. of                        DISTANCE
EXPOSURES              50         30         10
20          M         14.5       11.0        0.8
            σ²       359.15     258.40     198.95
10          M         14.1       11.2        3.3
            σ²       [illegible]
5           M         15.2       12.5        3.1
            σ²       470.96     327.95     169.87


Table 3 shows that the F test between distances is significant
beyond the .001 level of confidence; the F's for between exposures
and the interaction of distances and exposures are not significant
to the .05 level of confidence. The t tests showed no significant
differences between conformity at distance 50 and distance 30.
However, differences in conformity between distances 30 and 10 and
between distances 50 and 10 were significant.

Proportional conformity. The preceding analysis shows the degree
of conformity to be different under a variety of distance
conditions. However, the absolute extent of conformity cannot be
generalized to situations beyond the present experiment. Of more
interest would be a percentage conformity index. The index is
taken as the conformity score divided by the distance; in simpler
terms, it denotes how much an S conformed in terms of how much he
was “asked to conform.” If this index can be shown to be constant
under all conditions of distance, then it might be applied to
other situations in which one is trying to predict the degree of
opinion change from a knowledge of opinion distance.


TABLE 3


ANALYSIS OF VARIANCE OF CONFORMITY AS A
FUNCTION OF DISTANCE AND EXPOSURES

Source         df    VARIANCE      F        p
Distances       2      7283      24.80    <.001
Exposures       2       105      <1.00    >.05
Interaction     4        39      <1.00    >.05
Within        531       294
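
As a quick check on Table 3, each F is the ratio of the effect variance to the within-cells variance. The sketch below uses only the tabled figures; the names are hypothetical, and the small difference from the printed 24.80 is presumably due to rounding of the tabled variances.

```python
def f_ratio(ms_effect, ms_within):
    """F as the ratio of an effect mean square to the error mean square."""
    return ms_effect / ms_within


MS_WITHIN = 294
mean_squares = {"Distances": 7283, "Exposures": 105, "Interaction": 39}

f_values = {source: f_ratio(ms, MS_WITHIN) for source, ms in mean_squares.items()}

# Within-cells degrees of freedom: 540 scores in 9 cells.
df_within = 540 - 9

for source, f in f_values.items():
    print(source, round(f, 2))
print(df_within)
```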


To test this possibility, all the conformity scores were
transformed into proportional conformity scores; an analysis of
variance was computed to test for differences between proportional
conformity under the nine experimental conditions. The resulting F
value of .76 shown in Table 4 was not significant to the .05
level. Thus, we cannot reject the hypothesis that there are no
real differences in proportional conformity under varying distance
conditions. The index of proportional conformity appears to be a
constant whose average value is approximately .30. This means
that, within the limits of the present experiment, Ss reduce their
disagreement with the group by approximately 30 per cent
regardless of the extent of their initial disagreement.
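
The proportional index is simply the conformity score divided by the distance. As an illustration, the sketch below applies it to the three cell means for 10 exposures in Table 2 (14.1, 11.2, and 3.3). Because distance is constant within a cell, dividing a cell mean by its distance gives that cell's mean index. The function name is hypothetical.

```python
def proportional_conformity(conformity, distance):
    """Share of the initial disagreement that S gave up."""
    return conformity / distance


# Cell means for 10 exposures, keyed by distance (from Table 2).
means_at_10_exposures = {50: 14.1, 30: 11.2, 10: 3.3}

indices = {
    distance: proportional_conformity(mean, distance)
    for distance, mean in means_at_10_exposures.items()
}
for distance, index in indices.items():
    print(distance, round(index, 3))
```

All three values lie near the overall figure of approximately .30 reported in the text, even though the absolute scores differ by a factor of about four.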

Conformity as multidimensional behavior. 
Since we have nine somewhat different social 
situations in the experimental conditions, we 
may ask to what extent an individual's con- 
formity in one condition is related to his con- 
formity in another condition. To investigate 




TABLE 4
ANALYSIS OF VARIANCE OF PROPORTIONAL CONFORMITY
AS A FUNCTION OF DISTANCE

Source        df    VARIANCE     F       p
Distances      8      0.57      0.76    >.05
Within       531      0.75


Note: Means and variances for each experimental condition 
can be computed directly from Table 2. 


this possibility, intercorrelations² were computed
between conformity scores in each of the 
experimental conditions. These ranged in 
value from —.24 to .47, with the median at .14. 
It would appear that there is little tendency 
for individuals to conform to the same extent 
from one situation to another. 
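
The intercorrelation step can be sketched as follows: correlate every pair of conditions across Ss and summarize the coefficients by their median. The scores below are invented for illustration and are not the study's data; only three of the nine conditions are shown, so the sketch yields 3 coefficients where the full analysis would yield 36. The function names are hypothetical.

```python
import math
from itertools import combinations
from statistics import median


def pearson(x, y):
    """Product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def intercorrelations(scores):
    """Correlation for every pair of conditions."""
    return {
        (a, b): pearson(scores[a], scores[b])
        for a, b in combinations(scores, 2)
    }


# Hypothetical conformity scores: one list per condition,
# one entry per S (same S order in every list).
scores = {
    "A": [15, 20, 5, 10, 30, 0],
    "E": [10, 5, 12, 8, 9, 11],
    "I": [3, 3, 0, 6, 2, 4],
}

r = intercorrelations(scores)
print(len(r), round(median(r.values()), 2))
```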


Discussion 


The results concerning group size were nega- 
tive; Ss did not conform more to the opinions 
of groups of four than to groups of two. This 
may have occurred for two possible reasons. 
First, the difference in group size may not 
have been sufficient to elicit any increased 
conformity. Second, since the effect of the 
group opinion upon the individual was in- 
direct (reported to S by E), Ss could have been 
conforming to E rather than to the group; 
hence, differences in group size would make no 
difference in conformity. Whichever interpreta- 
tion is the true one, it was still possible to 
measure the conditions of conformity differen- 
tials to a prestige source, whether this source 
was the group or E. 

The most clear-cut results were those con- 
cerning the distance variable. A priori, there 
is no reason to expect Ss to conform more 
when the distance is greater. Schachter indi- 


² A complete centroid factor analysis of the intercorrelation
matrix resulted in three factors; these were rotated “blindly” to
simple structure keeping the axes orthogonal. Identification of
the factors, however, was far from clear, and consequently it was
decided not to burden the report with the tabular details of the
analysis. At most, one could identify very indirectly two of the
factors as “conformity to larger distances” and “conformity to
smaller distances.” The intercorrelation matrix and completed
factor analysis have been deposited with the ADI. Order Document
No. 3989 from the ADI Auxiliary Publications Project,
Photoduplication Service, Library of Congress, Washington 25,
D. C., remitting in advance $1.25 for 35 mm. microfilm or $1.25
for 6 by 8 in. photocopies. Make checks payable to Chief,
Photoduplication Service, Library of Congress.




cated that the opinion-deviate conformed 
more because the group exerted greater pres- 
sure upon him to conform by directing more 
communications to him. In the present ex- 
periment we see that those with more deviat- 
ing opinions conformed more, but that the 
pressure brought to bear by the prestige 
source upon the different degrees of deviation 
was constant. Schachter's interpretation would 
not explain the present results. It would 
appear instead that S perceives his distance 
from the group as an undesirable deviation 
and then makes a compromise with the group. 
Perhaps greater deviations are perceived as 
requiring greater compromises. It is also very 
interesting that the percentage of distance 
compromised is a constant for all degrees of 
distance. Why this is so defies explanation 
by the available data. The next step in this 
research should be the investigation of intro- 
spectional correlates of conformity at various 
degrees of distance in an attempt to under- 
stand the dynamics underlying these empirical 
results. 

Consider now the number of exposures to the group norm. It was
shown that conformity occurred within the first five exposures to
the group norm and that additional exposures were not accompanied
by additional conformity. This finding concerning the effects of
repetition is certainly not in line with the current assumptions
of advertising and propaganda. The results do not appear to
characterize a situation wherein an S’s opinion is gradually
weakened by the influence of the group. It would be our guess that
Ss perceive the group opinion as an additional datum upon which to
base a new judgment of the stimulus. This new information is taken
into account, and, as we have seen, S makes some sort of
compromise adjustment in the direction of the group. Additional
repetitions of the same group opinion over and over do not provide
S with any new information. What may be happening in the
repetition of advertisements is that more and more people are
reached rather than that the same people are influenced to a
greater and greater extent. Here, too, we expect that phenomenal
data would be revealing.

The low intercorrelations between conformity in the nine
experimental conditions provide evidence against the existence of
a general personality characteristic of suggestibility or
conformity. The existence of such a general personality
characteristic would require the rank ordering of individuals in
the different experimental conditions to be more highly related
than they were. It would appear that investigations of the
acquisition of social norms would more fruitfully concentrate upon
the analysis of the immediate situation rather than on stable
personality characteristics of S.


SUMMARY AND CONCLUSIONS 


The present study has been concerned with 
the determinants of conformity to group 
norms. The variables under investigation were: 
(a) the extent of initial disagreement between 
the individual’s and the group's opinion 
(known as “distance”); (b) the number of 
occasions the individual was exposed to the 
group norm (known as "exposures"); and (c) 
the size of the group. The following results 
were obtained: 

1. Knowledge of a group norm by a subject 
results in the subject’s conforming to (reducing 
his disagreement with) the group norm. 

2. The degree of conformity to groups of 
four is not different from the degree of con- 
formity to groups of two. 

3. The absolute degree of conformity to the 
group norm is a function of the distance from 
(the extent of disagreement with) the group 
norm. In general, greater distance makes for 
greater conformity. 




4. Subjects conform to the group norm by a 
constant percentage of the distance from the 
group norm. 

5. Conformity occurs within the first few 
exposures to the group norm; additional ex- 
posures to the group norm are not accom- 
panied by additional conformity to the group 
norm. 

6. Intercorrelations between conformity under all experimental
conditions were low. Conformity was not generalized but appeared
to be highly situational. This was considered as evidence against
the existence of any general personality characteristic of
suggestibility or conformity.


REFERENCES 


1. ASCH, S. E. Effects of group pressure upon the modification
   and distortion of judgments. In H. Guetzkow (Ed.), Groups,
   leadership, and men. Pittsburgh: Carnegie Press, 1951.
   Pp. 177-190.

2. McNEMAR, Q. Psychological statistics. New York: Wiley, 1949.

3. SCHACHTER, S. Deviation, rejection and communication. In
   L. Festinger et al., Theory and experiment in social
   communication. Ann Arbor: Univer. of Michigan, Research Center
   for Group Dynamics, Institute for Social Research, 1950.
   Pp. 51-82.

4. SHERIF, M. A study of some social factors in perception.
   Arch. Psychol., 1935, 27, No. 187.

5. THURSTONE, L. L. Multiple-factor analysis. Chicago: Univer. of
   Chicago Press, 1947.


Received February 13, 1953. 


POLITICAL ATTITUDES AND JUDGMENTS OF OTHER PEOPLE 


GUSTAV JAHODA 
University College of the Gold Coast¹ 


DURING the last few years there have 
been several studies dealing with 
judgments of people from pictures. 

Some of these were concerned with the inter- 
relations of judgments of various attributes 
(11), or the effects of stereotypes (10). Follow- 
ing recent developments in the theory of per- 
ception (4, 8), one sees that interest has in- 
creasingly become focused upon the charac- 
teristics of the perceiving or judging subjects 
(5). 

This trend has been most conspicuous in 
connection with ethnic prejudice. Here, the 
most relevant investigation is that of Razran 
(9), who asked his subjects to rate girls’ 
photographs on such variables as beauty, 
intelligence, and character; when these same 
photographs were later presented with Jewish, 
Italian, or Irish names, the ratings became 
modified in accordance with social stereotypes. 

Only one other similar work dealing with 
political attitudes is known to the author. 
This was an experiment by Hsü (6). He was
concerned with judgments of pictures labeled
“Communist” and “Non-Communist,” respectively
(again a minority group), and only
three subjects took part.

The present study, carried out in England, 
was designed to answer the question of whether 
or not affiliation to one of the two main 
parties, Conservative or Labour, can exert an 
influence analogous to that of ethnic prejudice. 
The aim was to test the hypothesis that, in 
judging other people, subjects tend to associate 
favorable personal characteristics with political 
attitudes of which they themselves approve. 
More specifically, two effects were predicted: 
Conservatives and Labourites will tend to 
make differential judgments of pictures of 
people they identify as political allies or op- 
ponents; and, conversely, they will tend to 
identify those judged favorably (unfavorably) 
as belonging to their own (the opposite) party. 


¹ The author wishes to acknowledge gratefully the 
assistance of John Morris and Morris Walker, his former 
colleagues at the University of Manchester, England, 
where the research was undertaken. 


METHOD 


Materials. Photographs were selected from the Times Guide to the
House of Commons, 1950. This work contains standard-sized
photographs of all 625 Members of Parliament (MP's) elected at the
1950 General Election. Most of the pictures were full-face showing
the head down to, but excluding, the shoulders.

After eliminating certain categories,² batches of 20 
photographs equally divided as to party were obtained 
by means of a table of random numbers. These batches 
were once more randomly separated into two sets 
arbitrarily labeled A and B, each set containing five 
pictures of Labour (LAB) and five of Conservative 
(CON) MP's. The photographs had code numbers so 
that the experimenters (Es) themselves did not know 
their party. 

Criteria. The choice of criteria for the judgments 
was guided by two considerations: They should in- 
volve “values” in the sense of not being motivationally 
neutral, and also should permit the subjects (Ss) to 
feel that there was a reasonable possibility of judging 
a photograph according to the criterion. After several 
trials, Looks (LO), Intelligence (IQ), and Social Class 
(SC) were selected. Categories for grading are shown 
in Table 1. 

The exact wording was chosen to insure that none 
of the resulting distributions would be too skewed. 
In referring to judgments, the generalized labels 
“High,” “Medium,” and “Low” will be used hence- 
forth. 

In addition to the above, Ss were asked to identify 
party membership for each photograph. 

The task. Alternative types of record forms, sche- 
matically presented in Table 2, were devised for each 
criterion separately. The object was both to randomize 
the sequence of presentation and to separate the judg- 
ments of party and criterion. 

An example will illustrate the general procedure: S is approached
with a request to take part in an experiment to study the extent
to which individuals agree, and also to determine how accurate
they are in judging personality characteristics from photographs.
S is then given set A and asked to sort the pictures into three
piles marked, e.g., Upper Class, Middle Class, and Working Class,
respectively. When this is completed, E records the code numbers
in the appropriate spaces on Form I and hands S set B to sort into
two piles according to political party; then set A follows, again
to be sorted for party; and finally set B, to be graded once more
according to SC.


² Liberal MP's, women MP’s, and pictures other 
than full face; MP's in military uniforms or other dis- 
tinctive dress clearly indicating social status; prominent 
politicians; MP's from the Lancashire region (where 
the subjects lived) because they were liable to be recog- 
nized. 




TABLE 1
CATEGORIES FOR GRADING ON CRITERIA

CRITERION    HIGH            MEDIUM           LOW
LO           good-looking    average          not good-looking
IQ           very high       above average    average and below
SC           upper           middle           working


TABLE 2
ALTERNATIVE MODES OF PRESENTATION

           OBJECT OF JUDGMENT
SET        Form I        Form II
A          criterion     party
B          party         criterion
A          party         criterion
B          criterion     party


TABLE 3
CONSTITUTION OF THE SAMPLE

                           SEX
PARTY            MALES    FEMALES    BOTH SEXES
Conservatives      49        70         119
Labourites         65        54         119
Others             24        22          46


When S has completed the task, he is asked what party he himself
would vote for if he had to choose between Conservative and Labour
only.

The procedure for Form II was of course the same, apart from the
sequence. It will be noted that each S was confined to a single
criterion in order to avoid [...]

Experimenters and subjects. The field work was carried out by a
group of honor students and the author, the subjects being drawn
from the circle of relatives and friends of the experimenters,
excluding students except in a few cases where no other contacts
were available. The sample procured in this manner, while clearly
unrepresentative of the general population, was not confined to
any particular social group or class.

The category “Others” in Table 3 is a heterogeneous one including
21 Liberals, 10 “nonpolitical” subjects, 8 who, on questioning,
recognized one or more of the photographs, and 7 persons who
refused to divulge political opinion. Except for party
identification, these 46 subjects were excluded from the analysis.


RESULTS 


Accuracy of party identification. At the outset it is necessary to
consider briefly whether recognition of some MP’s by Ss may have
affected the results. Before the experiment, trials were
undertaken to discover how familiar people were with the faces of
MP’s. It appeared that individuals were often able to identify
their own or neighboring MP’s, whose pictures they had seen on
hoardings or in local newspapers, but hardly anyone recognized
other backbenchers. This was to be expected since photographs of
backbenchers are rarely published in the national press.

In the actual experiment, less than 10 per 
cent of the Ss even guessed that the pictures 
were of MP's. 

Rates of success in identifying party clus- 
tered very near to 60 per cent for all groups 
of Ss, the only slight deviation being 56.4 per 
cent for Conservative women. The over-all 
proportion of correct results was 59.5 per cent, 
which is significantly better than chance at 
the .001 level of confidence. 
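
One common way to compare an observed proportion of correct identifications with the 50 per cent expected by chance is a normal-approximation test. The sketch below is illustrative only: the paper does not state which test was used or how many judgments entered it, so the value of n is a hypothetical placeholder, and the test treats all judgments as independent even though each S contributed many.

```python
import math


def z_vs_chance(p_hat, n, p0=0.5):
    """z for an observed proportion against a chance proportion p0."""
    standard_error = math.sqrt(p0 * (1 - p0) / n)
    return (p_hat - p0) / standard_error


def two_tailed_p(z):
    """Two-tailed probability of |Z| >= |z| under the normal curve."""
    return math.erfc(abs(z) / math.sqrt(2))


# 59.5 per cent correct; n = 1000 is a hypothetical number of
# judgments, chosen only to make the example concrete.
z = z_vs_chance(0.595, n=1000)
print(round(z, 2), two_tailed_p(z) < 0.001)
```

With any n of this order or larger, a 59.5 per cent success rate lies well beyond the .001 level, in line with the statement above.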

Analysis of judgments. The first question to 
ask with respect to each of the three criteria 
was whether the four groups (male and female 
Conservatives and Labourites) had in fact 
responded differently to the photographs. If a 
chi-square analysis proved this to be the 
case for a particular criterion, successive 
breakdowns were carried out to discover what 
accounted for the differences. Within each 
criterion, the judgments of photographs be- 
lieved to represent Conservatives (B/CON) 
were treated separately from those believed 
to represent Labourites (B/LAB). 

It appears from Table 4 that LO and IQ 
were judged differently, but SC was not. More 
detailed breakdowns of the data have shown 
that the differences exhibited are a function 
of judgments in which party affiliation was 
wrongly identified. LAB and CON judges did 


TABLE 4
SIGNIFICANCE OF GROUP DIFFERENCES IN RESPONSES*

                                CRITERION
BELIEVED PARTY    LOOKS          INTELLIGENCE    SOCIAL CLASS
B/LAB             χ² = 34.06     χ² = 28.67      χ² = 24.96
                  p < .01        p < .02         ns
B/CON             χ² = 32.06     χ² = 27.74      χ² = 13.65
                  p < .01        p < .05         ns

* In all cells df = 15


not differ significantly in their judgments of 
those photographs for which party affiliation 
was correctly identified. 

Distortion of judgments. The following steps 
were taken in deriving distortion scores. 
First, a standard judgment of each photograph 
was obtained by computing the percentages 
of all Ss who judged the photograph High, 
Low, and Medium on a given criterion. In 
this fashion it was possible to determine what 
percentage of all judgments of actual LAB 
photographs were High, Low, and Medium 
on each criterion. Parallel percentages were 
determined for all actual CON photographs. 
These percentages provided a standard around 
which distortion scores were computed. The 
distortion scores were produced by computing 
the differences (in terms of percentage High 
and Low judgments) between the standard 
and the percentage of High and Low judg- 
ments given by a segment of the sample (e.g., 
male Conservatives) to those photographs they 
believed were representative of affiliation with 
a given party. Thus in Table 5, male Conserva- 
tives are shown to have given 16 per cent 
fewer High, or more Low, LO judgments to 
photographs they believed to be LAB than 
were given by all Ss to actual LAB photo- 
graphs. 
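
The description of the distortion score leaves some room for interpretation. The sketch below shows one plausible reading, stated here as an assumption: net favorability is the percentage of High judgments minus the percentage of Low judgments, and the distortion score is the net favorability of a group's judgments of photographs it believed to belong to a party, minus the net favorability of the standard for actual photographs of that party. The counts are invented for illustration and are not the study's data.

```python
def percentages(counts):
    """Convert High/Medium/Low counts to percentages."""
    total = sum(counts.values())
    return {grade: 100 * n / total for grade, n in counts.items()}


def net_favorability(counts):
    """Percentage of High judgments minus percentage of Low judgments."""
    pct = percentages(counts)
    return pct["High"] - pct["Low"]


def distortion_score(group_counts, standard_counts):
    """Assumed reading: group net favorability minus the standard's."""
    return net_favorability(group_counts) - net_favorability(standard_counts)


# Hypothetical counts. Standard: all Ss judging actual LAB photographs.
standard = {"High": 300, "Medium": 500, "Low": 200}
# Group: one segment of Ss judging photographs they believed to be LAB.
group = {"High": 40, "Medium": 108, "Low": 52}

print(distortion_score(group, standard))
```

A negative score under this reading corresponds to "fewer High and/or more Low judgments than the standard," which is how the note to Table 5 describes its entries.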

In the left half of Table 5 the scores of 
Conservative men and women appear to mani- 
fest a consistent bias. This is shown as fewer 
High judgments of Looks, IQ, and Social Class 
than the standard for photographs of MP's 
believed by these Ss to be of the Labour 
Party, and more High judgments for photo- 
graphs of MP's believed to be Conservative. 

Labour men exhibit the opposite bias as 


TABLE 5
DISTRIBUTION OF DISTORTION SCORES

                             CONSERVATIVE JUDGES     LABOUR JUDGES
BELIEVED PARTY   CRITERION   MALES     FEMALES       MALES     FEMALES
B/LAB            LO          -16.0*    -12.0         +13.0     -13.0
                 IQ          -11.0     -13.0         +12.5      -8.5
                 SC          -10.0     -21.0         -12.5      -1.0
B/CON            LO          +18.5     +10.5         -18.5      -0.5
                 IQ           +9.0     +18.0          -6.5       0.0
                 SC           +8.5     +13.5         +18.0      +6.5

* This reads, e.g., that male Conservative judges gave 16% less
High and/or more Low judgments than the standard. The standard
is based on the percentage of judgments from all Ss that were High
or Low for this Looks criterion.


far as criteria LO and IQ are concerned, but when it comes to SC
they deviate in the same direction as the Conservatives. Apart
from their moderate amount of distortion, Labour women present no
clear-cut trend; regarding SC, their responses are uniform with
those of all other groups of judges.

Finally, the experimental design provided
an answer to the question of whether or not 
the order of presentation (party first or cri- 
terion first) made any difference. In other 
words, was there a greater tendency for, say, 
Conservative judges to rate as Low the photo- 
graphs previously labeled LAB, than to pick 
out those earlier rated, say, “not good-looking” 
and call them LAB? A chi-square test brought 
out no significant differences, and it may be 
concluded that in either direction the associa- 
tion was of approximately equal strength. 


Discussion 


The 59.5 per cent success rate in identifying party membership was
of the same order of magnitude as those obtained by Allport and
Kramer (1) and Lindzey and Rogolsky (7), whose Ss were required to
pick out pictures of Jews. This appears surprising in view of the
fact that LAB and CON MP’s, as groups, probably differ less from
each other than the rank and file supporters of the two parties.

The results of the experiment confirm the 
general hypothesis stated at the outset. Two 
specific effects were also postulated: first, 
overrating of photographs of one’s own and 
underrating those of the opposite party when 
party identification has come first; second, 
identifying those previously judged High as 
belonging to one’s own, and those judged Low, 
to the opposite party. At first sight it appears 
that both these effects were established, an 
impression reinforced by the finding that order 
of presentation (party or criterion first) made 
no difference. 

However, the crucial fact must be emphasized that significant
differences occurred only when party was wrongly identified. This
means that subjects of different parties agreed substantially in
their ratings of the photographs but disagreed on the parties to
which they thought the MP’s belonged. In other words, contrary to
appearances, and to the plan of the experiment, judgment on a
criterion was always the independent, and party identification the
dependent, variable.


³ The fact that all SC distortions occurred in the same direction
accounts for the absence of significant group differences with
respect to this criterion.


In attempting to explain this, a distinction 
can be drawn between the two operations of 
judgment on a criterion such as “Looks,” and 
party identification. The former may be called 
a first-order judgment, being made with 
reference to directly perceived characteristics; 
party identification may be termed a second- 
order judgment, for it is based on a previous 
first-order one. Thus, in identifying party, the 
subjects first had to consider the directly 
observable attributes of the picture and then 
proceed to an inference or guess concerning 
party affiliation. Consequently, whatever the 
order of presentation, implicit first-order judg- 
ments, albeit not necessarily on a specified 
criterion, must have come first. If these con- 
siderations are justified, it follows that the 
experimental design was unsuitable for testing 
the first of the effects above; it could only have 
been done by providing subjects with ready- 
made party labels for the photographs. 

What did emerge conclusively was that 
Labour men gave a disproportionate share of 
the good-looking and intelligent-appearing 
types to the Labour Party, while Conserva- 
tives, and to some extent Labour women, 
assigned an excessively large number to the 
Conservative Party. The distortion scores, 
of course, have to be interpreted accordingly. 

So far, no reference has been made to the 
discrepancy between the response patterns 
relating to LO and IQ on the one hand, and 
to SC on the other. This may be partly accounted
for, perhaps, by the fact that “favorable”
and “unfavorable” are somewhat ambiguous
terms when applied to social class
levels in England, for valuation depends to
some extent on the individual's position within
the class hierarchy and his political attitudes.
For instance, Conservative Ss tended to
characterize the working class by such epithets
as “coarse,” “shabby,” “dull”; Labourites
referred to upper-class people as “snobbish,”
“smug,” “hard-faced.”

It is unlikely, however, that such differential
status valuations are sufficient to explain the
discrepancy; a more general hypothesis
may be put forward tentatively to
account for the data as a whole: in their relation
to the outer world, people expect certain
attributes to occur together. Their expectations
may be based either on motivational factors, 
or upon reality, or both. Whenever people 
encounter new situations or new people, their 
judgments will be affected by these expecta- 
tions which they carry with them into the 
situation. With respect to LO and IQ, Conservative
and Labour subjects produced different
patterns of distortion because their
expectations differed; with respect to SC, the 
distortion patterns were similar to one another 
because reality had created in both types of 
subject the expectation that Conservatives are 
likely to have high, and Labourites low, class 
status. 

While no direct data were collected on the 
various subjects’ expectations, their comments 
in the course of the experiment tend to support 
the above hypothesis. Statements such as the 
following were frequent with Conservative 
subjects: “The ones with breeding in their
features are Conservatives”; “Socialists are
rough-looking types and lack polish”; “The
ones with a vacant expression are Labour.”
And on the other side: “The fat and stupid-looking
ones are Conservatives”; “Labour
people have a frank and open appearance.”
Similar descriptions of working-class people 
and Labourites, or upper-class people and 
Conservatives, often implied a bracketing of 
the two pairs of categories. 

There remains the problem of the deviant 
responses of Labour women. The relevant 
literature (2, 3, 12) contains a good deal of 
evidence to show that Labour and/or working 
class women diverge considerably from Labour 
men in their social attitudes. The divergence 
is mostly in the Conservative direction, and 
one is led to conclude that a willingness to 
vote Labour rests on a somewhat different 
basis in women from that usual in men. 

Finally, when assessing the extent of the 
distortions obtained in accordance with the 
initial hypothesis, it should be noted that the 
results might have been more striking in the 
absence of two negative factors: first, some
of the more sophisticated subjects were aware 
of the possibility of bias inherent in the judging 
situation and attempted to guard against it; 
second, the subjects were not specially selected 
for the intensity of their attitudes, which 
ranged all the way from enthusiasm to rather
half-hearted support.




SUMMARY 


1. The hypothesis was set up that in 
judging other people, subjects tend to asso- 
ciate favorable personal characteristics with 
political attitudes of which they themselves
approve. 

2. In order to test the hypothesis, a set of 
photographs of British Members of Parliament 
was graded according to three criteria by 119 
Conservative and 119 Labour subjects. A 
further task was the party identification of the 
men pictured. 

3. About 60 per cent of the photographs were 
correctly identified as to party, a result con- 
siderably better than chance. 

4. There were significant differences between 
Conservative and Labour subjects with respect 
to criteria Looks and Intelligence, but not to 
Social Class. These differences occurred only 
when photographs were wrongly identified in 
terms of party. 

5. With reference to Looks and Intelligence, 
distortions were found in the expected direction,
favorable to political allies and unfavorable
to opponents.

6. The responses of men and women were 
similar with Conservatives, but divergent in 
the case of Labour supporters. 


Gustav JAHODA 


REFERENCES 


1. Allport, G. W., & Kramer, B. M. Some roots
of prejudice. J. Psychol., 1946, 22, 9-39.

2. Benney, M., & Geiss, P. Social class and politics
in Greenwich. Brit. J. Sociol., 1950, 1, 310-327.

3. Birch, A. H., & Campbell, P. Voting behaviour
in a Lancashire constituency. Brit. J. Sociol.,
1950, 1, 197-208.

4. Bruner, J. S. Personality dynamics and the process
of perceiving. In R. Blake & G. Ramsey (Eds.),
Perception, an approach to personality. New York:
Ronald, 1951. Pp. 121-147.

5. Fensterheim, H., & Tresselt, M. E. The influence
of value systems on the perception of
people. J. abnorm. soc. Psychol., 1953, 48, 93-98.

6. Hsü, E. H. An experimental study of rationalization.
J. abnorm. soc. Psychol., 1949, 44, 277-278.

7. Lindzey, G., & Rogolsky, S. Prejudice and identification
of minority group membership. J.
abnorm. soc. Psychol., 1950, 45, 37-53.

8. Postman, L. Toward a general theory of cognition.
In J. H. Rohrer & M. Sherif (Eds.), Social psychology
at the crossroads. New York: Harper,
1951. Pp. 242-272.

9. Razran, G. Ethnic dislikes and stereotypes. A
laboratory study. J. abnorm. soc. Psychol., 1950,
45, 7-27.

10. Secord, P. F., Bevan, W., Jr., & Dukes, W. F.
Occupational and physiognomic stereotypes in
the perception of photographs. J. soc. Psychol.,
1953, 37, 261-270.

11. Wurm, J. S. An experiment on reading character
from portraits. J. gen. Psychol., 1949, 40, 11-45.

12. Zweig, F. Women's life and labour. London: Victor
Gollancz, 1952.


Received March 28, 1953. 




REACTION TIME AS A FUNCTION OF MANIFEST ANXIETY
AND STIMULUS INTENSITY¹


CHARLES WENAR 
University of Illinois, College of Medicine 


There is at present considerable agreement
between clinical theory and behavior
theorists’ formulations of the
nature of anxiety. Fenichel (2) states: “All
anxiety is a fear of experiencing a traumatic
state (p. 133) .... It is the ego’s judgment,
‘danger ahead’ (p. 213).... The ego's
judgment of impending danger brings the organism
into a state similar to that of the
trauma, but of lower intensity. The ‘tamed’
anxiety ... is used to indicate the necessity
for starting defensive action” (p. 132). A
similar idea is elaborated by Mowrer (8), and
by Miller and Dollard (7), who regard anxiety
as a conditioned pain response which has drive
value. Thus, stimuli present at the time of a
traumatic event tend, on being repeated, to
arouse certain components of the pain response,
even though the painful event itself does not
occur. Man, being a highly articulate organism,
can report being “afraid” or “anxious” at such
times. When such anxiety is prolonged over a
considerable period of time, even though no
objective dangers can be identified, the clinical
term “anxiety state” or “anxiety neurosis” is
used. Most descriptions include two types of
response from which the anxiety state is inferred:
first, there are complaints of generalized
fear of varying intensities, depending on the
severity of the case; and second, there are
concomitant physiological changes such as
rapid heartbeat, increased sweating of skin
surfaces, etc.

For the present study, the important inference
from both clinical theory and behavior
theory is that chronically anxious individuals
are at a higher drive level than nonanxious
ones. There is also considerable experimental
evidence, involving a number of different response
measures, which points to the validity
of attributing motivational properties to anxiety.
Taylor (16) found more rapid eyelid
conditioning in anxious subjects than in nonanxious
subjects, and Rosenbaum (11) found
greater responsiveness to generalized stimuli
in an anxious than in a nonanxious group. Both
studies interpreted the results as indicating a
higher drive level in the anxious group on the
basis of Hull's theory (3) that an increase in
drive facilitates the evocation of a response.
Johanson’s findings (5) that reaction time decreased
in subjects made anxious by shocking
them for reacting slowly can be interpreted in
terms of increase in motivation.


1 This study was part of a doctoral dissertation completed
in the department of psychology at the
State University of Iowa, under the direction of Dr. I.
E. Farber. The author is also indebted to Dr. J. S.
Brown and Dr. T. Ted Hunter for their help in designing
and constructing the apparatus.


PROBLEM 


Since there is experimental evidence that 
not only anxiety but also stimulus intensity 
may be considered a motivational factor, the 
problem arises concerning the effect on be- 
havior of varying both the anxiety level of 
individuals and the intensity of the stimulus. 
Hull's theoretical approach states this problem 
precisely enough to lead to predictions which 
can be experimentally verified. According to 
him, if the learning factor (${}_{S}H_{R}$) is held constant,
then the higher the drive level, the
greater the magnitude of the response tend- 
ency. From this, one would predict that anxious 
individuals would react more strongly than 
nonanxious individuals to discrete stimuli in 
simple situations. However, he also postulates 
that the total drive is composed of irrelevant
($\dot{D}$) and relevant ($D$) drives which summate
according to the formula $100(\dot{D} + D)/(\dot{D} + 100)$.
Now, in an experimental situation in which
anxious and nonanxious individuals respond 
to discrete stimuli, anxiety can be considered 
as contributing to the irrelevant drive, and the
intensity of the experimental stimulus can be 
considered as contributing to the relevant 
drive. On these assumptions the application 


2 Passey (9) and Spence and Taylor (14) found more 
rapid eyelid conditioning with a strong puff of air than 
with a weak puff; Woodworth (17) cites studies indicating
that reaction time decreases as the intensity of
stimulation increases. Also, see Spence (13) for a more 
explicit statement of the way in which intensity of 
stimulation is now treated in the framework of S-R 
theory. 




of Hull's formula would result in the expecta- 
tion that the total magnitude of response of 
anxious individuals to a stimulus should be 
greater than that of nonanxious individuals, 
but differences between the two groups should 
decrease with increasing intensity of stimula- 
tion. 
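
The prediction can be checked numerically. The sketch below implements the drive-combination expression quoted above, 100 times the sum of irrelevant and relevant drive divided by irrelevant drive plus 100. The function and variable names are introduced here for illustration, and the drive values are hypothetical points on a 0 to 100 scale, not estimates from the study.

```python
def total_drive(irrelevant: float, relevant: float) -> float:
    """Drive combination: 100 * (irrelevant + relevant) / (irrelevant + 100)."""
    return 100.0 * (irrelevant + relevant) / (irrelevant + 100.0)


# Hypothetical irrelevant (anxiety) drive for the two groups.
ANXIOUS, NONANXIOUS = 60.0, 30.0


def group_difference(relevant: float) -> float:
    """Anxious minus nonanxious total drive at one level of relevant drive."""
    return total_drive(ANXIOUS, relevant) - total_drive(NONANXIOUS, relevant)


# Hypothetical relevant drive for a weak, medium, and strong stimulus.
for relevant in (10.0, 40.0, 70.0):
    print(relevant, round(group_difference(relevant), 2))
```

Under these assumed values the anxious group stays above the nonanxious group at every stimulus level, while the gap between them narrows as the relevant drive rises, which is the pattern the formula is said to predict.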

Experimental evidence on this problem, how- 
ever, is far from clear. Some results are in line 
with the implications of the above-mentioned 
theory. Malmo (6), using as a response meas- 
ure the number of “bursts” of motor activity 
defined in terms of finger movements on a 
button the subject could press if the stimulus 
became too painful, found that anxious subjects
reacted more strongly than nonanxious
subjects to a mild pain stimulus, and that
they continued to do so as the intensity of 
the stimulus was increased. However, the 
rate of increase was more rapid in the non- 
anxious group. Solomon and Fentress (12) 
found that anxious individuals had low resist- 
ance levels initially and had small absolute 
changes in GSR to different sensory stimuli 
such as a buzzer and a light; nonanxious in- 
dividuals had high resistance levels and large 
absolute changes. Darrow and Solomon (1), 
using a similar procedure with psychotic pa- 
tients, found similar results in regard to abso- 
lute change in GSR. 

Other experimental findings are at variance 
with predictions from the present application 
of Hull’s formula. Spence and Taylor (15) 
using a weak and a strong puff of air in condi- 
tioning the eyelid response found that the 
speed of conditioning of anxious and non- 
anxious individuals was not significantly af- 
fected by increasing the intensity of the noxious 
stimulus. Rosenbaum (11) found that the 
difference in stimulus generalization between 
anxious and nonanxious individuals was significant
only when a strong shock was administered
for a given number of incorrect responses;
when a mild shock or a buzzer was used, there
was no significant difference between the two 
groups. This led him to speculate that the 
drive properties of clinical anxiety may not 
be activated under conditions of little or no 
threat. 


Hypotheses 


1. On the assumption that anxiety can be 
considered as contributing to the irrelevant
drive ($\dot{D}$), and the drive induced by the intensity
of the experimental stimulus can be
considered as contributing to the relevant
drive ($D$), Hull's formula for drive combination
would imply that the total drive level of
anxious groups would be higher than that of 
nonanxious groups; however, increase in 
stimulus intensity should result in a reduction 
of the difference between the total drive level 
of the anxious and nonanxious groups. 

2. If Rosenbaum's speculation that the drive 
properties associated with clinical anxiety may
not be aroused under experimental conditions 
of little or no threat is correct, there should 
be no difference in the drive level of anxious 
and nonanxious individuals when the stimulus 
is relatively mild and innocuous. 
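
The reduction stated in hypothesis 1 can be read directly from the drive-combination formula. In the worked step below, the subscripts $A$ and $N$ (anxious and nonanxious) and the symbol $\bar{D}$ for total drive are notation introduced here for illustration; $\dot{D}$ is the irrelevant drive and $D$ the relevant drive, both taken to lie between 0 and 100.

```latex
\[
\bar{D}_A - \bar{D}_N
  = \frac{100(\dot{D}_A + D)}{\dot{D}_A + 100}
  - \frac{100(\dot{D}_N + D)}{\dot{D}_N + 100}
  = \frac{100\,(100 - D)\,(\dot{D}_A - \dot{D}_N)}
         {(\dot{D}_A + 100)(\dot{D}_N + 100)}
\]
```

For $\dot{D}_A > \dot{D}_N$ and $D < 100$ the difference is positive, and because it is proportional to $100 - D$ it shrinks linearly as the relevant drive $D$ increases.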


METHOD 


Design. The above hypotheses were tested by 
measuring the reaction time of anxious and nonanxious 
individuals to three different stimuli—a buzzer, a weak 
shock, and a strong shock. With other factors held 
constant, it was assumed that speed of reaction (1/R.T.)
would be a positive function of the variations in
motivation associated with variations in level of manifest
anxiety and stimulus intensity. In the initial or
“training” series, the stimuli were presented at regular
12-sec. intervals and, on the above assumption, increased
speed of reaction during training was used as
one criterion of increased drive. Following training, the
subject’s reaction time to stimuli presented at intervals
of shorter and longer duration than the training interval
was determined in order to obtain a temporal gradient
of response strength. On the assumption that such a
temporal gradient of response strength would represent
a special instance of stimulus generalization, it was
supposed that an increase in drive, in terms of anxiety
and/or stimulus intensity, would raise the height and
steepen the slope of the obtained gradient. This, then,
was the second criterion of increased drive. 

Subjects. Subjects (Ss) were members of the under- 


3 The rationale here derives from Hull, who postulates
that the generalization of habit strength from a
conditioned stimulus to other stimuli along a given
sensory dimension is a negative growth function of the
distance, in sensory units, between these stimuli and
the conditioned stimulus. Since he also postulates that
drive multiplies all existing habit systems, it follows
that when drive is increased, the response tendency to
the conditioned stimulus and to stimuli differing from
it along a given dimension should be strengthened. This
produces both an increase in the height and a steepening
of the slope of the generalization gradient. Rosenbaum
(10) pointed out that the temporal gradient of
response strength can be regarded as a special instance
of empirical stimulus generalization in which habit
strength generalizes to internal cue patterns rather than
to external sensory stimuli.




graduate course in introductory psychology at the 
State University of Iowa. Anxious and nonanxious in- 
dividuals were selected by means of a revision of the 
Anxiety Scale devised by Taylor (16). This scale consists
of 50 items from the Minnesota Multiphasic test
which were judged by trained clinicians to be the 
clearest indices of symptomatic anxiety. Forty-two 
Ss were taken from those obtaining scores in the upper 
20 per cent of the distribution, and an equal number 
from the lower 20 per cent. 
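
The extreme-groups selection described above can be sketched as follows. This is an illustration only: the function name, the subject identifiers, and the scores are hypothetical, and the handling of tied scores at the cutoffs (here left to sort order) is an assumption, since the text does not describe it.

```python
from typing import Dict, List, Tuple


def extreme_groups(scores: Dict[str, int], fraction: float = 0.20) -> Tuple[List[str], List[str]]:
    """Return (nonanxious, anxious) subject ids from the lower and upper tails."""
    ranked = sorted(scores, key=lambda subject: scores[subject])
    k = int(len(ranked) * fraction)
    if k == 0:
        return [], []
    return ranked[:k], ranked[-k:]


# Hypothetical anxiety-scale scores for ten subjects.
example_scores = {f"s{i}": score for i, score in enumerate([4, 9, 12, 15, 18, 21, 24, 30, 35, 41])}
nonanxious, anxious = extreme_groups(example_scores)
print("nonanxious:", nonanxious)
print("anxious:", anxious)
```

In the study the pool was larger than either tail required, since 42 Ss were drawn from each 20 per cent band rather than the whole band being used.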

Apparatus. For individuals in the shock groups the 
output voltage of a Variac, which was connected to the 
115-volt, 60-cycle supply line, was fed to the Ss through 
a pair of isolating transformers. Dry electrodes con- 
sisting of smoothed-surface dimes were used. The vol- 
tages applied to the Ss from these transformers were 
read from a 2000 ohm-per-volt AC voltmeter. For in- 
dividuals in the buzzer group, a buzzer of medium 
intensity was placed behind a cardboard shield ap- 
proximately 8 in. from the reaction-time switch. In the 
case of both shock and buzzer, the stimulus was of brief
duration.

Two Hunter-Brown Electronic Timers (4) were used 
to regulate the interval between presentations of the 
stimulus. They were connected so that the dial could 
be set for the standard 12-sec. interval and reset for the 
desired interpolated intervals. 

In order to measure reaction time, a Standard 
Electric clock was started the instant the stimulus was 
presented and was stopped when the subject depressed 
a lever on a microswitch.

All apparatus except the reaction-time switch were 
screened from the subject's view. 

Procedure. Both anxious and nonanxious groups were 
subdivided into three groups of 14 Ss according to the 
type of stimulus used as a signal for reaction—a mild 
shock, a strong shock, or a buzzer. Thus there were six 
groups comprising a factorial design with two degrees of 
anxiety and three intensities of stimulation. Anxious 
and nonanxious individuals were assigned to the groups 
at random, and all six conditions were run concomi- 
tantly. 
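
The allocation just described, two anxiety levels crossed with three stimuli and 14 Ss per cell, can be sketched as below. The subject identifiers, the seeds, and the function name are hypothetical; the text states only that assignment was random, so the shuffle-and-slice procedure is one possible way of doing it, not a record of the author's.

```python
import random

STIMULI = ("strong shock", "weak shock", "buzzer")


def assign_to_stimuli(subjects, per_cell=14, seed=0):
    """Randomly split one anxiety group into equal-sized stimulus subgroups."""
    if len(subjects) != per_cell * len(STIMULI):
        raise ValueError("group size must equal per_cell times the number of stimuli")
    shuffled = list(subjects)
    random.Random(seed).shuffle(shuffled)
    return {
        stimulus: shuffled[i * per_cell:(i + 1) * per_cell]
        for i, stimulus in enumerate(STIMULI)
    }


# Hypothetical subject ids: 42 anxious and 42 nonanxious Ss.
anxious_ids = [f"A{i:02d}" for i in range(42)]
nonanxious_ids = [f"N{i:02d}" for i in range(42)]

design = {
    "anxious": assign_to_stimuli(anxious_ids, seed=1),
    "nonanxious": assign_to_stimuli(nonanxious_ids, seed=2),
}
for anxiety_level, cells in design.items():
    for stimulus, members in cells.items():
        print(anxiety_level, stimulus, len(members))
```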

At the beginning of the experiment S was seated at a
table with the reaction-time key on his right and (in the
case of the shock groups) the electrodes attached slightly
above the wrist of his right arm. In order to determine
the voltage necessary to produce a “strong” and a
“weak” shock, each S was given a series of shocks, beginning
at 10 volts and increasing to 50 volts, which he
was instructed to rate as mild, medium, or strong. The
mean shock for both Strong Shock Groups was
[illegible] volts; the mean value for both Weak Shock Groups,
[illegible] volts. There was no tendency for one anxiety
group to be systematically lower or higher than the
other in its judgments of mild and intense voltages. For
each S the strength of shock was kept constant
throughout the experiment.

Ss were given three or four practice trials in
pressing the reaction-time key in order to accustom
them to the movement. In the case of the Buzzer Groups,
the buzzer was presented three times before the experiment
began so that the S might readily identify
it. Instructions were worded so as to emphasize
speed of the response, and S was told his reaction
time after each presentation of the stimulus in order to
maintain his motivation.

In the initial phase of the experiment, the stimulus 
was delivered at regular 12-sec. intervals. There were 17 
such presentations, which constituted the training or 
temporal conditioning series. Then, on trials 18, 21, 27, 
35, 41, 48, and 55 the signal was presented at varying 
intervals. These intervals were 3, 6, 9, 12, 15, 18, and 21 
seconds, and their sequence of presentation was varied 
in terms of a latin-square design. These intervals con- 
stituted the test trials for determining the temporal 
gradient of response strength. Between the test trials, 
the 12-sec. interval was administered in order to re- 
establish the original temporal conditioning. 
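
The ordering of the seven test intervals can be illustrated with a Latin square. The text does not say which square was used, so the cyclic construction below is an assumption chosen for simplicity; only the interval values come from the text, and the function name is hypothetical.

```python
INTERVALS = (3, 6, 9, 12, 15, 18, 21)  # seconds, as listed in the text


def cyclic_latin_square(items):
    """Build a square in which row r is the item list rotated left by r places."""
    n = len(items)
    return [[items[(row + col) % n] for col in range(n)] for row in range(n)]


square = cyclic_latin_square(INTERVALS)
for row in square:
    print(row)
```

Each row can serve as the order of test intervals for one subset of Ss; every interval then appears exactly once in each ordinal position across the seven rows, so position effects are balanced over intervals.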

The response measure for reaction time was the time,
in hundredths of a second, between the administration 
of the stimulus and the depression of the key. 


RESULTS 


Before analyzing the data, a reciprocal 
transformation was performed on all reaction- 
time scores in order to eliminate the typical 
skewness of the distribution of latency meas- 
ures. Because of this the response measure will 
be referred to as speed of reaction rather than 
reaction time. 
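
The reciprocal transformation can be sketched as follows. The scaling is an assumption: reaction times are taken in hundredths of a second, as recorded, and converted so that speed is expressed as the reciprocal of the time in seconds. The latencies are hypothetical and the function name is introduced here.

```python
def speed_of_reaction(rt_hundredths: float) -> float:
    """Reciprocal of reaction time, with the time first converted to seconds."""
    if rt_hundredths <= 0:
        raise ValueError("reaction time must be positive")
    return 100.0 / rt_hundredths


# Hypothetical latencies in hundredths of a second, with one long outlier.
reaction_times = [12, 13, 14, 15, 25]
speeds = [speed_of_reaction(rt) for rt in reaction_times]
mean_speed = sum(speeds) / len(speeds)

for rt, speed in zip(reaction_times, speeds):
    print(rt, round(speed, 2))
print("mean speed:", round(mean_speed, 2))
```

The occasional very long latency, which produces the long right tail of a reaction-time distribution, is compressed toward the other scores once reciprocals are taken.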

Speed of reaction during training. The first 
17 trials were divided into four blocks of trials, 
and the mean speed of reaction per trial with- 
in each block was plotted. Since the result- 
ing curves appeared to be approximately 
parallel, the mean of the speed-of-reaction
score for the 17 trials was used as a representa- 
tive measure of the speed of reaction of the 
various subgroups during training. These 
means of speed-of-reaction scores for each sub- 
group are presented in Table 1. An analysis 
of variance indicated that both increase in 
anxiety and increase in stimulus intensity were 
effective in increasing the speed of reaction 
during training. The former was significant at
the 5% level of confidence, the latter at the
.1% level of confidence. The interaction between
anxiety and stimulus intensity was not
significant.
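
The two main effects can be seen in the marginal means of Table 1. The sketch below copies the six cell means from that table and averages them by row and by column; it is descriptive only and does not reproduce the analysis of variance, which requires the individual scores. The function names are introduced here.

```python
# Cell means copied from Table 1 (speed of reaction, trials 1-17).
TABLE_1 = {
    "anxious":    {"strong shock": 7.62, "weak shock": 5.86, "buzzer": 7.25},
    "nonanxious": {"strong shock": 7.16, "weak shock": 5.36, "buzzer": 6.07},
}


def row_means(table):
    """Mean over stimuli for each anxiety group."""
    return {group: sum(cells.values()) / len(cells) for group, cells in table.items()}


def column_means(table):
    """Mean over anxiety groups for each stimulus."""
    stimuli = next(iter(table.values())).keys()
    return {s: sum(table[group][s] for group in table) / len(table) for s in stimuli}


print({k: round(v, 2) for k, v in row_means(TABLE_1).items()})
print({k: round(v, 2) for k, v in column_means(TABLE_1).items()})
```

Because every cell has the same N, these unweighted averages of cell means equal the marginal means of the full data.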

TABLE 1

MEAN OF THE SPEED-OF-REACTION SCORES
FOR TRIALS 1-17
(N = 14)

GROUPS        STRONG SHOCK    WEAK SHOCK    BUZZER
Anxious           7.62           5.86        7.25
Nonanxious        7.16           5.36        6.07


TABLE 2

MEAN OF THE SPEED-OF-REACTION SCORES
AT THE 3-, 6-, AND 9-SEC. INTERVALS

GROUPS        STRONG SHOCK    WEAK SHOCK    BUZZER
Anxious           5.10           4.15        4.29
Nonanxious        5.01           3.49        3.75


Height of the temporal gradient of response
strength. The temporal gradient of response
strength was obtained by plotting the mean 
speed of reaction for each group at each of the 
test intervals. Since the curves for the anterior 
portion of the gradient (intervals of 3, 6, and 
9 sec.) appeared to be quite different from 
those for the posterior portion (intervals of 
15, 18, and 21 sec.), the two segments were 
analyzed separately. 

As a measure of the height of the anterior 
portion of the gradient, the mean speed-of- 
reaction scores at the 3-, 6-, and 9-sec. inter- 
vals were summated for each group. The mean 
of these scores for each group is presented in 
Table 2. An analysis of variance indicated 
that an increase in anxiety significantly in- 
creased the height of the anterior portion of 
the temporal gradient of response strength 
(at the 5% level of confidence), and that an 
increase in stimulus intensity also increased 
the height significantly (at the .1% level of 
confidence). The interaction between anxiety 
and stimulus intensity again was not sig- 
nificant. 

As a measure of the height of the posterior 
portion of the gradient, the mean speed-of- 
reaction scores at the 15-, 18-, and 21-sec. 
intervals were summated for each group. The 
mean of these scores for each group is pre- 
sented in Table 3. An analysis of variance 
indicated that both increase in anxiety and 
increase in stimulus intensity were effective in 
increasing the height of the posterior portion 
of the temporal gradient of response strength. 
The former was significant at the .1% level
of confidence, and the latter was significant at
the 1% level of confidence. The interaction
between anxiety and stimulus intensity was
not significant.


TABLE 3

MEAN OF THE SPEED-OF-REACTION SCORES
AT THE 15-, 18-, AND 21-SEC. INTERVALS

GROUPS        STRONG SHOCK    WEAK SHOCK    BUZZER
Anxious           8.49           7.11        7.71
Nonanxious        7.38           5.73        6.33


TABLE 4

MEAN OF DIFFERENCES OBTAINED BY SUBTRACTING
MEAN SPEED-OF-REACTION SCORES AT 3 SEC. FROM
MEAN SPEED-OF-REACTION SCORES AT 12 SEC.

GROUPS        STRONG SHOCK    WEAK SHOCK    BUZZER
Anxious           3.75           5.99        5.14
Nonanxious        4.37           3.95        4.38

Slope of the temporal gradient of response 
strength. As a measure of the steepness of 
slope of the gradient for intervals shorter than 
the training interval, the speed of reaction at 
the 3-sec. interval was subtracted from the 
speed of reaction at the 12-sec. interval. The 
mean of these differences for each group is 
presented in Table 4. An analysis of variance 
indicated that neither increase in anxiety nor 
increase in stimulus intensity was effective in 
steepening the slope of the anterior portion of 
the temporal gradient of response strength. The 
interaction between anxiety and stimulus in- 
tensity likewise was not significant. 

An examination of the posterior portion of 
the temporal gradient of response strength re- 
vealed that, for the Anxious Strong Shock 
Group, the speed of reaction showed an in- 
crease from the 12-sec. to the 21-sec. interval, 
whereas, in all other groups, the speed of reac- 
tion decreased. Because of the theoretical im- 
plications of such a finding, if reliable, an 
analysis of trend of the six curves was done in 
order to provide a more sensitive measure of 
possible differences in slope between groups. 
The results indicated that neither increase in
anxiety nor increase in stimulus intensity was 
effective in steepening the slope of the pos- 
terior portion of the temporal gradient of 
response strength. 


Since the steepness of the slope of the
temporal gradient of response strength was the
only instance of insignificant results, the
gradients were examined for a possible explanation.
An examination revealed that in
the case of the Anxious Strong Shock Group
the speed of reaction, instead of increasing appreciably
at the 12-sec. interval, was only
slightly faster than at the 9-sec. interval. For
every other group there was a considerable rise
in the gradient between these two points. In 
addition, it was not the over-all shape of the 
gradient of the Anxious Strong Shock Group 
which deviated from that of the other groups, 
but only the reaction at the 12-sec. interval. 
It would seem a reasonable inference that this 
point represented a chance deviation rather 
than a stable characteristic of the group. On 
logical grounds there is no apparent reason 
why the relative positions of the groups in 
terms of speed of reaction at this 12-sec. in- 
terval should have differed from their posi- 
tions in the training trials which were also 
12 sec. apart. If this score of the Anxious 
Strong Shock Group was a chance deviation, 
the results of the above analyses of differences 
in steepness of slope of the temporal gradient 
of response strength are unreliable, and con- 
clusions from them should be drawn only with 
great caution. 


Discussion 


Since speed of reaction was used as a meas- 
ure of drive in the present study, the results 
provided additional evidence that both mani- 
fest anxiety and stimulus intensity may be 
regarded as having motivational effects. The
results obtained indicated that either an increase
in anxiety level or an increase in the
intensity of stimulation may produce a significant
increase in drive.

In regard to the problem of the effect of different
stimulus intensities on the drive of
anxious and nonanxious individuals, there was
no evidence for the predicted decrease in the
difference in total drive levels of the two groups
as the stimulus intensity was increased. However,
there are two important considerations.
First, the use of both a buzzer and a shock as
signals involved differences not only in the
intensity of the stimuli, but in the modality
as well. Since there is evidence indicating
that reaction time is slower in the case of
[illegible] than in the case of auditory stimuli,
the issue of how much the increase in speed
found in the present study was due to
the intensity of the stimulus and how
much was due to differences in modalities is
[illegible]. Second, examination of the
effects of the two intensities of shock alone
on the two criteria of increased speed of reaction
during training and raising the temporal
gradient reveals that the decrease in difference
between anxious and nonanxious groups was 
in the predicted direction. It is possible that 
the introduction of a stimulus affecting another 
modality might have obscured this relation- 
ship to some extent. 
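
The direction of this shock-only comparison can be checked against the cell means reported in Tables 1 to 3. The sketch below is purely descriptive: it says nothing about significance, and the interaction was in fact not significant in any analysis. The dictionary layout and function names are introduced here.

```python
# Cell means (anxious, nonanxious) copied from Tables 1-3, shock groups only.
SHOCK_MEANS = {
    "training (Table 1)":           {"weak": (5.86, 5.36), "strong": (7.62, 7.16)},
    "anterior gradient (Table 2)":  {"weak": (4.15, 3.49), "strong": (5.10, 5.01)},
    "posterior gradient (Table 3)": {"weak": (7.11, 5.73), "strong": (8.49, 7.38)},
}


def anxiety_gap(cell):
    """Anxious minus nonanxious mean speed of reaction for one cell pair."""
    anxious, nonanxious = cell
    return anxious - nonanxious


def gap_change(measure):
    """Gap under strong shock minus gap under weak shock (negative = narrowing)."""
    return anxiety_gap(measure["strong"]) - anxiety_gap(measure["weak"])


for name, measure in SHOCK_MEANS.items():
    print(name, round(gap_change(measure), 2))
```

On all three measures the gap between anxious and nonanxious groups is smaller under strong shock than under weak shock, although for the training trials the change is very slight.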

The theoretical explanation of the failure 
to get the predicted decrease in difference in 
total drive level between anxious and non- 
anxious groups with increased stimulus in- 
tensity which is simplest and most in keeping 
with the observed behavior of the subjects 
may be that, in spite of the fact that the 
anxious groups consistently reacted faster 
than the nonanxious ones, the irrelevant 
anxiety drive was fairly high for both groups. 
For example, if one thinks of the anxiety 
drive as ranging from unconcern (no anxiety) 
to panic (extreme anxiety), it is possible to 
have one group significantly less anxious than 
another but, at the same time, to have both 
groups fairly high on this range. Thus, it is 
quite possible to have one group less anxious 
than another but still tense enough to prevent 
a significant decrease in difference between 
the two groups with increased intensity of 
stimulation.⁴ Since 64 of the 84 Ss had been
in an experiment in which they were shocked, 
and since most of them expressed concern 
that they would be shocked again, the entire 
population might be described as “shock 
sensitive." 

The present results also indicate that, con- 
trary to Rosenbaum's speculation that the 
drive properties of anxiety may not be aroused 
under conditions of little or no threat, anxious 
individuals had a higher drive level than non- 
anxious ones even when the stimulus was not 
very intense. Such findings fit nicely with the 
clinical concept of anxiety. What the anxious 
individual often fears most is a new or un- 
familiar situation; that is, he has a generalized 
fear of “what might happen.” No matter how
innocuous the task, therefore, the experimental
situation itself is often sufficiently threatening
to arouse a significantly greater amount of
manifest anxiety in him than in nonanxious 
individuals. 

4 If one thinks of the drive value of anxiety and of
stimulus intensity as ranging from 0 to 100, Hull's
formula for combination of drives, $100(\dot{D} + D)/(\dot{D} + 100)$,
can be used to construct a table of hypothetical
values of total drive to illustrate the above point.
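
A small version of such a table can be produced as follows. All drive values are hypothetical and the function names are introduced here; the sketch compares two pairs of groups that differ by the same 20 points of irrelevant drive, one pair low on the scale and one pair high.

```python
def total_drive(irrelevant: float, relevant: float) -> float:
    """Drive combination: 100 * (irrelevant + relevant) / (irrelevant + 100)."""
    return 100.0 * (irrelevant + relevant) / (irrelevant + 100.0)


def gap_reduction(anxious: float, nonanxious: float, weak: float, strong: float) -> float:
    """How much the anxious-minus-nonanxious gap shrinks from weak to strong stimulus."""
    gap_weak = total_drive(anxious, weak) - total_drive(nonanxious, weak)
    gap_strong = total_drive(anxious, strong) - total_drive(nonanxious, strong)
    return gap_weak - gap_strong


# Hypothetical values: the same 20-point separation placed low or high on the scale.
low_pair = gap_reduction(anxious=40, nonanxious=20, weak=20, strong=60)
high_pair = gap_reduction(anxious=80, nonanxious=60, weak=20, strong=60)
print("reduction when both groups are low: ", round(low_pair, 2))
print("reduction when both groups are high:", round(high_pair, 2))
```

With these assumed values the narrowing of the gap is smaller when both groups sit high on the anxiety range, so a real but small reduction would be harder to detect in a "shock sensitive" population.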




SUMMARY


The present study was designed to investi- 
gate the relative increase in drive of anxious 
and nonanxious individuals as a function of 
different intensities of stimulation. On the 
assumption that, in a reaction-time study, 
anxiety can be considered as contributing to 
the irrelevant drive and stimulus intensity 
can be considered as contributing to the rele- 
vant drive, Hull's formula for drive combina- 
tion was used to predict that the total drive 
level of anxious individuals would be greater 
than that of nonanxious individuals, but that 
an increase in stimulus intensity would result 
in a reduction in the difference in the total 
drive level between the two groups. 

Anxious Ss were defined as individuals whose 
scores were in the upper 20 per cent of the
Taylor anxiety scale, and nonanxious Ss 
as individuals whose scores were in the lower 
20 per cent of the scale. There were 42 Ss 
in each group. Stimulus intensity was varied 
by using a strong shock, a weak shock, and a 
buzzer as signals for pressing a key in a simple 
reaction-time procedure. Following training, 
reaction time to stimuli presented at intervals 
of shorter and longer duration than the 12-sec. 
training interval was determined in order to 
obtain a temporal gradient of response 
strength. Increased speed of reaction during 
training and increase in the height and a steep- 
ening of the slope of the temporal gradient of 
response strength were the three measures of 
increase in drive. 

The results indicated that both an increase 
in manifest anxiety and an increase in stim- 
ulus intensity were effective in increasing the 
speed of reaction during training, and in 
raising the height of the temporal gradient of 
response strength; neither factor was effective 
in steepening the slope of the gradient. In 
none of the measures used was there a signifi- 
cant change in the difference in reaction time 


between anxious and nonanxious groups as a
function of differences in stimulus intensity.
These results were interpreted as indicating 
that both increase in manifest anxiety and 
increase in stimulus intensity may be re- 
garded as having motivational effects. It 
was speculated that the failure to find a sig- 
nificant decrease in the difference in speed of 
reaction between anxious and nonanxious 
individuals with increased stimulus intensity
might be attributable to the fact that both
groups were responding at relatively high 
anxiety levels and/or to the fact that the 
stimulus not only varied in intensity but also
affected different modalities. 


REFERENCES 


1. Darrow, C. W., & Solomon, A. P. Galvanic skin
reflex and blood pressure in psychotic states:
reaction to sensory, indifferent ideational and
critical ideational stimuli. Arch. Neurol. Psychiat.,
Chicago, 1934, 32, 273-299.

2. Fenichel, O. The psychoanalytic theory of neurosis.
New York: Norton, 1945.

3. Hull, C. L. Principles of behavior. New York:
D. Appleton-Century, 1943.

4. Hunter, T. A., & Brown, J. S. A decade-type
electronic interval timer. Amer. J. Psychol., 1949,
62, 570-575.

5. Johanson, A. M. Influence of incentive and punish-
ment upon reaction time. Arch. Psychol., 1922,
8, 54-93.

6. Malmo, R. B. Experimental studies of mental
patients under stress. In M. L. Reymert (Ed.),
Feeling and emotion. New York: McGraw-Hill,
1950.

7. Miller, N. E., & Dollard, J. Social learning and
imitation. New Haven: Yale Univer. Press, 1941.

8. Mowrer, O. H. Stimulus-response analysis of
anxiety and its role as a reinforcing agent.
Psychol. Rev., 1939, 46, 553-565.

9. Passey, G. E. The influence of the intensity of un-
conditioned stimulus on the acquisition of a
conditioned response. J. exp. Psychol., 1948, 38,
420-428.

10. Rosenbaum, G. An experimental study of a tem-
poral gradient of response strength in instrumen-
tal conditioning. Unpublished master's thesis,
Univer. of Iowa, 1947.

11. Rosenbaum, G. Stimulus generalization as a func-
tion of clinical and experimentally induced anx-
iety. Unpublished doctor's dissertation, Univer.
of Iowa, 1950.

12. Solomon, A. P., & Fentress, T. L. Galvanic skin
reflex and blood pressure reactions in psycho-
neurosis. J. nerv. ment. Dis., 1934, 80, 163-182.

13. Spence, K. W. Theoretical interpretations of
learning. In C. P. Stone (Ed.), Comparative
psychology. New York: Prentice-Hall, 1951.

14. Spence, K. W. Learning and performance in eyelid
conditioning as a function of the intensity of the
UCS. J. exp. Psychol., 1953, 45, 57-63.

15. Spence, K. W., & Taylor, J. Anxiety and the
strength of the UCS as determiners of the amount
of eyelid conditioning. J. exp. Psychol., 1951,
42, 183-188.

16. Taylor, J. The relationship of anxiety to the con-
ditioned eyelid response. J. exp. Psychol., 1951,
41, 81-92.

17. Woodworth, R. S. Experimental psychology. New
York: Holt, 1938.


Received March 7, 1953. 


SOME EFFECTS OF COOPERATION AND COMPETITION UPON 
SMALL GROUP BEHAVIOR 1


MARTIN M. GROSSACK 
Philander Smith College, Little Rock, Arkansas


COOPERATIVE and competitive social
situations are often said to have
differential effects on group behavior.
This paper reports an experiment testing 
specific hypotheses that concern some of these 
differences. The theoretical rationale is based 
on the following three assumptions: 

1. A cooperative situation is one of “task 
direction” (1), while a competitive situation 
is not. A situation of task direction is defined 
as one in which an individual goal is reached 
by the achieving of a group’s task. 

According to Deutsch (2), the basic differ- 
ence between cooperation and competition is 
in the way an individual achieves his goal in 
both situations. Cooperation was defined by 
him as a situation in which no individual 
reaches his goal unless all other individuals
also enter their goal regions. Competitive situ- 
ations were defined as those in which no 
individual reaches his goal unless all other 
individuals are unable to enter their goal 
regions.

2. Cooperation may be viewed as an example 
of cohesiveness due to task direction. Back (1) 
demonstrated that a relatively high amount of 
cohesiveness (“total field of forces which act
on members to remain in the group”) existed
in situations where the group mediated indi-
vidual goals. 

3. Conditions of task direction (e.g., co-
operation) are likely to produce “instrumental
communications,” while competitive situations
will tend not to have such an effect. Festinger
(4) described instrumental communications as
those “in which the reduction of the force to
communicate depends upon the effect of the
communication on the recipient.”


Competition, as defined here, requires that
an individual's success lead to the failure of
other group members. Instrumental com-
munications (influencing another member to
accept an accurate solution to a problem,
exerting pressures toward uniformity or com-
municating relevant information) would serve
to minimize a competitive individual's own
chance of reaching his goal.

1 This article is based on a dissertation submitted
to the Graduate School of Boston University in 1952.
A detailed review of the literature may be found in
that volume (8). The writer acknowledges the con-
structive help of Drs. Robert Chin, Nathan Maccoby,
and John Thibaut throughout this research.


HYPOTHESES 


1. Indications of cohesiveness will be more 
frequent among cooperative than among 
competitive individuals.

a. Cooperative individuals will attempt to 
influence others more than competitive indi- 
viduals will. 

This prediction is based on the second 
assumption. It has empirical substantiation in 
findings reported by Back (1) and Festinger, 
Schachter, and Back (5) that cohesiveness is a 
determinant of social influence, and has the 
following theoretical rationale: 


The magnitude of change the group can induce (its 
internal power) will be equal to or less than the magni- 
tude of the resultant force on the member to remain in 
the group (its cohesiveness) (5, p. 166). 


b. Cooperative individuals will send and 
accept more pressures toward uniformity than 
will competitive individuals.

This hypothesis is derivable from Festing-
er's theoretical treatment of communication
(4). Results of Festinger and Thibaut (7)
show more acceptance of pressures toward
uniformity when greater pressures are induced
on individual members. In a cooperative
situation we can assume such pressures to
exist because of the need to locomote as a
group.

c. Cooperative individuals will send more 
relevant communications than will competitive 
individuals. 

Communications of relevant information 
may be considered instrumental for coopera- 
tive group locomotion, but not for individuals 
in a competitive situation. Interpersonal com- 
munication of an instrumental type may be 
considered the means by which cooperative 
individuals facilitate one another’s goal 
achievement. 




2. Cooperative individuals will send and 
receive more communications, more instru- 
mental communications, and fewer consum- 
matory communications than will competitive 
individuals. 

Communications of opinion and information 
are considered instrumental, while those of 
tension and antagonism are coordinated to 
consummatory communications (those in 
which “the reduction of the force to com- 
municate occurs as a result of the expression 
and does not depend upon the effect... on 
the recipient" [4, p. 281]). 

3. Cooperative individuals are more likely
to take group functions (actions that facilitate 
group solidarity or smooth functioning) and 
less likely to take individual functions (actions 
not directed toward group goal achievement, 
e.g., aggression, blocking, etc.) than are com- 
petitive individuals. 

This is a replication of Deutsch's hypoth- 
esis (2). 


METHOD 


Eighteen female student groups of five individuals 
each were created for experimental study. Seats in the 
experimental room were so arranged that subjects (Ss), 
upon entering and receiving places, had to face a wall, 
and were unable to see one another. 

The participating Ss were asked neither to speak 
nor turn around and to raise their hands if there were 
any questions. After five Ss were seated and assigned 
letters from A to E, the experimental group meeting 
was ready to begin. 


Creation of Independent Variable 


Each S received a sheet containing a different set 
of instructions except for the fifth person, who was 
given one of the other four randomly. The types of 
instructions distributed were:2

1. High motivation, cooperative. The Ss were told that 
the group would be rated as a group on its solutions to 
a human relations problem. The best group would be 
rewarded. 

2. High motivation, competitive. Here S was told
that she would be rated individually and that the
individual with the best solution would be rewarded.


3. Low motivation, cooperative. The S was told that
E was just studying how groups solve problems, that Ss
will be rated as a group but that no one will ever know
about the group’s rating.

4. Low motivation, competitive. The S would be
rated as an individual. The E was described as inter-
ested in how individuals solve problems, but not inter-
ested in whether any specific person does well or poorly.

2 The full text of the instructions has been deposited
with the American Documentation Institute. Order
Document No. 4174 from ADI Auxiliary Publications
Project, Photoduplication Service, Library of Congress,
Washington 25, D. C., remitting in advance $1.25 for
35 mm. microfilm or $1.25 for 6 x 8 in. photocopies.
Make checks payable to Chief, Photoduplication Serv-
ice, Library of Congress.

(The motivational aspects of the instructions were 
quite unsatisfactory in producing differences in re- 
sponse. An analysis of the data in terms of motivational 
effects was not completed.) 

The Ss were permitted to communicate by writing 
notes, if they wanted to, during three periods of four 
minutes each. 
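
The distribution of instruction sheets described in this section can be sketched as below. This is a minimal illustration, not the procedure actually used: the mapping of sheets to seat letters, the use of a shuffle, and all names in the code are assumptions. The text says only that four Ss received different sheets and the fifth received one of the four at random; how the fifth sheet was balanced across the eighteen groups is not described.

```python
import random

# The four instruction types named in the text.
INSTRUCTION_TYPES = [
    "high motivation, cooperative",
    "high motivation, competitive",
    "low motivation, cooperative",
    "low motivation, competitive",
]

# Seat letters assigned to the five Ss in a group.
SEATS = ["A", "B", "C", "D", "E"]


def assign_instructions(rng):
    """Give each of four Ss a different sheet and a fifth S a random duplicate.

    Which seat receives which sheet is an assumption made for illustration;
    here the five sheets are simply shuffled over the five seats.
    """
    sheets = list(INSTRUCTION_TYPES)
    sheets.append(rng.choice(INSTRUCTION_TYPES))  # fifth S: one of the four at random
    rng.shuffle(sheets)
    return dict(zip(SEATS, sheets))


if __name__ == "__main__":
    group = assign_instructions(random.Random(1954))
    for seat, sheet in group.items():
        print(seat, "->", sheet)
```

Every group produced this way contains all four instruction types, with exactly one type appearing twice.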


Group Task 


The problem selected for use in this experiment was 
a short case study of a delinquent boy, Johnny Rocco 
(12). Individuals receiving the case study were asked 
to consider the problem from the point of view of a 
person responsible for the boy's future treatment.
Schachter's love-punishment continuum, modified by 
the writer, was also distributed. The Ss were asked 
three times during the experiment to indicate which of 
seven alternatives was the preferred treatment for 
the boy. 


Control of Group Process 


It was believed methodologically valuable to develop
a technique that would control the operation of per- 
sonality variables in the interaction situation. By 
systematically substituting a standard set of stimulus 
messages, designed by E, for those actually sent by the 
Ss, control of this aspect of interaction became possible. 
Also, this prevented individuals from learning that 
each received different instructions. Some implications 
of this technique are discussed elsewhere (9). 

Each S received a set of identical notes, supposedly 
written by the other four people in the group. The 
fifth individual was necessary so that each S would 
receive the same number of notes (i.e., four). The
standard stimulus messages had the following thematic 
content: 

From A: Shows high motivation to work on the 
group's problem, wants to work together with others 
in the group and to reach a common solution. 

From B: Shows high motivation and desire to work 
on the problem alone. 

From C: Shows low motivation, is in field to the 
extent that communication mentions the problem and 
the “sender’s” attitude toward it. 

From D: Shows low motivation and tends to be out 
of field, horseplays.3

The experimenter collected messages written by the 
subjects; meanwhile, Ss indicated preferences on the 
love-punishment continuum and reread the experi-
mental instructions. Then Ss were permitted to open
the stimulus notes received. During the last writing
period, however, Ss first read the planted notes and
then indicated their preferences on the love-punishment
scale. A postsession questionnaire consisting of eighteen
“agree-disagree” items, three sociometric, and one
concerning the instructions was then distributed.


3 The notes are more fully described in (10).




Summary of Experimental Procedure 


For a convenient view of the experimental pro- 
cedure, the steps in each group may be summarized 
as follows: 

1. Five Ss, facing the wall, were asked neither to 
communicate with nor turn towards one another. 

2. Introductory remarks are made to put Ss at ease. 

3. Receipt of experimental written instructions 
(independent variable). Some Ss receive cooperative, 
others competitive instructions. 

4. Receipt of human relations problem and scale
of suggested solutions to it. Instructions indicate that 
communications are restricted to written messages 
during three note-writing (N1, N2, N3) periods. The 
Ss are to communicate at their own discretion. 

5. First note-writing period (N1). The Ss write 
notes which are collected, then indicate preference on 
scale, are reminded neither to turn nor communicate, 
reread instructions, and receive set one of standard 
stimulus notes. 

6. The Ss read received planted notes (set one), 
second note-writing period (N2) starts, Ss write notes 
which are collected, then indicate preference on scale, 
are reminded neither to turn nor communicate, reread 
instructions, and receive set two of planted messages. 

7. The Ss read received planted messages (set two),
third note-writing period starts (N3); Ss write notes
which are collected, receive set three of planted notes,
read notes, and indicate preferences on love-punish-
ment scale.

8. Distribution of postsession questionnaire which 
Ss answer. 

9. Participants are told about the experiment and 
all questions answered when possible. They are thanked 
for their cooperation and asked to keep the details of 
the experiment confidential.


RESULTS AND INTERPRETATIONS 


The results of this experiment will be dis-
cussed in four sections: (a) the experimental
instructions, (b) cooperation as a determinant
of cohesiveness, (c) cooperation as a determi-
nant of instrumental communication, and (d)
conflicts in expectations.


The Experimental Instructions 


The success of the experiment as a whole
was contingent upon the efficacy of the experi-
mental instructions. A measure of the impres-
sion given by the instructions was available
from the next to the last questionnaire item: “At
the time I received the instructions, they gave
me the following impression (check one):

1. The problem isn't important for me, I
will be rated as my group is rated.

2. The problem isn't important for me, I
will be rated as an individual.

3. The problem is important for me, I will
be rated as my group is rated.

4. The problem is important for me, I will
be rated as an individual.” (Table 1)

TABLE 1
DIFFERENCE BETWEEN RECIPIENTS OF COOPERATIVE
AND COMPETITIVE INSTRUCTIONS IN FREQUENCY
OF RESPONSE TO QUESTIONNAIRE ITEM*

                              RESPONSE
INSTRUCTION RECEIVED      GROUP     INDIVIDUAL
Cooperative               37        8
Competitive               10        34

* Chi square = 28.9, p = .001.

The results in Table 1 support the point of 
view that the cooperative and competitive 
instructions had consequences for individual 
perceptions in the expected direction. Of the 
45 Ss receiving cooperative instructions, 37 
reported the feeling that they would be rated 
as a group. Of 44 Ss receiving competitive 
instructions, 34 reported the feeling that they 
would be rated as individuals (one S did not 
answer the item in a scorable manner). Such 
differences could be expected to occur by
chance with a probability of less than .001.

More evidence for the successful creation of 
cooperation and competition comes from re- 
sponses to item 8 of the postsession question- 
naire as reported in Table 2. The statisti- 
cally significant difference in responses to that 
item (“My success in the group was pretty 
dependent on the other members also suc- 
ceeding”) indicates that the instructions were 
effective as far as cooperation and competition 
are concerned. 

The causal perceptions for experiences of 
success and failure in the group meeting re- 
flected the experimentally created social situ- 
ation. Cooperative Ss agreed with item 15, “If 
I do not get a rating of one, it will be due 
mainly to the other people in the group” 
significantly more than did recipients of com- 
petitive instructions. Similarly, item 14, “If I 
get a rating of one, it will be due mainly to 
my own efforts” was agreed to by competitive 
Ss significantly more often than by coopera- 
tive Ss (Table 2). 


Cooperation as a Determinant of Cohesiveness 


It was predicted that more cohesiveness
would be shown by cooperative than by com-
petitive individuals. This was expected on the
basis that cooperative Ss have their goal
mediated by the group, while this is not the
case for competitive Ss. The results of this
experiment support that prediction (Table 3,
item 1). A content analysis of communications
found cooperative Ss showing cohesive be-
havior (speaking of the group as a whole,
using words: group, we, us, others, etc.) signifi-
cantly more so than did competitive Ss. This
finding supports the assumption of cooperation
as a determinant of cohesiveness. This measure
of cohesiveness is similar to that used by
Polansky, Lippitt, and Redl (11).4

TABLE 2
AVERAGE DIFFERENCES BETWEEN COOPERATIVE AND
COMPETITIVE SUBJECTS IN RESPONSE TO
QUESTIONNAIRE ITEMS

       MEAN SCORE**
ITEM   COOP.   COMP.   DIFF.   t       p      STATEMENT
 1.    2.29    2.36    .07             *      I would like to attend another meeting such as today's.
 2.    2.27    2.28    .01             *      I thoroughly enjoyed today's meeting.
 3.    3.67    4.29    .62     1.63    .20    I tended to experience conflict in today's meeting.
 4.    3.64    4.23    .59     1.48    .20    The needs of other members conflicted with my own.
 5.    4.22    4.30    .08             *      I benefited quite a bit from discussing the problem with the group.
 6.    3.31    3.67    .37     1.08    .30    I contributed a good deal to the group's work.
 7.    4.00    3.42    .58     2.19    .05    Our group was pretty successful.
 8.    3.67    4.55    .88     2.35    .05    My success in the group was pretty dependent on the other members also succeeding.
 9.    3.29    3.58    .29     .735    *      The goal of the group was very important for me.
10.    3.36    3.10    .26     .686    *      I was very involved in the group's problem.
11.                                           I accepted the goal of the group as worthy of my best efforts.
12.                                           I felt it was necessary to agree with the other members about a common solution to the problem.
13.                                           I felt it was more necessary to compete with than cooperate with the other members.
14.                                           If I get a rating of one it will be due mainly to my own efforts.
15.                                           If I don't get a rating of one, it will be due mainly to the other people in the group.
16.                                           At times I felt angry toward some of the other members.
17.                                           Giving information to the other members benefited me.
18.                                           I felt my individual reputation was at stake during the group meeting.

[Entries for items 11 to 18 are only partly legible in the source. Surviving fragments: 1.84 .10; 1.51 .20; 3.18; 4.50; 4.15; 3.14 2.74 .44; 4.98 5.52.]

* Not significant.
** A score of 1 indicates strong agreement; 2, moderate
agreement; 3, slight agreement; 4, slight disagreement; 5, moder-
ate disagreement; and 6, strong disagreement.

TABLE 3
DIFFERENCES BETWEEN COMPETITIVE AND COOPERATIVE
INDIVIDUALS AS SHOWN IN CONTENT
ANALYSIS OF WRITTEN COMMUNICATIONS*

CONTENT                                                                         CHI SQUARE  p

 1. Indicates cohesiveness              does                does not
      Coop.                             26                  19
      Comp.                             14                  31                  5.49        .02
 2. Attempts influence                  does                does not
      Coop.                             26                  19
      Comp.                             13                  32                  6.56        .02
 3. Sends pressures toward uniformity   does                does not
      Coop.                             19                  26
      Comp.                             6                   39                  7.98        .01
 4. Sends relevant content (mentions
    specific solution to problem)       less than 3         3 or more
      Coop.                             24                  21
      Comp.                             34                  11                  3.90        .05
 5. Number of words sent                under 150           150 or more
      Coop.                             18                  27
      Comp.                             27                  18                  2.84        .10
 6. Communications “received”†          6 or less           over 6
      Coop. “sender”                    10                  26
      Comp. “sender”                    31                  5                   23.57       .001
 7. “Receives” expression of tension    0-1                 2 and more
      Coop. “sender”                    33                  3
      Comp. “sender”                    22                  14                  7.56        .01
 8. “Receives” expression of antagonism does                does not
      Coop. “sender”                    5                   31
      Comp. “sender”                    14                  22                  4.59        .05
 9. “Receives” opinion                  less than 10        10 or more
      Coop. “sender”                    6                   30
      Comp. “sender”                    28                  8                   24.21       .001
10. “Receives” information              less than 4         4 or more
      Coop. “sender”                    15                  21
      Comp. “sender”                    30                  6                   11.62       .001
11. Communicates tension                does                does not
      Coop.                             10                  35
      Comp.                             18                  26                  2.73        .10
12. Uses nondirective more than
    directive manner of influence‡      does                does not
      Coop.                             16                  9
      Comp.                             5                   8                   4.76        .05
13. Takes individual role               does                does not
      Coop.                             18                  27
      Comp.                             23                  [illegible]         .87
14. “Receives” individual role          observed frequency  expected frequency
      Only to coop. “sender”            1                   15
      Only to comp. “sender”            29                  15                  24.30       .001

* All chi squares contain Yates’s correction for continuity.
† There were a total of 72 “receivers” in 18 groups.
‡ Just includes Ss who made at least one influence attempt.

From the conceptual definition of cohesive- 
ness, a prediction was made that cooperative 
Ss would more frequently attempt social 
influence (an explicit attempt to modify the 
behavior of the recipient, such as accepting a 
particular solution to the human relations 
problem). This received substantial support 
from the results of the experiment. The num- 
ber of cooperative Ss who attempted to influ- 
ence others was significantly greater than the 
number of competitive Ss (see Table 3, item 2, 
p < .02). This concurs with Back's finding of
cohesiveness as a source of influence (1).

Results in Table 3, item 3, substantiate
(p < .01) the hypothesis that cooperative Ss
would send more pressures toward uniformity
than competitive Ss (favors reaching common
agreement by all members). Festinger (4)
hypothesizes that high cohesiveness is a source
of increased pressure toward uniformity for
individuals in a group. Since cooperative Ss
were to be rated as a group, they were sub-
jected to greater pressures toward uniformity
than were competitive Ss.
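
The chi squares in Table 3 are stated to include Yates's correction for continuity. As a rough check, the sketch below applies the standard corrected formula for a 2 x 2 table to the frequencies of item 3 (sends pressures toward uniformity) and item 5 (number of words sent). The function name is hypothetical, and the formula is the textbook one rather than a computation documented by the author; for these two rows it agrees with the printed values to two decimals, though other rows need not match as closely.

```python
def yates_chi_square(a, b, c, d):
    """Chi square with Yates's continuity correction for the 2 x 2 table

        a  b
        c  d

    using N * (|ad - bc| - N/2)^2 / (row1 * row2 * col1 * col2).
    """
    n = a + b + c + d
    # The corrected deviation is not allowed to go below zero.
    deviation = max(abs(a * d - b * c) - n / 2.0, 0.0)
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return n * deviation ** 2 / denominator


if __name__ == "__main__":
    # Item 3: Coop. 19 does / 26 does not; Comp. 6 does / 39 does not.
    print(round(yates_chi_square(19, 26, 6, 39), 2))   # 7.98
    # Item 5: Coop. 18 under 150 / 27 over; Comp. 27 under 150 / 18 over.
    print(round(yates_chi_square(18, 27, 27, 18), 2))  # 2.84
```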

Festinger states that “pressures toward uni-
formity will be greater, the more dependent
the members are on the group in order
to reach their goals” (4, p. 273). This condition
was nonexistent for the competitive Ss and ex-
plicitly defined for cooperative individuals. If
a S reaches his goal only when others do not
(competitive), agreeing with others would de-
prive him of success.

Responses to the love-punishment scale
(Table 4) support the hypothesis that co-
operative Ss are more likely to accept pressures
toward uniformity than are competitive Ss.
Festinger and Thibaut (7) also report greater
opinion changes when the pressure toward uni-
formity is increased. Festinger has formulated
the following hypothesis with which the results
here concur: “The amount of change in opinion
resulting from receiving a communication will
increase as the pressure towards uniformity in
the group increases [for the recipient]” (4, p.
276).

TABLE 4
CHANGES IN RESPONSE TO LOVE-PUNISHMENT SCALE
SHOWING ACCEPTANCE OF PRESSURE TOWARD
UNIFORMITY

GROUP            DIFF. MEANS    t       p

A. CHANGE FROM FIRST TO SECOND RESPONSE
Competitives     .103           .791    *
Cooperatives     .278           2.07    .05

B. CHANGE FROM SECOND TO THIRD RESPONSE
Competitives     .026           .176    *
Cooperatives     .361           3.34    .01

* Not significant.

4 Percentage agreement for the categories ranged
from 71 (attempts influence) to 97 for mentions of
specific solutions to the human relations problem.

Deutsch (3) also reports that “Indiv coop
[sic] were affected by the ideas of others
significantly more than were Indiv comp 
[sic].” 

As predicted, communications containing 
relevant information were more frequently 
sent by recipients of cooperative instructions. 
Explicit mentions of specific solutions to the 
problem were considered relevant communica- 
tions (Table 3, item 4). The relationship was 
found to be statistically significant at the .05 
level of confidence. 

When an issue is relevant for group loco- 
motion, there will be more pressures on mem- 
bers to communicate with one another about 
it (4, 12). Here, mentions of specific solutions
to the problem probably were one of the most
relevant topics possible. This result is sub-
stantiated by Schachter (12), who reports that 
pressures to communicate are strongest under 
conditions of relevance to the group. 


Cooperation as a Determinant of Instrumental 
Communications 


The results of this experiment show that 
cooperative Ss communicated more words
per individual and less tension than did com-
petitive Ss. However, the differences (Table 3,
items 5 and 11) are not statistically significant
(.05 < p < .10).

The failure to establish definitely the hy- 
pothesis possibly can be explained in part at 
least by the nature of the experimental design. 
Since competitive Ss as well as cooperative Ss 
were given the alternatives of sending messages 
or thinking alone the entire hour, and since a 
norm of communicating by message was intro- 
duced with the standard stimulus messages, 
there were forces on the competitive Ss to 
communicate. They knew that other indi- 
viduals possibly were writing during the first 
"note-writing" period and received messages 
supposedly from the other group members. 

There are marked differences between co-
operative and competitive Ss (owing to the
standard stimulus messages) when they are
viewed as recipients of communications. Co-
operative Ss (i.e., perceived sources of set A
and set C notes) were much more likely to 
"receive" communications than were com- 
petitive Ss (i.e., perceived sources of set B and 
set D notes). The results reported here sub- 
stantiate the hypothesis (p < .001, Table 3, 
item 6). 

A selective factor seemed to operate that 
channeled certain communications to specific 
"recipients." Individuals who “sent” opinions 
and information “received” such communica- 
tions from their recipients. The same held true 
for communications of tension and antagonism. 
As Festinger, Schachter, and Back report, 
“communication occurs towards those seen as 
being most affected by the information” (5, 
p. 130). Cooperative Ss invited communications
in the messages they sent, while the competi- 
tive standard stimulus notes that were from 
“B” explicitly restrained communications of 
any type and those of “D” invited answers not 
relevant or hostile to the problem at hand. 

Cooperative Ss were more likely to receive 
instrumental communications (opinion and 
information) than were competitive Ss (p <
.001). Cooperative Ss were significantly less
likely to receive consummatory communica- 
tions (tension and antagonism) than were 
competitives (Table 3, items 7, 8, 9, 10). This 
may be explained in terms of the tendency 
for instrumental communications to be chan- 
neled toward cooperative recipients and for
cooperative notes to induce restraining forces
against noninstrumental answers. 

Of the 21 Ss who used nondirective type
social influence, 16 were recipients of coopera-
tive instructions. The difference is significant
(p < .05, Table 3, item 12). Striving for a
goal that is mediated by their group makes
social communication instrumental for co-
operative goal achievement. Cooperative indi-
viduals, rated as a group and having a com-
monality of fate, are more likely to influence
one another than are competitives. This basic
situation makes it less necessary for coopera-
tives to exert considerable effort in attempting
social influence. They need not expect too
much resistance, all other things being equal.

For competitive Ss, the situation is quite 
different. Their goal achievement occurs 
through individual effort and is facilitated by 
the failure of others. Social influence is not 
theoretically necessary for them. Competitive 
Ss can more readily resist inductions, and 
stronger influence attempts probably would be 
necessary. Hence, the difference reported con- 
cerning nondirective influence could be theo- 
retically expected. 

Findings by Deutsch (3) that cooperative Ss
are more likely to take group functions and 
less likely to take individual functions than 
are competitives were not substantiated here 
(Table 3, item 13). No significant differences 
were found here between cooperative and 
competitive individuals in the taking of group 
and individual membership roles. 

One possible explanation is that the hypoth- 
eses in this research neglected the situation 
in which they were tested. Deutsch studied 
groups in face-to-face interaction. It may be 
that note-writing interaction is less likely to 
produce individual role taking. If further re- 
search substantiates this finding, an important 
difference will be found to exist between
ordinary small groups and those situations in
which communications are restricted to written
messages. Festinger (4) suggests that defensive 
reactions are more commonly aroused in face- 
to-face groups. 

When the notes containing individual roles 
were further examined and a comparison made 
between the number of persons who take 
“individual” roles only to competitive “send- 
ers” and those who do so only to cooperative 
“senders,” a statistically significant difference
was found. Twenty-nine persons took indi-
vidual roles toward just competitives (“send-
ers” B and D), while only one person took an
individual role towards a cooperative “sender” 
(Table 3, item 14). The same selective factor 
mentioned in explanation of differences in the 
receipt of consummatory and instrumental 
communications possibly was in operation here. 


Conflicts in Expectations 


Since the planted notes were heterogeneous,
they were bound to conflict in one way or
another with the expectations of the recipients 
as structured by the experimental instructions. 
Some unanticipated questionnaire results sug- 
gest that the frame of reference of an individual 
will determine his expectations of others. From
this, two additional hypotheses may be in-
ferred: (a) Individuals who perceive them-
selves as cooperative will tend to expect co-
operative behavior from one another, and (b)
individuals who perceive themselves as com- 
petitive will tend to expect competitive be- 
havior from one another. 
Competitive Ss tended to accept the item
“Our group was pretty successful” (Table 2)
significantly more often than did cooperative 
Ss. This result may be related to a difference 
in the frame of reference and expectations 
held by cooperative and competitive Ss. 
Competitive Ss could legitimately expect no 
help from other people in the group. Yet they 
received some. On the other hand, cooperative 
Ss could expect, on the basis of the instructions 
they received, cooperative behavior from all 
the other group members. This they did not 
receive. Competitive expectations probably
were exceeded by their receiving help from
cooperative individuals, while the expectations
of those receiving cooperative instructions were 
not met during the group meetings. 

Although not statistically significant, there
was a tendency for cooperative Ss to agree
more often than competitive Ss (Table 2) that
“I tended to experience conflict in today's
meeting” and “The needs of other members
[conflicted] with my own.” Also, cooperative Ss
were less likely to agree that “Giving informa-
tion to the other members benefited me.” These
findings suggest the same interpretation. A
detailed analysis of sociometric responses in
[terms of] conflicts in expectations is presented
[elsewhere].


SUMMARY AND CONCLUSIONS


From previous research on cooperation, 
competition, and studies of small groups, an 
attempt was made to formulate hypotheses 
that would integrate theoretical approaches 
to both areas of investigation. The main 
problem selected for study was to determine 
the consequences of cooperation and competi- 
tion on small group cohesiveness, social influ- 
ence, and communication. 

The results of the experiment permit the 
following conclusions: 

1. Cooperation may be considered a determi- 
nant of group cohesiveness. Cooperative Ss 
showed significantly more cohesive behavior, 
attempts at influence, exertion and acceptance 
of pressures toward uniformity, and communi- 
cations of relevance than did competitive Ss. 

2. Cooperation may be considered a deter- 
minant of instrumental communications. Co- 
operative Ss (attributed senders of cooperative 
messages) received significantly more com- 
munications, more instrumental communica- 
tions (opinion and information), and fewer 
consummatory communications (tension and 
antagonism). Also, they tended to send more 
communications and fewer consummatory ones 
(showing tension), but these findings were not 
quite significant statistically. A significantly 
greater frequency of nondirective influence by 
cooperatives was explained in terms of the 
fact that considerable effort in social influence 
is theoretically not necessary for cooperative Ss 
as compared to competitive Ss. 

A prediction that cooperative Ss would take 
more group and fewer individual functions 
than competitive Ss was not supported. This 
was explained as possibly due to the fewer 
defensive reactions reported for note-writing 
group situations. 

The experimental results support the fol- 
lowing ad hoc hypotheses: 

1. The frame of reference of an individual
will determine his expectations of others.


a. Individuals who perceive themselves as 
cooperative will tend to expect cooperative 
behavior from one another. 

b. Individuals who perceive themselves as 
competitive will tend to expect competitive 
behavior from one another. 

Much remains unknown concerning differ-
ences between face-to-face and note-writing
groups. In problem-solving settings, the latter
may have much to offer as far as efficiency for 
research purposes is concerned. 

The extent to which the norm introduced 
by having all Ss receive communications and 
having the opportunity to send them caused 
competitives to communicate possibly can be 
determined by subsequent study. Also, the 
extent to which cooperative messages induced 
restraining forces against competitive and 
out-of-field responses should be determined.

Further research is needed to study the 
development of channels of communication in 
groups. The differences between cooperative and
competitive senders in the content of communications
received suggest some things about “who gets
what” in the communication process. It would 
seem valuable to study perceptions that send- 
ers have of possible recipients and determi- 
nants of the content sent to specific receivers. 

The relationship between determinants of 
cohesiveness and instrumental communication 
has not been studied intensively. Possibly, in 
situations where cohesiveness is due to attrac- 
tiveness of members or group prestige, one 
may not expect as much instrumental com- 
munication as under conditions of task 
direction. 




REFERENCES 

1. Back, K. W. Influence through social communica-
tion. J. abnorm. soc. Psychol., 1951, 46, 9-24.

2. Deutsch, M. A theory of cooperation and compe-
tition. Hum. Relat., 1949, 2, 129-152.

3. Deutsch, M. An experimental study of the
effects of cooperation and competition upon
group process. Hum. Relat., 1949, 2, 199-232.

4. Festinger, L. Informal social communication.
Psychol. Rev., 1950, 57, 271-282.

5. Festinger, L., Schachter, S., & Back, K. Social
pressures in informal groups. New York: Harper,
1950.

6. Festinger, L., Schachter, S., Back, K., Kelley,
H., & Thibaut, J. Theory and experiment in
social communication. Ann Arbor: Univer. of
Michigan, Research Center for Group Dynamics,
1950.

7. Festinger, L., & Thibaut, J. Interpersonal com-
munication in small groups. J. abnorm. soc.
Psychol., 1951, 46, 92-99.

8. Grossack, M. The effect of cooperation and
competition upon small group cohesiveness,
social influence and communication. Unpub-
lished doctor's dissertation, Boston Univer., 1952.

9. Grossack, M. Controlling interaction in small
group research. J. Psychol., 1953, 35, 241-244.

10. Grossack, M. Cues, expectations and first im-
pressions. J. Psychol., 1953, 35, 245-252.

11. Polansky, N., Lippitt, R., & Redl, F. An investi-
gation of behavioral contagion in groups. Hum.
Relat., 1950, 3, 319-348.

12. Schachter, S. Deviation, rejection and com-
munication. J. abnorm. soc. Psychol., 1951, 46,
190-207.


Received March 13, 1953. 


ETHNOCENTRISM AND TOLERANCE OF TRAIT “INCONSISTENCY” 


IVAN D. STEINER 
University of Illinois 


Perception and memory are “selective”
processes in which some attributes of
complex stimulus objects become accentuated
while others are ignored or denied (2, 3).
When an individual assumes that certain
attributes belong together, this assumption
tends to determine which attributes of a
complex stimulus object he will perceive and
remember (2, 3, 5, 8). These two propositions
provide the basis for an approach to problems
of interpersonal perception in social relations.

The behaviors of interacting persons may be
presumed to be influenced by the percepts
they hold of one another. According to the
above propositions, these percepts are formed
and maintained under the guiding influence
exerted by the perceiver's assumption that
certain human attributes belong together
while others do not. Thus, the individual's
assumption that certain attributes belong
together is expected to influence his percept of
the person with whom he is interacting; the
percept is expected to influence his behavior
toward that other person. From a research
point of view, this theoretical orientation
suggests that knowledge of an individual's
perceptual assumptions should enable one to make
predictions concerning his percepts and
behaviors. Conversely, it should be possible to
predict the nature of an individual's perceptual
assumptions from a knowledge of his percepts
or behaviors. If two categories of subjects are
known to hold different percepts, it may be
inferred that their percepts are based on
different sets of perceptual assumptions. Given
a knowledge of the differences between the
percepts which are held by these two categories,
it should be possible to make predictions
concerning differences between the perceptual
assumptions they are employing. This paper
reports a study in which predictions of this
kind are made and tested. It investigates the
perceptual assumptions made by two types of
people, types which have been shown by
other researchers to hold different percepts of
the persons with whom they interact.

Persons with a high degree of ethnocentric
prejudice and persons with a low degree of
ethnocentric prejudice were chosen to represent
two distinct types of perceivers. Two
major considerations led to this choice. There
is empirical evidence indicating that
ethnocentrism is associated with differences in
cognition. Studies such as those reported by
Adorno et al. (1) have found that high
ethnocentrics differ considerably from low
ethnocentrics in their percepts of people.
Ethnocentrism is positively associated with
perceptual perseveration (4) and “intolerance
of ambiguity” (4, 6, 7). In addition to providing
types of perceivers who hold different
percepts of people, the ethnocentrism variable is
one in which social psychologists have had
a practical interest. For both of these reasons,
high and low ethnocentrics were selected for
investigation. However, it should be apparent
that this choice was somewhat arbitrary.
Whenever any two types of perceivers hold
different percepts, it seems pertinent to
inquire whether this difference may be a
consequence of their holding different perceptual
assumptions.

This study not only attempts to determine
whether high and low ethnocentrics differ 
from one another in their perceptual assump- 
tions; it also formulates certain postulates 
concerning the systematic nature of the dif- 
ference between the perceptual assumptions of 
high and low ethnocentrics. Two hypotheses 
will be derived and tested. 


THEORETICAL FORMULATION 


Definitions of Terms 

Trait contingency. The problem is one of 
comparing the assumptions which high and
low ethnocentrics make concerning the rela-
tionship of one human trait to another. If
high ethnocentrics assume that two designated 
attributes are likely to occur together, will 
low ethnocentrics also expect these two at- 
tributes to occur together? In the following 
discussion the term trait contingency will be 
used to denote the level of probability which 
an individual perceiver attributes to the co-
occurrence of two traits. If he thinks a given
pair of traits have a high probability of oc-
curring together, those two traits have a high
trait contingency for him. If he feels that they 
have a low probability of occurring together, 
they have a low trait contingency for him. 
Trait discrepancy. All of us are accustomed 
to making evaluative judgments of human 
traits. We say that honesty and intelligence 
are “good” traits and that laziness and un- 
faithfulness are “bad” traits. We would prob- 
ably experience little difficulty in assigning 
most traits to positions along a continuum 
representing various degrees of “goodness” 
vs. “badness.” In the following discussion the 
term trait discrepancy will be used to denote 
the distance separating two traits along an 
evaluative dimension, the ends of which are 
defined by the polar terms good and bad. If 
two traits are separated by a short distance 
(i.e., have very similar evaluative connota-
tions), they have a small trait discrepancy. If
two traits are separated by a long distance 
(i.e., have very dissimilar evaluative connota- 
tions), they have a large trait discrepancy. 
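As a minimal sketch of this definition, trait discrepancy can be computed as the absolute difference between two traits' positions on the good-bad continuum. The ratings below are invented for illustration and are not the study's data; the 1-to-7 scale and the name `trait_discrepancy` are assumptions.

```python
# Hypothetical mean evaluative ratings (1 = bad, 7 = good); not data from the study.
ratings = {"honest": 6.5, "intelligent": 6.0, "lazy": 2.0}

def trait_discrepancy(a: str, b: str) -> float:
    """Distance between two traits along the evaluative dimension."""
    return abs(ratings[a] - ratings[b])

print(trait_discrepancy("honest", "intelligent"))  # small discrepancy
print(trait_discrepancy("honest", "lazy"))         # large discrepancy
```

The measure is symmetric, so the order in which the two traits are named does not matter.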


Postulates 


It may be expected that the trait con- 
tingency of any pair of traits will be partially 
determined by the degree to which the two 
traits are alike on certain dimensions of mean- 
ing. If two traits connote about the same de- 
gree of activity vs. passivity, or the same 
degree of goodness vs. badness, they may, for 
that reason, have a high trait contingency. 
This study concentrates on one dimension of 
trait similarity: the evaluative, or good-bad 
dimension. It investigates the relationship 
between trait contingencies and trait dis- 
crepancies, and attempts to determine whether 
the relationship between these two variables 
is greater for high ethnocentrics than it is 
for low ethnocentrics. 

The most general assumption concerning 
the relationship between these variables may
be stated as follows: trait contingencies will 
vary as an inverse monotonic function of trait 
discrepancies. This assertion contends that 
the larger the trait discrepancy (other things 
being equal), the smaller will be the trait 
contingency. Any unit of trait discrepancy will 
have the effect of lowering the trait contingency 
of a pair of traits. Many units of trait dis-
crepancy will reduce trait contingencies by a
larger amount than will few units of trait
discrepancy.

The above statement of relationship makes 
no allowance for a differential between high 
and low ethnocentrics. Studies such as those 
reported by Adorno et al. (1) would seem to 
suggest that high ethnocentrics are more
sensitive than are low ethnocentrics to dif-
ferences along a good-bad dimension. For
example, Frenkel-Brunswik (1, ch. 11) 
described her low-ethnocentric Ss as more 
likely to see a heterogeneous mixture of good 
and bad traits within any one person. They 
seemed more likely than high ethnocentrics 
to "tolerate" the co-occurrence of pairs of 
traits which have large trait discrepancies. 
Therefore, it seems desirable to refine the 
above statement of the relationship between 
trait contingencies and trait discrepancies. 

Let us propose that high and low ethno- 
centrics do, in fact, differ in sensitivity to trait 
discrepancies. High ethnocentrics, we shall 
say, are more sensitive to the trait discrepancy
factor. Any unit of trait discrepancy will have
a larger effect on the trait contingencies of
high ethnocentrics.

Postulate 1. The effect of any unit of trait 
discrepancy separating one trait from another 
will be to decrease the trait contingency of 
those traits, the amount of this decrease being 
larger for high ethnocentrics than for low 
ethnocentrics.

Figure 1 presents this postulate graphically.
As trait discrepancies become progressively
larger (i.e., as more and more units of trait
discrepancy are added), trait contingencies
sustain increasing amounts of total decrease.
Furthermore, this total amount of decrease is
always larger for high ethnocentrics than for
low ethnocentrics. Every unit of trait dis-
crepancy is shown to result in a larger decrease
in the trait contingencies of high ethnocentrics
than in the trait contingencies of low ethno-
centrics. (It is to be noted that the two lines
need not be straight in order to satisfy the
requirements of this postulate.)

[Figure 1: postulated decrease in trait contingency (vertical axis) plotted against units of trait discrepancy, 1 to 4 (horizontal axis).]

FIG. 1. POSTULATED EFFECT OF THE TRAIT DISCREPANCY FACTOR ON TRAIT CONTINGENCIES
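Postulate 1 can be stated compactly in symbols. The notation is not the author's: let $D_H(u)$ and $D_L(u)$ be assumed names for the total decrease in trait contingency produced by $u$ units of trait discrepancy for high and low ethnocentrics, with $D_H(0) = D_L(0) = 0$. Under that reading, a sketch of the postulate is:

```latex
\begin{aligned}
D_g(u + 1) &> D_g(u), && g \in \{H, L\},\; u \ge 0 \\
D_H(u + 1) - D_H(u) &> D_L(u + 1) - D_L(u), && u \ge 0
\end{aligned}
```

Summing the second inequality over units gives $D_H(u) > D_L(u)$ for every $u \ge 1$, which is the statement that the total decrease is always larger for high ethnocentrics. Neither function needs to be linear.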

Postulate 1 has dealt exclusively with the 
possible effects of the trait discrepancy factor. 
It is now necessary to consider the effects which 
other factors may have on trait contingencies. 

Postulate 2. The combined effects of all 
other factors will not be such that they cancel 
the differences between the trait contingencies 
of high and low ethnocentrics which have 
been created by the trait discrepancy factor. 

It is possible that certain other factors may 
have differentiating effects on the trait con- 
tingencies of high and low ethnocentrics. If 
these differentiating effects are negatively cor- 
related with those exerted by the trait dis- 
crepancy factor, they would tend to cancel 
the differentiating effect of the latter factor. 
The two lines shown in Fig. 1 would then as- 
sume the same slope; the uncontrolled effect 
of other factors would eliminate the dif- 
ference in slope which would have been present 
if the trait discrepancy factor had been the 
only determinant of trait contingencies. 
This postulate does not deny that other
factors may be potent determinants of trait
contingencies. It denies only that such other
factors will have a combined effect which sys-
tematically differentiates between high and
low ethnocentrics in a direction opposite to
the differentiating effect of the trait dis-
crepancy factor. In substance, it asserts that
the relationship shown in Fig. 1 will prevail
even when other factors are permitted to vary.


Hypotheses 


Consider the case in which a trait, A, is
separated from another trait, B, by one unit
of trait discrepancy. The presence of this unit
of trait discrepancy will decrease the trait
contingencies which both high and low ethno-
centrics will give to this pair of traits. This
is indicated in Fig. 1 by the positive slope
of the two lines. However, the steeper slope of
the line representing the relationship for high
ethnocentrics indicates that the presence of
this one unit of trait discrepancy will cause 
a larger decrease in the trait contingency 
which high ethnocentrics give to this AB pair. 
The trait contingency which low ethnocentrics 
give to this AB pair will be relatively less af- 
fected by the presence of this unit of trait 
discrepancy. Consider now the case in which 
trait A is separated from a third trait, C, by 
two units of trait discrepancy. The trait con- 
tingencies which high and low ethnocentrics 
will give to this AC pair will be even more dif- 
ferentially decreased than was true in the 
case of the AB pair. The difference between 
the amounts of depressive influence exerted by 
the trait discrepancy factor on such AB and 
AC pairs will always be greater for high ethno- 
centrics. 

If, in the above case, the AC pair had had
a trait discrepancy of three units, the dif- 
ference between high and low ethnocentrics 
would have been even more striking. In the 
case of high ethnocentrics, the AB and AC 
pairs would have sustained markedly different 
amounts of decrease. But in the case of low 
ethnocentrics, these two pairs would have 
sustained only moderately different amounts of 
decrease. Given the steeper slope of the line 
representing the relationship for high ethno- 
centrics, we may generalize as follows: the 
greater the difference between the trait dis- 
crepancies of an AB and AC pair, the more 
dissimilar high and low ethnocentrics will be 
with respect to the amount by which the de- 
crease sustained by the AC pair exceeds that 
sustained by the AB pair. 
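A worked case may make this generalization concrete. Assume, purely for illustration, that the decrease is linear in trait discrepancy, with slopes $s_H > s_L > 0$ for high and low ethnocentrics (the postulate itself does not require straight lines), and let $d_{AB} < d_{AC}$ be the two trait discrepancies. These symbols are not the author's. Then:

```latex
\begin{aligned}
\text{excess decrease of AC over AB (group } g\text{)} &= s_g\,(d_{AC} - d_{AB}) \\
\text{high-low difference in that excess} &= (s_H - s_L)\,(d_{AC} - d_{AB})
\end{aligned}
```

With $s_H = 2$, $s_L = 1$, and $d_{AB} = 1$, the difference is 1 when $d_{AC} = 2$ and 2 when $d_{AC} = 3$, so the two groups diverge as the discrepancies of the two pairs grow further apart.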

Let us confront both high and low ethno- 
centrics with a series of test items. Each item 
will consist of an AB and an AC pair of traits; 
the AC pair will always have a larger trait 
discrepancy than the AB pair. The task of S
is to cross out the pair in each item which has
the lower trait contingency (i.e., the pair
which consists of traits having the lower
probability of co-occurrence). What hypotheses 
can be advanced concerning the responses of 
high and low ethnocentrics in such a test 
situation? 
It cannot be hypothesized that all Ss, re-
gardless of ethnocentrism, will tend to cross
out the AC pairs. Factors other than the
trait discrepancy factor might combine to
give the AC pairs very high trait contingen-
cies. Even though the trait discrepancy fac-
tor may decrease the trait contingency of the
AC pair by a considerable amount, it is still 
very possible that the resultant trait contin- 
gency for the AC pair may be higher than 
that for the AB pair. In the absence of con-
trols over such other factors, no hypothesis
could be made concerning the comparative 
trait contingencies of the AB and AC pairs. 

However, it is possible to advance hypotheses 
concerning the differences between the re- 
sponses of high and low ethnocentrics. Re- 
gardless of the effect of other determining 
factors, the trait discrepancy factor will exert 
a more marked effect on the trait contingen- 
cies of high ethnocentrics. In the case of high 
ethnocentrics, it will exert a stronger depressive 
influence on the AC pair than on the AB pair.
In the case of low ethnocentrics, it will exert
more nearly equal influences on the AB and
AC pairs. Consequently, the amount of de- 
crease sustained by the AC pair relative to the 
amount sustained by the AB pair should tend 
to be greater for high ethnocentrics. This 
means that the resultant trait contingency 
for the AC pair relative to that of the AB pair 
should be smaller for high ethnocentrics than 
for low ethnocentrics.

Therefore, given test items in which the AC
pair always has a larger trait discrepancy than 
the AB pair, we may predict: 

Hypothesis 1. High ethnocentrics will cross 
out more AC pairs than will low ethnocentrics. 

We have already noted that when the AB 
and AC pairs have markedly different trait 
discrepancies, the trait discrepancy factor 
should distinguish more clearly between high 
and low ethnocentrics. The greater the dif- 
ference between the trait discrepancies of the 
AB and AC pairs, the more dissimilar should 
be the trait contingencies given by high and 
low ethnocentrics. 

Hypothesis 2. The greater the difference be-
tween the trait discrepancies of the AB and
AC pairs in any test item, the greater will be
the excess of high ethnocentrics over low ethno- 
centrics who will cross out the AC pair. 


METHOD 


In a preliminary data-gathering step, 30 Florida 
State University students rated the desirability of 32 
traits. A ten-point rating scale was used. A computa- 
tion was then made of the mean rating received by each 
trait. With these mean ratings as criteria of evaluative
connotation, 18 test items were prepared. Each test
item consisted of a selected trait A paired successively
with a trait B and a trait C. Different traits were used
in different test items, but in every case the selection
of traits was such that traits A and B had received
approximately equal mean ratings on the evaluative
continuum. Trait C was always a trait which had re-
ceived a somewhat different evaluative placement.
Randomization was employed in determining whether
the AB or AC pair would occupy the left-hand position
in a test item. The first test item was: (passionate—
quick-tempered) (lazy—quick-tempered).¹

Fifty-two Florida State University students re-
sponded to the test items by crossing out the pair of
traits in each item which seemed less likely to occur
together.² The tests were scored by counting the number
of items to which the S responded by crossing out the 
AC pair (i.e., the pair with the larger trait discrepancy). 
Each S also responded to the California Ethnocentrism 
Scale (1, p. 142) which will be referred to here as the E 
scale. The Ss also reported their academic averages at 
the university. 
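The scoring rule just described can be sketched in a few lines. The function name `score_test`, the "left"/"right" encoding of responses, and the three-item answer sheet are all assumptions made for illustration; the actual instrument had 18 items and is not reproduced here.

```python
from typing import Sequence

def score_test(crossed_out: Sequence[str], ac_position: Sequence[str]) -> int:
    """Count the items on which the respondent crossed out the AC pair.

    crossed_out[i] is "left" or "right": the pair the respondent struck in item i.
    ac_position[i] is "left" or "right": where the AC pair was printed in item i.
    """
    if len(crossed_out) != len(ac_position):
        raise ValueError("one response is required per item")
    return sum(c == p for c, p in zip(crossed_out, ac_position))

# Hypothetical three-item answer sheet (the real instrument had 18 items).
ac_position = ["right", "left", "right"]
responses = ["right", "right", "right"]
print(score_test(responses, ac_position))  # 2
```

Because the position of the AC pair was randomized across items, the key has to be stored per item rather than assumed to be a fixed side.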

In order to obtain a more precise estimate of the 
trait discrepancies of the AB and AC pairs, a second 
group of 31 students was asked to rate the traits on two 
continua designed to represent the evaluative dimen- 
sion. These two continua were defined by the polar 
terms: good-bad and worthless-valuable.³ The 31
judges indicated the position of each trait on each
continuum by placing a check mark on a seven-point
graphic rating scale, a technique copied from Osgood's⁴
Semantic Differential. In computing the judge's rating
of each trait, the seven positions along the two continua 
were assigned numerical values ranging from 1 (for the 
bad or worthless end of the continuum) to 7. The mean 
of the ratings given to each trait on these two continua 
was taken to represent the individual judge's rating of 
the trait. The mean of the ratings given to each trait by
all 31 judges was used to represent the position of the
trait along the evaluative dimension. (The standard
deviations of the judges’ ratings ranged from .70 to 1.36.)
When these mean ratings were assigned to the traits
used in the 18 test items, each item was found to con-
form to the criterion; in each item the AB trait discrep-
ancy was smaller than the AC trait discrepancy.

¹ This instrument and a table showing the trait
ratings and trait discrepancies of the test items used in
the instrument have been deposited with the American
Documentation Institute. Order Document No. 3983
from the ADI Auxiliary Publications Project, Photo-
duplication Service, Library of Congress, Washington
25, D. C., remitting in advance $1.25 for microfilm or
$1.25 for photocopies. Make checks payable to Chief,
Photoduplication Service, Library of Congress.

² The writer is indebted to Basil Zimmer of Florida
State University for cooperation in gathering these data.

³ These two continua were selected to represent the
evaluative dimension because Osgood (see footnote 4)
had previously found them to have factor loadings of
.88 and .79 on what appeared to be an evaluative factor.
The judges also rated each trait against two continua
representing the “activity” dimension (active-passive
and slow-fast) and two continua representing the
“potency” dimension (strong-weak and small-large).
Because the trait ratings on these two dimensions cor-
related highly (rho = .80 and .92) with ratings on the
evaluative dimension, the activity and potency di-
mensions will not be treated here as independent
dimensions.

⁴ Osgood, C. E. Personal communication, December
1952.


RESULTS 


As a test of hypothesis 1, Ss whose E scores
fell in the top quarter of the distribution of E
scores were compared with Ss whose E scores
fell in the bottom quarter of the distribution.
The Ss whose E scores fell in the top quarter
crossed out a mean of 8.69 AC pairs. The Ss
whose E scores fell in the bottom quarter
crossed out a mean of 6.85 AC pairs (t =
2.15). Because both of the hypotheses tested
in this research were formulated before the
data were gathered, only one tail of the t
distribution was used in computing probabili-
ties. The probability corresponding to a t of
2.15 is less than .03. High E’s crossed out sig-
nificantly more AC pairs than did low E’s.

Hypothesis 2 was tested by correlating the
amount of difference between the trait dis-
crepancies of the two pairs in each item with
the numerical excess of high E’s over low E’s
who crossed out the AC pair in that item. This
product-moment correlation was +.57 (p <
.01). Items in which there is a large difference
between the AB and AC trait discrepancies
tend to discriminate best between high and
low ethnocentrics.
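The two statistics used here can be sketched with the Python standard library. The numbers below are invented and will not reproduce the reported values of 2.15 and +.57. The source does not say whether a pooled-variance t was used, so `pooled_t` is an assumption, and both function names are hypothetical.

```python
from math import sqrt
from statistics import mean, variance

def pooled_t(x, y):
    """Student's t for two independent groups, assuming equal variances."""
    nx, ny = len(x), len(y)
    pooled_var = ((nx - 1) * variance(x) + (ny - 1) * variance(y)) / (nx + ny - 2)
    return (mean(x) - mean(y)) / sqrt(pooled_var * (1 / nx + 1 / ny))

def pearson_r(x, y):
    """Product-moment correlation between two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

# Invented AC-pair scores for a few high-E and low-E respondents.
high_e = [9, 8, 10, 7, 9]
low_e = [7, 6, 8, 6, 7]
t_value = pooled_t(high_e, low_e)

# Invented item-level data: difference between the AC and AB trait
# discrepancies, and the excess of high E's over low E's crossing out AC.
discrepancy_gap = [0.5, 1.0, 1.5, 2.0, 2.5]
excess_high = [0, 1, 1, 3, 2]
r_value = pearson_r(discrepancy_gap, excess_high)
print(round(t_value, 2), round(r_value, 2))
```

The first statistic compares respondents (hypothesis 1), while the second treats each test item as the unit of analysis (hypothesis 2).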


DISCUSSION


Confirmation of these two hypotheses of-
fers support for the contention that high and
low ethnocentrics do differ in the assumptions
they make concerning the co-occurrence of
human traits. Furthermore, the differences be-
tween the assumptions made by high and low
ethnocentrics appear to be systematically
related to a difference in sensitivity to trait
discrepancies. High ethnocentrics tolerate less
[discrepancy] between their value systems and
their perceptual assumptions.

It is necessary to consider certain alterna-
tive explanations of the data. The test items
are similar in form to items included in many
intelligence tests. Adorno et al. (1, ch. 8), and
O'Connor (6) have obtained moderate nega-
tive correlations between indicators of intel-
ligence and E-scale scores. Therefore, it is
possible that the greater tendency of high
E's to cross out the AC pairs is somehow a 
function of their lower intelligence. However, 
in the present sample, the rank-order correla- 
tion between college grades, as indicators of 
intelligence, and the number of AC pairs 
crossed out was only +.05. Another test of 
this alternative explanation of the data was 
provided by a comparison of the mean number 
of AC pairs crossed out by students with 
high, and with low, academic averages. The 
16 students with academic averages of B-
or above crossed out a mean of 8.38 AC pairs,
and the 12 students with averages of C- or
below crossed out a mean of 8.58 AC pairs.
The t value for this difference between means
is .26 (p > .70). In view of these findings an
explanation based on the intelligence factor
seems untenable.

Another explanation would hold that the 
high E's differ from low E's in the evaluative 
connotations they give to the trait names used 
in this study. If this were so, the differences 
between the trait discrepancies of the AB and 
AC pairs might not be the same for high E's 
as they are for low E’s. High E's might then 
have crossed out more AC pairs simply be- 
cause the differences between the AB and AC 
trait discrepancies were larger for them than 
they were for low E's. In order to test this 
possibility, the mean trait ratings given by 
high-E judges were computed separately from 
those given by low-E judges. The difference 
between the trait discrepancies of the AB and 
AC pairs in each item was then computed 
under two conditions: (a) the traits carry the 
mean rating assigned by high-E judges (Con-
dition 1), and (b) the traits carry the mean
ratings assigned by low-E judges (Condition
2). For the 18 test items, the mean difference
between the trait discrepancies of the AB and
AC pairs was only .09 larger under Condition
1 than under Condition 2 (p > .80). As a
further test of this alternative explanation, a 
computation was made of the amount by 
which the difference between the trait dis- 
crepancies of the pairs in each item was larger 
under Condition 1 than under Condition 2. 
These indexes were computed separately for 
each item and then correlated with the pre- 
viously described measure of the ability of the 
item to discriminate between high- and low-E 
scorers (i.e., number of high E’s who crossed
out the AC pair minus the number of low E's
who crossed out the AC pair). The obtained
product-moment correlation was -.22 (p >
.30). Not only does this correlation fail to
reach statistical significance, but it is in the
direction opposite to that demanded by this
alternative explanation. In view of these 
findings, this explanation seems untenable. 


SUMMARY 


This study has investigated the assumptions 
which two kinds of people make concerning the 
likelihood that various human traits will occur 
together. Interest in the problem derives from 
a theoretical orientation which maintains 
that such assumptions lead to selective per- 
ceptions and memories, and hence exert an 
effect on the percepts which an individual 
forms of other people. If this orientation is 
sound, knowledge of an individual’s percepts 
should permit predictions about the assump-
tions which he employs in forming concepts.

Because other researchers have found that 
high and low ethnocentrics differ in their 
percepts of people, it was inferred in this 
study that they also differ in the assumptions 
they make concerning the interrelationships 
of human traits. It was postulated that when 
two human traits are considered to be about 
equally desirable, high ethnocentrics will 
assume that these traits have a high probability 
of occurring together. If two traits are unequal 
in desirability, high ethnocentrics are expected 
to assume that these traits are unlikely to 
occur together in other people. In the case of 
low ethnocentrics, it was postulated that the 
relative desirability of two traits would have 
less effect on the assumed probability of their 
co-occurrence. 

Two hypotheses were derived and tested. As 
a result of these tests, the null forms of the 
hypotheses were rejected at the .03 and .01 
levels. High and low ethnocentrics do make 
different assumptions concerning the likeli-
hood that various pairs of human traits will
co-occur. Furthermore, the differences between
the assumptions made by these two kinds of 
perceivers are in the predicted direction: the 
assumptions which high ethnocentrics make 
are more closely related to their appraisals of 
the desirability of the traits. 

Although the data in this study have dealt 
exclusively with differences between the 
perceptual assumptions made by high and low 
ethnocentric perceivers, the problem under 
investigation has a wider scope. In its broader 
implications, the problem is one of determining 
the effect which perceptual assumptions may 
have upon percepts, and ultimately upon the 
behavior of interacting persons. The present 
study has been successful in using other re- 
searchers’ findings concerning percepts as a 
basis for predicting perceptual assumptions. 
Additional research is needed to determine 
the extent to which percepts and behaviors 
are a consequence of perceptual assumptions. 


REFERENCES 


1. ADORNO, T. W., FRENKEL-BRUNSWIK, ELSE, LEVINSON, D. J., & SANFORD, R. N. The authoritarian personality. New York: Harper, 1950.

2. ALLPORT, G. W., & POSTMAN, L. J. The basic psychology of rumor. In T. M. Newcomb & E. L. Hartley (Eds.), Readings in social psychology. New York: Henry Holt, 1947. Pp. 547-558.

3. BARTLETT, F. C. Remembering. Cambridge, Eng.: Cambridge Univer. Press, 1932.

4. FRENKEL-BRUNSWIK, ELSE. Intolerance of ambiguity as an emotional and perceptual personality variable. J. Pers., 1949, 18, 108-143.

5. HOROWITZ, E. L., & HOROWITZ, R. E. Development of social attitudes in children. Sociometry, 1939, 4, 301-338.

6. O’CONNOR, P. Ethnocentrism, “intolerance of ambiguity” and abstract reasoning ability. J. abnorm. soc. Psychol., 1952, 47, 526-530.

7. ROKEACH, M. Generalized mental rigidity as a factor in ethnocentrism. J. abnorm. soc. Psychol., 1948, 43, 259-278.

8. ZILLIG, M. Einstellung und Aussage. Z. Psychol., 1928, 106, 58-106.


Received March 23, 1953. 




THE EFFECT OF STUTTERING ON THE BEHAVIOR OF THE
LISTENER¹


SEYMOUR ROSENBERG
Human Resources Research Center, Randolph Air
Force Base, Texas

AND

JAMES CURTISS²
Pennsylvania State University


¹ We wish to express our appreciation to Professor
Robert L. Milisen for his suggestions and criticisms in
the design and performance of this study and to
Gerald Waite and Norman Willy for their par-
ticipation as experimenters. The authors assume full
responsibility for the statements and conclusions in the
report.

² Both authors were formerly at Indiana University,
where the study was performed.


THE severity of stuttering behavior has
been shown to vary as a function of a
number of conditions. In a recent re-
view of the literature, Bloodstein (2) has at-
tempted to order the rather heterogeneous list
of conditions which have been found to affect
stuttering. He classified these conditions into
six general types. One of his descriptive classes
of interest for the present study is the behavior
of the listener. Studies summarized under this
class are predominantly concerned with the
status of the listener as a friend, stranger, mem-
ber of the family, etc., rather than the behavior
of the listener during stuttering. Bloodstein
noted, “The kinds of listener behavior which
tend to facilitate the stutterer’s speech do not
appear to be specifically defined in the litera-
ture” (2, p. 296). Reports obtained from stut-
terers by personal account and by investigators
(3, 6, 7) seem to indicate that the “stutterer
speaks best when there seems to be no reaction
to his stuttering at all” (2, p. 297).

But what are the listener reactions which
are peculiar to stuttering? Before attempting
an experimental answer to the problem of the
effect of the behavior of the listener on the
severity of stuttering, we must have rather
clear descriptions of what the listener does.
Descriptions by the stutterer of listener be-
havior are complicated by the fact that the
stutterer is so intimately involved in the speak-
ing situation that his report may differ con-
siderably from an objective analysis of the
speech situation.

A review of the literature reflects a paucity of
objective descriptions of listener reactions to
stuttering speech. A study by Ainsworth (1)
may be considered as somewhat related to this
problem. He investigated the breathing of
listeners exposed to stuttering. His listeners
included normal speakers and stutterers. The 
experiment was apparently not designed to 
throw light directly on the problem posed here 
in view of the exclusion of normal speech as a 
stimulus condition. A more relevant study is 
reported by Van Riper (10). He studied the 
effect of “hearing and seeing” stuttering upon 
stutterers’ speech. Normal speech was included 
as a control condition. Results indicated that 
the subjects demonstrated more interruptions 
following the stuttering condition. All subjects 
were stutterers, however. 

The present experiment is an objective study 
of some of the listener responses which may be 
influenced by stuttering. The listeners were all 
normal speakers. Listener responses chosen by 
the authors to be studied were (a) eye contact,
(b) hand movement, and (c) other bodily
movement during “stuttering”³ and during
normal speech.


METHOD 


Subjects. Twenty college students from an introduc- 
tory psychology class at Indiana University were used 
as Ss in the study. The Ss signed up voluntarily al- 
though they were required to serve as Ss in at least 
three psychological experiments as part of the course 
requirements. There were 13 males and 7 females with 
an age range from 17.0 years to 40.5 years, mean age 
24.3 years. The study was defined for them as a psy- 
chological experiment, with special care being taken 
to give no cues of the fact that their behavior was being 
observed in a speech situation. The experiment was 
performed in the Psychological Clinic. 

Procedure. At the time of the appointment, one of 
the persons who also served as one of the Os met the S 
and escorted him to a "waiting room." The S was told, 
“Will you please be seated in here? We will call you in 
a few minutes. Please stay in the room because we are 
behind schedule." One of the Es was already seated in 
the room and was posed as an S who was also presum- 
ably waiting to be called for the experiment. To this 
E, O said briefly, "We'll be ready for you in a few 
minutes." The door was then closed. The E started a 
conversation with S. At the end of five minutes of 


³ The “stutterers” were normal speakers who faked
stuttering during the experiment. Thus variability in 
frequency, duration, and severity could be more ac- 
curately controlled than would be the case if the Es 
had been actual stutterers. 




TABLE 1
SUMMARY OF THE PROCEDURE

SUBJECTS               | FIRST FIVE MINUTES | SECOND FIVE MINUTES
Nos. 1, 5, 7, 10, 12   | E #1, Stuttering   | E #2, Normal
Nos. 2, 11, 13, 14, 18 | E #1, Normal       | E #2, Stuttering
Nos. 4, 8, 9, 15, 17   | E #2, Stuttering   | E #1, Normal
Nos. 3, 6, 16, 19, 20  | E #2, Normal       | E #1, Stuttering


observation of the S through a half-silvered mirror, 
O returned and said to E, “We are ready for you now.”
To S he said, “We will be ready for you in a few min- 
utes.” As O started to close the door, a second E walked 
up and inquired about being an S for a psychology 
experiment. The O said to this second E, “Will you 
come in here and wait a few minutes?” The door was 
closed and a second five-minute observation was begun 
during which the second E talked to the same S. 
Finally O re-entered and said to S, “O.K., will you come 
with me?” 

The S was taken to another room where the “psy- 
chology experiment" was begun. This was a brief 
period in which various things were done with different 
Ss as part of pilot studies for other experiments or 
simply to disguise the fact that the experiment had 
actually been performed during the so-called “waiting” 
period. This was done to maintain the naiveté of 
future Ss. 

The behavior of each S was observed under two 
conditions. These conditions were “stuttering” and 
normal speech. The order of the two conditions and of 
the two Es was counterbalanced. Thus each E stuttered
the first five minutes with five Ss, stuttered the second
five minutes with five other Ss, spoke normally the
first five minutes with a third set of five Ss, and spoke
normally the second five minutes with a fourth set of 
five Ss. With these restrictions imposed, each S was 
randomly assigned to one of the four possible combina- 
tions. 

Table 1 is a summary of the counterbalancing pro- 
cedure. The Ss shown in the table were numbered in 
the order in which they participated in the experi- 
ment. 
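
The counterbalancing scheme summarized in Table 1 can be illustrated with a short sketch. The Python below is not part of the original study. It assumes a simple shuffled-pool scheme, since the report does not say how the random assignment was actually carried out, and the names `COMBINATIONS` and `assign_subjects` are hypothetical.

```python
import random
from collections import Counter

# The four order-by-experimenter combinations listed in Table 1.
# Each entry is (first five minutes, second five minutes).
COMBINATIONS = [
    (("E1", "stuttering"), ("E2", "normal")),
    (("E1", "normal"), ("E2", "stuttering")),
    (("E2", "stuttering"), ("E1", "normal")),
    (("E2", "normal"), ("E1", "stuttering")),
]


def assign_subjects(n_subjects=20, seed=None):
    """Randomly assign subjects to the four combinations, equal cell sizes.

    Hypothetical helper: an assumed procedure, not the authors' own.
    """
    if n_subjects % len(COMBINATIONS) != 0:
        raise ValueError("n_subjects must be a multiple of 4")
    per_cell = n_subjects // len(COMBINATIONS)
    # Each combination index appears per_cell times; shuffling the pool
    # randomizes which arriving subject receives which combination.
    pool = [i for i in range(len(COMBINATIONS)) for _ in range(per_cell)]
    rng = random.Random(seed)
    rng.shuffle(pool)
    return {subject: COMBINATIONS[idx]
            for subject, idx in enumerate(pool, start=1)}


schedule = assign_subjects(seed=1954)
cell_sizes = Counter(schedule.values())
```

Under this scheme every combination receives exactly five Ss, so each E stutters first with five Ss and second with five others, which is the restriction the text describes.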

Observation and recording of behavior. The waiting
room was approximately 7 x 7.5 feet and was arranged
so that two comfortable chairs were placed in opposite
corners of the room as indicated in Fig. 1. The window
(W) in the wall to the right of the S's chair looked out
on a path in front of the building with a large area of
trees beyond the path. The half-silvered mirror (M)
was on the adjacent wall almost directly in front of the
S.

Two Os looked at the S through the mirror from a 
dark room next to the "waiting" room. Each O was 
provided with three electrical leaf switches mounted on 
a board in front of them. The switches were operated 
by the Os' fingers and corresponded to (a) loss of eye 


FIG. 1. SKETCH OF THE “WAITING” ROOM AND ADJACENT OBSERVATION ROOM


contact from the E, (b) hand and/or arm movement, 
and (c) other body movement. These circuit makes and 
breaks were recorded on an Esterline-Angus Operations 
Recorder. In order to insure that the Os would not 
respond to each other's judgments, a screen (Sc) was 
placed between the two sets of leaf switches. As an 
additional precaution against one O being influenced 
by the judgments of the second, the recorder was placed 
in a third room so as to minimize the cues provided 
by the relay clicks. A final precaution was taken to pre- 
vent the Os from hearing the clicks and/or hearing the 
conversation in the waiting room by a masking tone
delivered through head sets provided for each O.
Neither O knew which condition was being executed at
any time since they had no schedule of the random assign-
ment, nor could they see most of the E's body. Figure 1
indicates how part of the mirror was blocked out in
the observation room but not in the waiting room.
The conversation was recorded on a tape recorder
placed in the Os’ room with a hidden microphone in
the waiting room.

“Stuttering.” An attempt was made previous to
the experiment to standardize the type and frequency
of stuttering and the material discussed in the speaking
situation. The stuttering was faked by two graduate
students in speech. The stuttering was repetitious and
an attempt was made to keep head and body move-
ments to a minimum during a block. Eye contact was
maintained during the interruption. There was an
attempt by both Es to maintain eye contact at a con-
stant frequency and duration during the “stuttering”
and “normal” conditions. The first interruption was a
severe one and occurred on the first word spoken after
E and S met in the room. A total of 15 interruptions
was faked throughout the five-minute period. In order
to control as much as possible for conversational con-
tent in the situation, a list of topics was selected by
the Es which included the following: the weather, cur-
rent events, sports, school major, and “what do you
think the experiment will be about?” These topics oc-
curred fortuitously in the two conditions since the Es
did not have complete control over the topics selected
by a particular S.

Responses. The two Os practiced before the experi-
ment to a point where fair agreement as to what con-
stituted a response was reached. Judgments of each
response class were based on the following definitions: 

1. “Eye contact loss" was defined as an eye move- 
ment (which may or may not have included the head) 
in any direction away from the E's approximate head 
position in the room. The total frequency with which
an eye contact loss was initiated and total duration of 
these losses in seconds in each five-minute period were 
obtained from the operations recorder for each O. 

2. “Hand movement” was defined as any movement
of the arm and/or hand excluding single finger tappings. 
(Single finger movements were usually of such small 
magnitude as to produce wide disagreement between 
observers. This was found prior to the experiment and 
it was decided to omit the recording of such movements 
entirely.) Total frequency with which a hand move- 
ment was initiated and total duration of movement 
were obtained as measurements from each O in each 
condition. (Smoking was discouraged in the waiting 
room with a large “No Smoking” sign.) 

3. "Other bodily movement” included any observ- 
able movement of the body other than head, eye, hand, 
and arm movements. Again, frequency and duration 
measurements were obtained.

4. "Speech." After the experiment was completed, 
both Os listened to the tape recordings and measured 
the total speech time of both E and S together in each 
five-minute period.⁴ An electric chronoscope, operated
by a telegraph key, was used by each O. 
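
The frequency and duration measures defined above can be illustrated with a small sketch. It assumes that each switch closure on the operations recorder has already been transcribed as an onset and offset time in seconds; that representation, the function name `summarize_switch_record`, and the sample record are hypothetical and are not taken from the study.

```python
def summarize_switch_record(closures, period_s=300.0):
    """Return (frequency, total_duration_s) for one response class.

    `closures` is a list of (onset_s, offset_s) pairs, one per switch
    closure, measured from the start of the five-minute period.
    """
    frequency = 0
    total_duration = 0.0
    for onset, offset in closures:
        if onset < 0 or offset < onset:
            raise ValueError("each closure needs 0 <= onset <= offset")
        if onset >= period_s:
            continue  # initiated after the observation period ended
        frequency += 1  # one initiation of the response
        # Clip a closure that runs past the end of the period.
        total_duration += min(offset, period_s) - onset
    return frequency, total_duration


# Invented eye-contact-loss record for one S in one condition.
eye_contact_loss = [(12.0, 15.5), (40.0, 41.0), (290.0, 310.0)]
frequency, duration = summarize_switch_record(eye_contact_loss)
print(frequency, duration)  # prints: 3 14.5
```

Keeping frequency and duration separate matters for the later analysis, because a response can become less frequent while its total duration increases.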


RESULTS 


The total length of time (in seconds) with
which each response occurred during a five-
minute period was obtained for each S. The
mean and standard deviation of each response
based on twenty Ss are shown in Table 2.
Separate values are shown for each condition
and for each observer. The values in the table
show the mean length of time that the Ss (a)
looked away from the E, (b) performed hand
movements, and (c) performed other bodily
movements. The response class “total speech
of E and S” refers to the total time spent by
both E and S in talking to each other during
the five-minute period.

Table 3 is a summary of the frequency data
for each response. The measurement obtained
for each is the total number of times each of
the three responses was initiated during a five-
minute period. For example, “number of eye
contact losses” refers to the total number of
times an O noted that an S looked away from
the E during a particular speech condition.

The seven response measurements presented
⁴ Originally, the authors had intended to measure
the speech of the E and the S separately as well as
sentence completion by the S. Owing to the
poor quality of the recordings, this was not possible.




TABLE 2

MEAN AND SD OF TOTAL DURATION (SECONDS) OF
FOUR RESPONSE CLASSES UNDER EACH
CONDITION FROM EACH OBSERVER
(N = 20)

                         |      |     OBSERVER #1     |     OBSERVER #2
RESPONSE CLASS           |      | NORMAL | STUTTERING | NORMAL | STUTTERING
Loss of eye contact by S | Mean |  91.43 |     105.20 |  95.40 |     110.05
                         | SD   |  46.28 |      58.47 |  42.13 |      55.05
Hand movement of S       | Mean | 126.18 |     126.55 | 141.10 |     138.75
                         | SD   |  70.78 |      67.28 |  66.32 |      62.05
Other body movement of S | Mean |  17.65 |       6.43 |  25.15 |      15.35
                         | SD   |  21.51 |       9.65 |  27.78 |      15.98
Total speech of E and S  | Mean | 270.30 |     247.54 | 273.21 |     252.29
                         | SD   |  18.83 |      24.72 |  17.17 |      22.86

TABLE 3

MEAN AND SD OF TOTAL FREQUENCY OF THREE
RESPONSE CLASSES UNDER EACH CONDITION
FROM EACH OBSERVER
(N = 20)

                                           |      |     OBSERVER #1     |     OBSERVER #2
RESPONSE CLASS                             |      | NORMAL | STUTTERING | NORMAL | STUTTERING
No. of eye contact losses                  | Mean |  27.85 |      23.85 |  23.50 |      20.95
                                           | SD   |   8.97 |       7.54 |   7.55 |       9.35
No. of initiations of hand movements       | Mean |  21.95 |      18.55 |  18.85 |      17.15
                                           | SD   |   7.72 |       7.10 |   6.09 |       7.32
No. of initiations of other body movements | Mean |   8.30 |       4.60 |   7.50 |       5.65
                                           | SD   |   8.02 |       5.19 |   7.55 |       5.22


in Tables 2 and 3 were subjected to similar 
statistical designs. Separate analyses of vari-
ance were computed for each response. The
data were first classified according to three 
variables, i.e., subjects, conditions, and ob- 
servers. Thus, each cell in such a design con- 
tained one measurement of a particular S 
under one of two conditions by one of the 
two Os. For each response, variance estimates 
were obtained for the three variables, the 
first-order interactions and the second-order 
(triple) interactions. With these variance esti- 
mates, the reliability of Os as well as differ- 
ences between conditions were evaluated. 
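
The design just described (subjects by conditions by observers, one measurement per cell, with the triple interaction as error) can be sketched as follows. This is an illustrative reconstruction under stated assumptions, not the authors' computation: the function name `three_way_f_ratios` is hypothetical, and the data are synthetic numbers invented for the example.

```python
import random
from itertools import product


def three_way_f_ratios(y):
    """F ratios for a subjects x conditions x observers layout.

    y[s][c][o] is the single measurement for subject s under condition c
    as recorded by observer o. Every F uses the triple interaction as
    its error term.
    """
    S, C, O = range(len(y)), range(len(y[0])), range(len(y[0][0]))
    n_s, n_c, n_o = len(S), len(C), len(O)

    def mean(values):
        values = list(values)
        return sum(values) / len(values)

    gm = mean(y[s][c][o] for s, c, o in product(S, C, O))
    m_s = [mean(y[s][c][o] for c in C for o in O) for s in S]
    m_c = [mean(y[s][c][o] for s in S for o in O) for c in C]
    m_o = [mean(y[s][c][o] for s in S for c in C) for o in O]
    m_sc = [[mean(y[s][c][o] for o in O) for c in C] for s in S]
    m_so = [[mean(y[s][c][o] for c in C) for o in O] for s in S]
    m_co = [[mean(y[s][c][o] for s in S) for o in O] for c in C]

    ss = {
        "conditions": n_s * n_o * sum((m_c[c] - gm) ** 2 for c in C),
        "O x S": n_c * sum((m_so[s][o] - m_s[s] - m_o[o] + gm) ** 2
                           for s in S for o in O),
        "O x C": n_s * sum((m_co[c][o] - m_c[c] - m_o[o] + gm) ** 2
                           for c in C for o in O),
        "S x C x O": sum((y[s][c][o] - m_sc[s][c] - m_so[s][o] - m_co[c][o]
                          + m_s[s] + m_c[c] + m_o[o] - gm) ** 2
                         for s, c, o in product(S, C, O)),
    }
    df = {
        "conditions": n_c - 1,
        "O x S": (n_o - 1) * (n_s - 1),
        "O x C": (n_o - 1) * (n_c - 1),
        "S x C x O": (n_s - 1) * (n_c - 1) * (n_o - 1),
    }
    error_ms = ss["S x C x O"] / df["S x C x O"]
    f = {name: (ss[name] / df[name]) / error_ms
         for name in ("conditions", "O x S", "O x C")}
    return f, df


# Synthetic data only: 20 Ss, 2 conditions, 2 Os. Not the study's scores.
rng = random.Random(0)
data = []
for _ in range(20):
    level = rng.gauss(100.0, 40.0)
    data.append([[level + 12.0 * c + 4.0 * o + rng.gauss(0.0, 5.0)
                  for o in range(2)] for c in range(2)])

f_ratios, df = three_way_f_ratios(data)
```

With 20 Ss, two conditions, and two Os, the degrees of freedom come out as 19 and 19 for the O by S interaction and 1 and 19 for the O by condition interaction and for conditions, which agrees with the df reported in Tables 4 and 5.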
Reliability. Several methods of estimating O 
agreement are available from the statistical 
design. For example, we may test the difference 
between O means. This type of reliability 
simply requires that the average judgments of 
each O be the same. It would permit consider- 
able inconsistencies between the Os from one 




S to the next as long as these inconsistencies 
counteracted each other. A more conservative 
requirement of reliability is that the Os pro- 
vide the same relative measurement for the Ss. 
The latter type of reliability allows for a differ- 
ence between Os as long as this difference is 
consistent for all Ss and/or for all conditions. 
The most conservative reliability requirement 
is a combination of both tests, i.e., the same 
absolute measurements from both Os for all Ss 
and/or all conditions. Conventional use of the
correlation coefficient provides an estimate of
relative reliability only. 

For the purposes of the present experiment, 
computations of relative reliability estimates 
were considered adequate. One such estimate 
of reliability can be made by testing the con- 
sistency of Os’ behavior with different Ss. The 
variance attributable to interaction of Os with 
S was compared with the triple interaction 
(error term) in an F ratio. With the use of 
the variance estimates in this way, reliability 
is suggested when the F is insignificant, i.e., 
when Os are consistent with Ss.⁵ The F values
for the seven response measures are shown in 
the first column of Table 4. The only F ratio 
which is significant at the .05 level (indicating 
lack of reliability) is the one computed for 
“hand movement duration." 

A second determination of relative reliability 
may be obtained from the variance due to 
interaction of Os with conditions. The question 
here is whether the Os behave with relative 
consistency under the two conditions. This 
first-order interaction was also tested with the 
triple interaction as the error term. The result- 
ing F’s are shown in the second column of 
Table 4. None of these F’s is significant.

The results of all these statistical tests sug-
gest that Os were relatively consistent in their
observations of different Ss under the two con- 
ditions. The single exception, duration of hand 
movement, is noted above. 


Main effects. The variable of primary interest
in the study is the speech condition. In order
to evaluate the effect of the two conditions on 


⁵ It may be argued that this statement of reliability
is severely limited because it is based on failure to re-
ject the null hypothesis. In the present instance, this
statistical limitation is not serious since the main pur-
pose of computing reliability here is to determine
whether O's inconsistency is significant enough to be
considered in choosing an error term for testing main
effects.




TABLE 4

F VALUES OBTAINED FROM ANALYSES OF VARIANCE
ON EACH OF THE SEVEN RESPONSE MEASURES
FOR ESTIMATING “RELIABILITY”

                          | F FOR INTERACTION | F FOR INTERACTION
                          | OF Os WITH Ss     | OF Os WITH CONDITIONS
RESPONSE CLASS            | (df 19, 19)       | (df 1, 19)
Duration eye contact loss | 2.11              | 0.05
Duration hand movement    | 2.68*             | 0.18
Duration body movement    | 0.91              | 0.14
Duration total speech     | 1.60              | 1.11
No. eye contact losses    | 1.19              | 1.09
No. hand movements        | 1.49              | 1.48
No. body movements        | [illegible]       | 4.31

* p < .05.

Note. F values whose p < .05 indicate lack of reliability.


each of the responses, the variance between
conditions was tested against the triple inter-
action.⁶ The “unadjusted” F values in the
first column of Table 5 were computed in this
way. (“Adjusted” F’s will be explained below.)
The unadjusted F values fail to approach
significance with only one of the response
classes, viz., the duration of time which the
listener “moves his hands.” The other re-
sponses are significant at the levels indicated
in the tables. From these results we get a
picture of the listener during normal speech
and during stuttering. He loses eye contact
with the stutterer for a longer period of time
than with the normal speaker. The intervals
of loss of eye contact are further exaggerated
by the fact that the number of times the S
looks away is less with the stutterer than with
the normal speaker. In other words, during
stuttering, loss of eye contact by the listener is
maintained for longer continuous intervals, and
for a longer total duration. The amount of
hand movement does not change with a change


⁶ Choice of triple interaction as the error term was
guided by the fact that O by condition interaction
failed to approach significance. This does not exclude
the possibility that S by condition interaction may ac-
count for some of the variability between conditions.
However, this interaction contains variance because of
counterbalancing the order of conditions and Es, so
that its use as an error term was considered too severe.


TABLE 5

F RATIOS BETWEEN SPEECH CONDITIONS OBTAINED
FROM ANALYSES OF VARIANCE AND COVARIANCE
OF THE RESPONSES

                          | UNADJUSTED F | ADJUSTED F
RESPONSE CLASS            | (df 1, 19)   | (df 1, 18)
Duration eye contact loss | 54.27***     | 0.46
Duration hand movement    | 0.09         | 0.41
Duration body movement    | 29.46**      | 0.02
Duration total speech     | 624.18***    | -
No. eye contact losses    | 23.47***     | 7.24*
No. hand movements        | 13.73**      | 0.00
No. body movements        | 39.00***     | 0.00

* p < .05.
** p < .01.
*** p < .001.


in condition but the frequency with which the
listener will initiate a hand movement de-
creases during stuttering. Other bodily move-
ment is significantly depressed during stutter-
ing both in duration and in the number of
times a movement is initiated. The listener
starts a “bodily movement” less frequently
and for a shorter interval during stuttering.
Finally, the total amount of conversation
during stuttering is depressed. Whether this
depression of speech is due to the S, the E, or
both was not measured in this study.

The purpose of making the latter measure-
ment, i.e., total duration of speech, was to
demonstrate what differences would exist if
the total speech time could be held constant.
The adjusted F's in Table 5 were computed
by employing this measurement as the “un-
controlled variable” in a covariance design
(9). The results indicate that differences be-
tween conditions remain with only one meas-
urement, viz., number of eye contact losses.
In other words, if there had been the same
amount of speech in each condition, frequency
of eye contact loss alone would remain signifi-
cantly different between conditions.
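
The covariance adjustment can be sketched in the same spirit. The report does not spell out the computation, so the code below assumes the classical single-covariate adjustment of sums of squares and products, applied to the conditions effect with the triple interaction as error. The function names and the synthetic data are hypothetical.

```python
import random
from itertools import product


def effect_deviations(v):
    """Per-cell condition-effect and triple-interaction deviations."""
    S, C, O = range(len(v)), range(len(v[0])), range(len(v[0][0]))

    def mean(values):
        values = list(values)
        return sum(values) / len(values)

    gm = mean(v[s][c][o] for s, c, o in product(S, C, O))
    m_s = [mean(v[s][c][o] for c in C for o in O) for s in S]
    m_c = [mean(v[s][c][o] for s in S for o in O) for c in C]
    m_o = [mean(v[s][c][o] for s in S for c in C) for o in O]
    m_sc = [[mean(v[s][c][o] for o in O) for c in C] for s in S]
    m_so = [[mean(v[s][c][o] for c in C) for o in O] for s in S]
    m_co = [[mean(v[s][c][o] for s in S) for o in O] for c in C]
    # One condition deviation per cell, so sums over cells carry the
    # usual (subjects x observers) weighting automatically.
    cond = [m_c[c] - gm for s, c, o in product(S, C, O)]
    resid = [v[s][c][o] - m_sc[s][c] - m_so[s][o] - m_co[c][o]
             + m_s[s] + m_c[c] + m_o[o] - gm
             for s, c, o in product(S, C, O)]
    return cond, resid


def adjusted_condition_f(x, y):
    """Covariance-adjusted F for conditions, with x as the covariate."""
    cond_x, res_x = effect_deviations(x)
    cond_y, res_y = effect_deviations(y)

    def sp(a, b):  # sum of products
        return sum(p * q for p, q in zip(a, b))

    t_xx, t_xy, t_yy = sp(cond_x, cond_x), sp(cond_x, cond_y), sp(cond_y, cond_y)
    e_xx, e_xy, e_yy = sp(res_x, res_x), sp(res_x, res_y), sp(res_y, res_y)

    n_s, n_c, n_o = len(y), len(y[0]), len(y[0][0])
    df_cond = n_c - 1
    # One error df is used up by estimating the regression slope.
    df_error = (n_s - 1) * (n_c - 1) * (n_o - 1) - 1
    ss_error = e_yy - e_xy ** 2 / e_xx
    ss_cond_plus_error = (t_yy + e_yy) - (t_xy + e_xy) ** 2 / (t_xx + e_xx)
    ss_cond = ss_cond_plus_error - ss_error
    return (ss_cond / df_cond) / (ss_error / df_error), (df_cond, df_error)


# Synthetic data only: speech time (covariate) and one response measure.
rng = random.Random(1)
speech, response = [], []
for _ in range(20):
    base = rng.gauss(0.0, 10.0)
    xs = [[265.0 - 20.0 * c + base + rng.gauss(0.0, 8.0) for o in range(2)]
          for c in range(2)]
    ys = [[0.3 * xs[c][o] + rng.gauss(0.0, 3.0) for o in range(2)]
          for c in range(2)]
    speech.append(xs)
    response.append(ys)

f_adjusted, dfs = adjusted_condition_f(speech, response)
```

For 20 Ss, two conditions, and two Os this gives 1 and 18 degrees of freedom, matching the df listed for the adjusted F's in Table 5. Whether the authors handled the regression in exactly this way is an assumption.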


DISCUSSION


The results of this study demonstrate that
listener responses are altered significantly by
stuttering. The responses studied were chosen
since they constitute a part of many stutterers'
complaints about their audience. More subtle 
and complex listener reactions are also re- 
ported by the stutterer as “unfavorable” 
listener behavior. Self-descriptions of these 
reactions indicate "impatience, amusement, 
embarrassment, excessive solicitude” (2, p.
297) as some of the listener responses to
stuttering. Although somewhat ambiguous, it 
is possible that a part of the referents for some 
of these terms include the responses measured 
in this experiment. Various temporal com- 
binations (as well as total frequency and 
duration) of the responses may be crucial to 
the report of the stutterer, and more impor- 
tant, to its effect upon his speech. Future 
experimental studies on the effect of “listener 
behavior” on stuttering which employ the 
results of this study may profitably include 
consideration of such a possibility. 
Nevertheless, the statistical significance ob-
tained for most of the measured responses
demonstrates the probable reality of the reports
of the stutterer about these particular listener 
reactions. This conclusion should be qualified 
to include the fact that there is considerable 
variability between Ss. Inspection of the SD’s 
in Tables 2 and 3 leads us to believe that some 
extreme Ss during “normal speech” behave 
quantitatively similar to other Ss during 
stuttering. With this degree of overlap be- 
tween conditions, some Ss will be relatively 
“poor listeners” regardless of who is talking 
to them. Nor have the present results neces- 
sarily demonstrated that all Ss react differ- 
ently with a stutterer than with a normal 
speaker. Thus, indiscriminate complaining by 
some stutterers about “poor listeners” is not 
ruled out by the present results. To be accurate 
about a single listener, the stuttering individ- 
ual must compare the same listener under two 
stimulus conditions, i.e., when there is rela-
tively fluent speech and when there is stutter-
ing. In this connection, it would be worth
while to investigate how well stutterers can 
discriminate the two conditions by observa- 
tions of the listener only. 

The possibility of adaptation by the listener 
should not be neglected in our account of the 
stutterer's sensitivity to "poor listeners.” It is 
conceivable that prolonged contact with a 
stutterer may reduce the differences in reac- 
tions to stuttering and normal speech. A 




stutterer may continue to respond to his 
listener on the basis of his initial contacts in 
spite of subsequent changes in the behavior of 
the listener. The present study was not de- 
signed to study the adaptation phenomenon. 
More stringent controls on the spacing of in-
terruptions, etc. would have been necessary.
The reports of the Es are suggestive, however,
of the value of such a study. They report that 
the Ss appeared to "relax" and become pro- 
gressively more verbal during the “stuttering 
conditions.” The effect was not so marked 
during the “normal speech condition.” The 
finding by Ainsworth (1) that the variability 
of breathing of the listener decreased during 
the study, tends to support the qualitative 
observations of the Es in the present study 
concerning adaptation. 

A covariance analysis was made for all Ss’ 
responses to discover what effects would re- 
main significant if duration of total speech were 
held constant. It may be argued, for example, 
that less verbal exchange between two persons 
is followed by less eye contact, less activity, 
etc. The covariance results indicate that only 
one response, frequency of eye contact loss, 
remains significant between conditions. This 
result is consistent with the possibility that 
decreased speech results in changes of most of 
the responses. However, we must not neglect 
the equally tenable interpretation that speech 
and the other measured activity covary con- 
comitantly with changes in the independent 
variable. In order to experimentally control the 
amount of verbal behavior, it may be necessary 
to eliminate the difference between conditions. 

In focusing on the listener, the present study 
has departed from most investigations in the 
field which are concerned primarily with the 
behavior of the stutterer. The utilization of 
the present results for further studies on the 
conditions under which stuttering is altered is 
of importance. But it may also be profitable 
to dwell on the behavior of the listener for a 
better understanding of the stuttering phe- 
nomenon. The present experiment indicates 
that stuttering seems to act as a behavioral 
depressant. In terms of learning principles, 
stuttering may be considered as a possible 
negative reinforcer to the listener, i.e., a noxious
or punishing stimulus. Although the effects of
noxious stimuli on behavior are complex, there 
is some reason (4, 5, 8) to believe that noxious
stimuli may act as behavior depressants in
some situations. This fact is particularly evi-
dent in instances where the experimenter has
not intentionally provided escape or avoid-
ance responses. In a study involving the
administration of inescapable shock to rats,
Brown and Jacobs (4) have hypothesized that
the “freezing” of rats on the grid may actually
constitute partial or complete shock avoidance.
They point out that shock reduction may be
effected by greater contact area of the animal
with the grid. If the present experiment is an
analogous situation, decrease in listener ac-
tivity may be part of a noninteractive pattern
which would presumably be reinforced by
producing less interactive behavior on the part
of the stutterer.

Evidence for such a hypothesis is provided
by more than the fact that listener behavior
was generally depressed during stuttering.
In addition, loss of eye contact is increased
during stuttering.⁷ Such responses successfully
terminate (avoid) at least part of the stutter-
ing behavior (particularly visual components)
performed by the E. A clear-cut experimental 
demonstration of the reinforcing properties of 
stuttering would involve the conditioning of a 
listener response in which cessation of stutter- 
ing is contingent upon the occurrence and/or 
maintenance of this response. Adaptation to 
the stuttering by the listener would also appear 
to be an important consideration in attempting 
to use stuttering as a reinforcer. Although 
focused on the listener, this type of study 
may give us a better understanding of how 
the stutterer alters his social interactions. 


⁷ Although not measured quantitatively, the two
Es faking stuttering frequently reported that their
sentences were completed by the listener. This is con-
sistent with the hypothesis that stuttering is a noxious
stimulus and that sentence completion is one of the
instrumental acts which terminate such stimulation.


SUMMARY AND CONCLUSIONS


The purpose of this study was to investigate
the effects of stuttering on the frequency and
duration of (a) loss of eye contact, (b) hand
movement, and (c) other bodily movement of
the listener. Twenty college students were used
as Ss in the investigation. The experimental
situation was defined for them as a waiting
period prior to the “experiment for which they
signed up.” The Ss were exposed to five min-
utes of normal speech in conversation with one
E and five minutes of stuttering speech in
conversation with a second E in this waiting
period. The Es were presumably other Ss who
were also waiting their turns for the experi-
ment. Observations of the Ss' behavior were
made without their knowledge through a
half-silvered mirror by two Os.

Statistically significant F's were found in 
five of the six response measures. The follow- 
ing differences in listener responses were found 
to occur during stuttering: (a) increased dura- 
tion of loss of eye contact, (b) decreased fre- 
quency with which a change in eye contact 
away from the stutterer was made, (c) de- 
creased frequency with which a hand move- 
ment was initiated, (d) decreased total dura- 
tion of other bodily movement, and (e) 
decreased frequency with which a bodily 
movement would be initiated. Adjusted F’s 
were then computed by covariance using total 
speech time of S and E as the uncontrolled
variable. One adjusted F remained significant,
viz., frequency of eye contact loss.

Results of the experiment were discussed in 
terms of the complaints of the stutterer about 
his audience's behavior. Also discussed were the
implications of the present results in the under-
standing of the social interactions of the
stutterer.

The results support the following conclu-
sions.


1, Listener behavior is significantly affected 
by stuttering. 




2. Statistical control for total amount of 
speech removes most of the differences in 
listener behavior during stuttering. 

3. Stuttering seems to act as a behavioral 
depressant to the listener. The “avoidant” 
activities of the listener may be of significance 
in classing stuttering as a negative reinforcer 
(noxious stimulus) to the listener. 


REFERENCES 


1. Ainsworth, S. Studies in the psychology of stuttering: XII. Empathic breathing of auditors while listening to stuttering speech. J. Speech Disorders, 1939, 4, 149-156.

2. Bloodstein, O. Conditions under which stuttering is reduced or absent: a review of literature. J. Speech Hearing Disorders, 1949, 14, 295-302.

3. Bloodstein, O. A rating-scale study of conditions under which stuttering is reduced or absent. J. Speech Hearing Disorders, 1950, 15, 29-36.

4. Brown, J. S., & Jacobs, A. The role of fear in the motivation and acquisition of responses. J. exp. Psychol., 1949, 39, 747-759.

5. Estes, W. K. An experimental study of punishment. Psychol. Monogr., 1944, 57, No. 3 (Whole No. 263).

6. Fletcher, J. M. The problem of stuttering. New York: Longmans, Green, 1928.

7. Greene, J. S. I was a stutterer. New York: Grafton, 1932.

8. Skinner, B. F. The behavior of organisms. New York: D. Appleton-Century, 1938.

9. Snedecor, G. W. Statistical methods. (4th Ed.) Ames, Iowa: The Collegiate Press, 1946.

10. Van Riper, C. The influence of empathic response on frequency of stuttering. Psychol. Monogr., 1937, 49, No. 1 (Whole No. 217), 244-246.

Received March 28, 1953.




COMPLEXITY OF RESPONSE AS A FACTOR IN THE VOCABULARY 
PERFORMANCE OF SCHIZOPHRENICS 


R. HARRINGTON
Veterans Administration Hospital, Marion, Indiana

AND

J. C. EHRMANN¹
Michigan State College

¹ From Veterans Administration Hospital, Fort Custer, Michigan.


Complexity of the response required, as a factor in the vocabulary performance of abnormals, has largely been neglected in the literature. Utilizing the idea that vocabulary seemed to be the most stable aspect of intelligence, Babcock (1) devised a mental efficiency test comparing the vocabulary level with achievement on other types of tests. Yacorzynski, however, has pointed out that this apparent stability may be an artifact of the scoring systems used. He feels that definitions reflecting different kinds of thinking may receive the same quantitative score and, in relation to the fact that this score is often the same, he states, “It seems logical to assume, however, that the reason why the vocabulary of the deteriorated individual remains relatively unchanged is because easier methods are left to the organism even if the more difficult solutions are no longer available” (7, p. 265). Moran, Moran, and Blake (5), however, reject this hypothesis and suggest that most of the differences reported by Feifel were contributed by the organics and manic-depressives in his particular sample (3).

Defining a word verbally is a highly complex task involving memory, motor behavior, perception, motivation, a knowledge of sentence structure, and the ability to select and organize words in a meaningful way. Selecting a correct definition from a number of possible definitions may certainly be considered a much less complex task than offering a spontaneous verbal definition. Motivation, perception, and memory are all involved, but the selection and organization of the words into a meaningful sequence have been done by someone else. All that remains for the subject is to recognize the words and make a choice limited to these recognized possibilities.

It would appear then that the question of whether the “more difficult” solutions are no longer available to an abnormal individual is deserving of study. The present study attempted to investigate the role which the complexity of response plays in the ability of one group of abnormals (schizophrenics) to engage in abstract thinking as inferred from their performance on vocabulary tests. The hypotheses were:

1. The relative concreteness of the defini- 
tions of vocabulary items given by schizo- 
phrenics is in part a function of the com- 
plexity of the response required. 

2. Schizophrenics will give significantly less 
abstract definitions to vocabulary words than 
will normals when the response required is one 
of verbal definition. 

3. There will be no significant differences between schizophrenics and normals in the degree of abstractness of definitions of vocabulary words when the response required is the simple task of indicating a correct choice.


METHOD 


Subjects. Twenty male patients and ward attendants
from the United States Veterans Administration Hospital,
Fort Custer, Michigan served as subjects (Ss).
All patients had been diagnosed as Schizophrenic Reaction,
Type Unclassified, and were in good contact.
Each patient was equated with an attendant on the
factors of age, educational level, IQ, and vocabulary
level. Table 1 presents the data on these factors.

Procedure. Two tasks were utilized. Task I called for
the Ss to define 20 words from the Vocabulary subtest
of the Wechsler-Bellevue Scale, Form I. Task II required
the Ss to indicate the correct definitions in a series
of definitions given for each word. The words were
capable of being defined as nouns and seemed to present
a wide range of difficulty.

Each of the series of five definitions for each word in Task II included one each of the following types of definitions, arranged in random order (examples are given for the word “nitroglycerine”):

Type 1. A definition which would be scored as categorical according to Gerstein's examples (4): “A type of explosive.”
Type 2. A definition which would be scored as functional according to Gerstein's examples: “It is used in bombs.”
Type 3. A definition which would be scored as concrete according to Gerstein's examples: “It is yellow stuff.”
Type 4. A definition which was in no way correct: “A large baby-buggy.”
Type 5. A definition which was incorrect but might be mistaken for the correct definition: “Used to find nitrogen in the air.”




TABLE 1

MEAN SCORES ON AGE, IQ,* EDUCATIONAL AND
VOCABULARY LEVELS OF THE TWO GROUPS

                                     ATTENDANTS   PATIENTS   DIFF.    t       p
Age (in years)                       29.1         28.3       .810     1.228   NS
Education (grade)                    10.6         10.7       .1
IQ                                   103.25       101.0      2.25     1.81    NS
Vocabulary level (standard scores)   6.5          6.8        .3

* From Wechsler-Bellevue Intelligence Scale, Form I (6).


TABLE 2

MEAN SCORES OF THE ATTENDANT AND PATIENT GROUPS
ON VERBAL AND MULTIPLE-CHOICE TASKS

                    ATTENDANTS   PATIENTS   DIFF.   t      p
Verbal
  Rater I           3.47         3.32       .15     2.63   .01 < p < .02
  Rater II          3.47         3.29       .18     3.05   < .01
Multiple-choice     3.80         3.81       .01


The purpose of including definitions of types 4 and
5 was to give some measure of whether the individuals
responded in a random manner or chose incorrect definitions
which were likely to be confused with correct
ones. The data indicate that the scores were not
materially affected by such responses.

Scoring. For Task I, the verbal definitions were rated 
by two independent judges according to the definitions 
of concrete, functional, and categorical as given by 
Gerstein: 

“(a) The concretistic or descriptive method. . . .
It implies memory of an object at the concrete
sensory level. For example, ‘an apple is red.’

(b) The functional or usage method. . . . A
word is defined by the subject recalling the use
to which that object was put in the past; e.g.,
‘an apple is something you eat.’

(c) The categorical or conceptual method. This
represents the abstract method of definition. . . .
‘An apple is a fruit’ is a conceptual
definition whose modus operandi is more complex
than the descriptive or functional methods”
(4, p. 368).

It is important to realize that these definitions were scored on the degree of abstractness of the definitions, whether correct or not in the usual sense. Hence, a definition which would receive no credit according to Wechsler's criteria could be given an appropriate score, depending on whether it was judged to be concrete, functional, or categorical.

Each type of response received a scoring weight as follows: Don't Know = 1; Concrete = 2; Functional = 3; Categorical = 4. After scoring each definition according to the highest type of response given, an average score was obtained for each subject. In 91 per cent of the definitions given, the two judges agreed on the exact score to be given.

For Task II the following types of definitions gained the weights indicated: No response = 1; Incorrect response = 1; Incorrect but capable of being confused with the correct definition = 1; Concrete = 2; Functional = 3; Categorical = 4.
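
The scoring rule for the verbal task can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the weights are taken as 1 through 4 in the order Don't Know, Concrete, Functional, Categorical; the label strings and the function names `score_definition` and `subject_mean` are hypothetical; and the sample ratings are invented for the example, not data from the study.

```python
from statistics import mean

# Assumed weights for Task I, following the order given in the text.
# The key names are hypothetical labels chosen for this sketch.
WEIGHTS = {"dont_know": 1, "concrete": 2, "functional": 3, "categorical": 4}


def score_definition(rated_types):
    """Score one definition by the highest type of response it contains."""
    return max(WEIGHTS[t] for t in rated_types)


def subject_mean(definitions):
    """Average the per-definition scores to get one score per subject."""
    return mean(score_definition(d) for d in definitions)


# Invented ratings for one subject on four words (illustration only).
example = [
    ["concrete", "functional"],  # highest type is functional
    ["categorical"],
    ["dont_know"],
    ["functional"],
]
print(subject_mean(example))
```

Taking the maximum per definition mirrors the statement that each definition was scored according to the highest type of response given; the group means in Table 2 would then be averages of such subject scores.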


RESULTS


It can be seen from Table 2 that the patient group was significantly less abstract in its verbal definitions than was the attendant group. This finding is in accord with what would be expected on the basis of the work of other investigators and supports hypothesis 2 of this study.

Table 2 also indicates that there was no 
significant difference between the groups on 
the multiple-choice task, and thus hypothesis 3 
of the present study is supported. The results 
of this study fail to show the differences ob- 
tained by Chodorkoff and Mussen (2). 

These results lend support to the Yacorzyn- 
ski hypothesis and cast doubt upon the expla- 
nation given by Moran, Moran, and Blake (5) 
in regard to Feifel’s results. When analyzed 
on the present three levels, the schizophrenics 
did give significantly fewer abstract definitions 
than equated normals, and it is entirely pos- 
sible that the same phenomenon occurred in 
Feifel’s sample. 

Both groups obtained a higher score on the 
multiple-choice task than on the verbal task, 
this difference being significant for both groups 
at beyond the .01 level. This discrepancy be- 
tween the scores on the verbal and multiple- 
choice tasks was significantly greater, at the 
.02 level, for the patient group than for the 
attendant group, when the smallest difference 
between the means (Rater I) is considered. 


DISCUSSION

These results suggest that complexity of the response is an important factor in the vocabulary performance of schizophrenics, and the differential findings of this investigation raise some interesting questions regarding the cognitive functions in schizophrenia. Whether they can be explained as an attempt by the schizophrenic to simplify, and thus maintain, a semblance of order in the world, or whether they represent a definite impairment in the ability to give a complex, highly integrative response is a problem for future investigation.


SUMMARY 


Equated pairs of schizophrenics and attendants were tested on two tasks differing in the complexity of the response required. The performance of the schizophrenic group was significantly poorer than that of the attendants where the more complex response was involved. Where the less complex response was involved, the groups did not differ.

Both groups did significantly better on the 
less complex task but this discrepancy was sig- 
nificantly greater for the schizophrenic group. 

The results are interpreted as supporting the 
hypotheses of the study. They support the 
Yacorzynski hypothesis and cast doubt on 
Moran, Moran, and Blake’s recent explanation 
of Feifel’s results. 


REFERENCES 


1. Babcock, H. An experiment in the measurement of mental deterioration. Arch. Psychol., 1930, No. 117.

2. Chodorkoff, B., & Mussen, P. Qualitative aspects of the vocabulary responses of normals and schizophrenics. J. consult. Psychol., 1952, 16, 43-48.

3. Feifel, H. Qualitative differences in the vocabulary responses of normals and abnormals. Genet. Psychol. Monogr., 1949, 39, 151-204.

4. Gerstein, Reva A. A suggested method of analyzing and extending the use of Bellevue-Wechsler vocabulary responses. J. consult. Psychol., 1949, 13, 366-374.

5. Moran, L. J., Moran, Frances A., & Blake, R. R. An investigation of the vocabulary performance of schizophrenics: II. Conceptual level of definitions. J. genet. Psychol., 1952, 80, 107-132.

6. Wechsler, D. The measurement of adult intelligence. (3rd Ed.) Baltimore: Williams & Wilkins, 1944.

7. Yacorzynski, G. K. An evaluation of the postulates underlying the Babcock deterioration test. Psychol. Rev., 1941, 48, 261-267.


Received April 6, 1953.


AUTHORITARIANISM AND LEADERSHIP CHOICE IN
A MILITARY SETTING¹


E. P. HOLLANDER²
Carnegie Institute of Technology


Since the publication of The Authoritarian Personality (1) a few short years ago, a spate of studies has been reported in which scores derived from the F scale or one of its modifications have been related to a variety of cognitive, behavioral, or social criteria. Paralleling this development, the influence of situational determinants of leadership has received heightened emphasis in the literature (e.g., 8, 11, 18). A number of recent reports, in particular, have sought to bring about an articulation of these two areas by relating authoritarianism³ to leadership emergence and followership (2, 4, 7, 16, 22). Among other evidence contained in these reports, Sanford (16) has provided a link between authoritarianism and orientation to leadership, and Christie (4) has established a relationship between acceptance within military trainee groups in an Army basic training unit and shifts in authoritarianism scores accompanying training. The findings of these studies suggest that an individual's choice of a leader is related to his authoritarianism and that, within a military context, individuals who show an increase in their authoritarianism scores during training tend to be more readily assimilated into the organizational structure.

¹ Opinions or conclusions contained in this report are those of the author. They are not to be construed as necessarily reflecting the view or the endorsement of the Navy Department.

² This report is based upon a dissertation submitted to Columbia University in partial fulfillment of the requirements for the degree of Doctor of Philosophy. The author wishes to acknowledge his appreciation to Goodwin Watson, Robert L. Thorndike, Edward . . . , Otto Klineberg, Kenneth Herrold, and Sam . . . of that institution, and to a multitude of . . . at the Naval School of Aviation Medicine, the Bureau of Medicine and Surgery, and the Office of Naval Research for their valuable assistance and encouragement in the implementation and fulfillment of this study.

A paper drawn from the substance of this report was presented before Division 8 at the 1953 meetings of the American Psychological Association.

The research covered herein was completed by the author at the Aviation Psychology Laboratory of the Naval School of Aviation Medicine, Pensacola, Florida.

³ The term “authoritarianism” as it is used here shall be restricted in meaning to an acceptance of authoritarian attitudes embodied in the F scale or one of its modifications.


PROBLEM 


Within military groups, the problem centering about authoritarianism and leadership choice by peers poses at least three fundamental questions: First, is the military leadership role perceived to be authoritarian within a given military group? If so, are the more authoritarian members of the group more likely to be chosen as "military leaders" by their potential followers? Finally, is there a definable relationship between one's own authoritarianism score and the authoritarianism score of the individual who is one's leadership choice?

An answer to the first question has been 
tentatively provided by an exploratory study 
recently completed by the author among 
Naval Aviation Cadets. The hypothesis 
tested was that the military leadership role 
would be perceived by these subjects to be 
authoritarian, within a framework provided 
by the F scale. It was found that, when the 
cadets were given two consecutive adminis- 
trations of the F scale, the first under a 
conventional set and the second with instruc- 
tions to respond to the scale as though it 
were “a test of military leadership potential" 
on which they wished to make a “good score,” 
a significant upward shift in F scores was obtained from the first to the second administration. The hypothesis was upheld, and it was concluded that F-scale attitudes were likely to be ascribed to the military leadership role by the subjects utilized. The two remaining questions were then investigated by the study to be reported here. The hypotheses considered were as follows: (a) that there is a significant positive relationship between scores achieved on the F scale and choice by one's peers for a military leadership position; and (b) that there is a significant difference, with respect to F-scale scores, between the leadership nominees of those “high” on authoritarianism and those “low” on authoritarianism.


METHOD 


Subjects. Nine consecutively formed sections of
cadets at the Naval School of Pre-Flight, a total N
of 268, were drawn as a sample. The age range for the
group was 19 to 26; its mean age was 22.1 years, with 
slightly more than 80 per cent of the cases falling within 
a range of 21 through 24. The educational level of the 
subjects (Ss) ranged from high school graduates to one 
S with six years of college training; the mean was 2.8 
years of college, with approximately 86 per cent of the 
group having had two or more full years of college. 
About half of the group had had some previous en- 
listed service. Geographically, the process of selection 
operated so as to represent the major areas of this coun- 
try in the study sample. 

Procedure. At the end of three months of training, 
the Ss were asked to fill out a leadership nomination 
form for their section on which they indicated three 
cadets whom they considered to be best qualified for
the hypothetical position of “student commander,”
and three whom they considered to be least qualified.
This form closely approximates those sociometric 
nomination forms used in previous studies in similar 
settings (e.g., 14, 20, 21). By applying appropriate 
plus and minus weights of 3, 2, 1 for each nomination 
an individual received, a leadership score was derived. 
At about the same time in the test administration 
schedule, Ss were requested to complete the F scale. 
The leadership scores and F scores were then transformed
to standard scores based upon the individual
section.⁵ In each case, 30 was set as the mean and 10 as
the SD.
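
The derivation of the leadership score and its conversion to a standard score can be sketched as follows. This is a minimal illustration, not the author's procedure in detail: it assumes that the three "best qualified" nominations on a form carry weights of +3, +2, +1 in the order listed and the three "least qualified" nominations carry −3, −2, −1, and that the SD is the population SD within a section. The function names and the sample ballots are hypothetical.

```python
from statistics import mean, pstdev

# Assumed mapping of nomination order to weights (an interpretation of
# "plus and minus weights of 3, 2, 1").
POSITIVE_WEIGHTS = (3, 2, 1)
NEGATIVE_WEIGHTS = (-3, -2, -1)


def leadership_scores(cadets, ballots):
    """Sum the weighted nominations each cadet receives within a section."""
    scores = {c: 0 for c in cadets}
    for best, least in ballots:
        for name, weight in zip(best, POSITIVE_WEIGHTS):
            scores[name] += weight
        for name, weight in zip(least, NEGATIVE_WEIGHTS):
            scores[name] += weight
    return scores


def standardize(raw, new_mean=30.0, new_sd=10.0):
    """Rescale raw scores so the section has the given mean and SD."""
    values = list(raw.values())
    m, sd = mean(values), pstdev(values)
    return {k: new_mean + new_sd * (v - m) / sd for k, v in raw.items()}


# Invented section of six cadets and two nomination forms (illustration only).
cadets = ["A", "B", "C", "D", "E", "F"]
ballots = [
    (("A", "B", "C"), ("D", "E", "F")),
    (("B", "A", "D"), ("F", "E", "C")),
]
raw = leadership_scores(cadets, ballots)
standard = standardize(raw)
print(raw)
print({k: round(v, 1) for k, v in standard.items()})
```

Standardizing within each section, as the text describes, puts sections of different size and nomination density on a common scale before they are pooled.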


RESULTS 


The major hypothesis was tested by two 
analyses. The first of these, a chi-square 
analysis making use of the total sample distrib- 
uted into categories of “high,” “medium,” 
and "low" for the two major variables, is 
presented in Table 1. As will be noted, the 
chi square was significant beyond the .05 
level, with the obtained frequencies indicating 
a rejection of the hypothesis. In addition to 
this, a correlational analysis was completed 
between authoritarianism and leadership 
scores for the total sample. The correlation of
−.23 secured between these variables was
significant at the .01 level.⁶


⁵ The distribution of raw F-scale total scores was
unimodal and approximately symmetrical. The mean
equalled 114.01 with an SD of 20.92. There is no reason
to suppose that this sample differed significantly from
the California samples on which the F scale was
normed.

⁶ Since some degree of curvilinearity was observed
in the correlation plot, an eta was computed and found


TABLE 1

AUTHORITARIANISM SCORES AND LEADERSHIP
STANDING BY CRITERION GROUPS†

                            LEADERSHIP*
AUTHORITARIANISM   HIGH         MEDIUM     LOW        TOTALS
High               13 (21)**    34 (31)    27 (22)    74
Medium             30 (32)      53 (47)    31 (35)    114
Low                32 (22)      25 (34)    23 (24)    80
Totals             75           112        81         268

† High indicates a score of .6σ or more above the section mean; Medium indicates a score between .5σ above and .5σ below the section mean; Low indicates a score of .6σ or more below the section mean.

* Chi square = 12.77; df = 4; p < .05.

** The numbers in parentheses indicate the expected frequency for each cell.
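
The chi-square computation behind Table 1 can be sketched from the observed cell frequencies. The function name `chi_square` is hypothetical, and the sketch uses unrounded expected frequencies, whereas the table prints them as whole numbers, so the statistic it produces need not reproduce the reported 12.77 exactly. The critical value 9.488 is the standard tabled chi-square value for p = .05 with 4 df.

```python
# Observed frequencies from Table 1.
# Rows: authoritarianism High, Medium, Low.
# Columns: leadership High, Medium, Low.
observed = [
    [13, 34, 27],
    [30, 53, 31],
    [32, 25, 23],
]


def chi_square(table):
    """Pearson chi-square for a contingency table, with its degrees of freedom."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, obs in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (obs - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return stat, df


stat, df = chi_square(observed)
CRITICAL_05_DF4 = 9.488  # tabled value for p = .05, df = 4
print(round(stat, 2), df, stat > CRITICAL_05_DF4)
```

The largest contributions come from the High-authoritarianism/High-leadership cell (fewer cases than expected) and the Low-authoritarianism/High-leadership cell (more than expected), which is the pattern the text describes as running against the hypothesis.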


In a supplementary series of analyses, it was found that scores achieved by Ss on the college-level ACE test correlated negatively with the authoritarianism scores, r = −.21, and positively with leadership, r = +.30. Accordingly, a partial r was calculated between the two major study variables with intelligence held constant; r_AL.I was found to be −.18, a low but still significant relationship. No other variables studied, i.e., age, educational level attained, or previous military experience, were found to be related significantly to both authoritarianism and leadership. Officer-like qualities (OLQ) scores assigned by superior officers in charge of the cadets were found, however, to be correlated with the leadership scores at a level of +.55. On the other hand, the authoritarianism scores correlated with OLQ at a virtually zero level, r = −.06. Table 2 presents a matrix of the relevant intercorrelations.
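
The partial correlation reported above can be checked with the usual first-order formula. The sketch assumes that formula is the one used, takes the ACE-authoritarianism correlation as −.21 (the value shown in Table 2), and uses the hypothetical function name `partial_r`.

```python
from math import sqrt


def partial_r(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y with z held constant."""
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


r_al = -0.23  # authoritarianism with leadership
r_ai = -0.21  # authoritarianism with ACE score
r_li = 0.30   # leadership with ACE score

print(round(partial_r(r_al, r_ai, r_li), 2))  # -0.18
```

With these inputs the formula gives about −.179, which agrees with the −.18 stated in the text.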

to be .29. The chi-square test of linearity was applied and a value which did not support the hypothesis of nonlinearity was yielded.

The second hypothesis was tested by a further analysis summarized in Table 3. Those “high” on authoritarianism were compared with those “low” on authoritarianism with regard to their respective leadership nominees' scores on the authoritarianism variable. The results revealed that the authoritarianism scores of those Ss nominated “highest” on leadership by Ss who were respectively “high” and “low” on authoritarianism did not significantly differ from one another; this held true for the “lowest” nominees as well. It would appear, therefore, that Ss nominated as “highest” on leadership were significantly lower on authoritarianism, irrespective of the standing on authoritarianism of those making the nominations.


TABLE 2

INTERCORRELATIONS AMONG AUTHORITARIANISM SCORE,
ACE TEST SCORE, OFFICER-LIKE QUALITIES
SCORE, AND LEADERSHIP CRITERION
FOR TOTAL STUDY SAMPLE

(N = 268; σ_r = .06)

VARIABLES           ACE        OLQ        L. CRIT.
Authoritarianism    −.21**     −.06       −.23**
ACE                            −.17**     +.30**
OLQ                                       +.55**

** p < .01


TABLE 3

MEAN AUTHORITARIANISM SCORES* OF CADETS
NOMINATED HIGHEST OR LOWEST ON LEADERSHIP
BY CADETS WHO WERE HIGH OR
LOW ON AUTHORITARIANISM

                                          NOMINEES
NOMINATORS            NOMINATED “HIGHEST”    NOMINATED “LOWEST”
                      ON LEADERSHIP          ON LEADERSHIP
High score on         Mean = 25.30           Mean = 33.74           CR = 4.87
authoritarianism      σ² = 80.06             σ² = 136.97            p < .001
                      N = . . .              N = 7 . .
Low score on          Mean = 27.29           Mean = 33.17           CR = 3.79
authoritarianism      σ² = 60.60             σ² = 129.55            p < .001
                      N = 80                 N = 78
                      CR = 1.38              CR = .30
                      p > .10                p > .10

* The scores reported are based upon a standard score distribution with a mean of 30 and an SD of 10.
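
The critical ratios in Table 3 can be illustrated with the row for nominators scoring low on authoritarianism, where both Ns are legible. The sketch assumes the critical ratio is the difference between two independent means divided by the standard error of that difference; the source does not spell out the formula, and the function name `critical_ratio` is hypothetical.

```python
from math import sqrt


def critical_ratio(mean1, var1, n1, mean2, var2, n2):
    """Difference between two independent means over its standard error."""
    se_diff = sqrt(var1 / n1 + var2 / n2)
    return (mean2 - mean1) / se_diff


# Nominators low on authoritarianism (Table 3): nominees named "highest"
# versus nominees named "lowest" on leadership.
cr = critical_ratio(27.29, 60.60, 80, 33.17, 129.55, 78)
print(round(cr, 2))
```

Under this assumption the result comes out close to the tabled 3.79; the small gap is of the size one would expect from rounding in the printed means and variances.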


DISCUSSION


The findings obtained indicate a rejection of the two hypotheses with the resultant conclusion that authoritarians are less acceptable, as leaders, to their fellow cadets, than nonauthoritarians. A fundamental paradox seems to be posed here. The military institution does present an essentially authoritarian structure after all. Moreover, it may be granted that military officer indoctrinees are exposed to an authoritarian conception of military leadership,⁷ and, in keeping with the evidence presented above, that this cadet population tends to ascribe authoritarian attitudes to the military leadership role. Yet, in a free choice situation such as this, where a military leadership role is to be filled, the more authoritarian individuals are not popular choices among the trainees.⁸ Indeed, precisely
the reverse trend is observed. In considering 
this finding, there are two assumptions in- 
volved which properly might be scrutinized: 
(a) that Ss would act upon the traditional 
role prescriptions which others like themselves 
appear to recognize as bound up with the 
military leadership role; (b) that the accept- 
ance of authoritarian ideology, as embodied 
in the F scale, necessarily implies the mani- 
festation of some form of authoritarian 
behavior. 

Taking account, first, of the discrepancy 
between the recognition of certain role pre- 
scriptions and the actual selections for role 
fulfillment, one might contend that the con- 
ception of an authoritarian military leader 
may be anchored in an incomplete and some- 
what distorted stereotype. Operationally, this 
stereotype might be rejected under the impact 
of the intimate human interactions which 
serve to determine leadership choice, even 
within a military context. A case might readily 
be made in support of the further contention 
that the individual chosen as a leader by his 
peers, in a military setting where rigid lines 
of authority are drawn, may be one who is 
less authoritarian in his interactions with 
them.⁹ This relationship might be attributable to loyalties within the peer group which resist dominance behavior of the sort of which the authoritarian is presumptively capable. Thus, the authoritarian may elicit a negative response from his potential followers because of his emulation of an authoritarian conception of the military leadership role.

⁷ Cf. M. Brewster Smith's description of the impact of OCS in The American Soldier (19, p. 389).

⁸ While this finding is apparently in contradiction to Christie's (4), it is of special interest to note that his most accepted group had initially been below the group mean on authoritarianism.

⁹ Although this would appear to run contrary to the work of Lewin and his associates (12), where internal aggression was found to characterize groups of youngsters under authoritarian rule, there may be several obvious reasons which would serve to explain this contradiction: First, cadet groups function with a high level of goal orientation; second, numerous opportunities are afforded for the release of aggression through competition with other groups; finally, there remains an open question as to the comparability of child and adult groups.

There yet remains a question as to the 
reasons for the conflict between this study 
and that by Carter, Haythorn, et al. (3)
which found that participants who secured 
leadership status in leaderless group discus- 
sions displayed authoritarian behavior. Notice, 
first, that in the latter study a behavioral 
criterion was applied and, second, that the 
investigators studied discussion groups of an 
unstable, temporary nature. The current 
study simply infers authoritarian behavior 
and, in contrast, makes use of persisting 
"real" groups of some months’ establishment. 
It might also be noted that Bass, McGehee, 
et al. (2) report a significant negative, but 
somewhat curvilinear, relationship between 
F-scale scores and leadership status in leader- 
less group discussions. The discrepancy be- 
tween this finding and the one reported by 
Carter, Haythorn, et al. leads appropriately
to a consideration of the second assumption 
noted in the foregoing, that is, the correspond- 
ence between authoritarian behavior in some 
form and the subscription to F-scale atti- 
tudes. 

Without implying any disrespect for the 
monumental achievement represented by The 
Authoritarian Personality, one might wish 
nonetheless to further examine the F scale 
and the meaning of an F-scale score. At the 
outset, fairness demands that consideration 
be given the fact that the scale’s constructors 
were concerned with tapping the personality 
syndrome which would be particularly re- 
ceptive to antidemocratic ideology (1, p. 4). 
In this regard, they have established that the 
scale is related to clinical evidence of rigidity, 
stereotypy of thinking, moral conservatism, 
and the like. Moreover, the scale has been 
shown to correlate respectably with scales of 
anti-Semitism, ethnocentrism, and political- 
economic conservatism. Yet, withal, there 
exists no body of evidence which directly 
indicates that an acceptance of this ideology 
is necessarily indicative of some qualitative 
behavior, i.e., a traditionally conceived
“authoritarian behavior.” This is not to 
deny that behavioral differences can be in- 
ferred (as they usually are) from empirical 




evidence. Several notable studies, among 
them those by Rohde (15), Bass, McGehee, 
et al. (2), and Scodel and Mussen (17), support 
the notion that perceptible differences do 
exist between the behavior of “high” and 
“low” scorers on the F scale. Even where 
the influence of intelligence is held constant, as is the case in the current study, apparent differences in the behavior of “highs” and “lows” persist. This kind of evidence, while not conclusive, cannot be dismissed lightly. But it still gains us little in our quest for the precise behavioral qualities of the extreme groups on a continuum of F-scale scores, which means, in effect, that we cannot necessarily impute authoritarian behavior to the person who accepts F-scale attitudes.
What may we say, then, about the individual who scores high on the F scale? In the first place, it is apparent that these “authoritarians” are defined as such on the basis of their having agreed with some grouping of F items. Furthermore, as they stand, these items appear to be a potpourri of discredited concepts running the gamut from hokum (“some day it will probably be shown that astrology can explain a lot of things”) to sanctimonious self-indulgence (“no sane, normal, decent person could ever think of hurting a close friend or relative”). When this state of affairs is taken into account, it seems reasonable to consider that the “high” scorer on the F scale may not be so much an authoritarian, behaviorally, as he may be an individual with probable inadequacies in the sphere of social intelligence or social perception. This conception does no great disservice to the work of the California group, and it partially explains contradictions of the sort alluded to above in connection with the study by Carter, Haythorn, et al. (3) on the one hand, and Bass, McGehee, et al. (2) on the other. More pointedly, too, it brings into focus a likely explanation for the findings obtained in the current study.
A variety of investigations (cf. 11, 18) have suggested that responsiveness to the needs of others, a social facility perhaps, may constitute a relatively stable component of leadership. Also, studies completed in military units have indicated that some factor similar to “permissiveness” plays a role in determining a leader's acceptability to his followers. In this regard, particular attention is directed to a study reported in The American Soldier (19, p. 385) and to an Air Force study carried on during World War II (5, p. 34). Both the enlisted and officer combat veterans questioned in these studies indicated a conception of the military leader's role which was heavily weighted in the permissive direction. Of special note was the expression of a desire for greater responsiveness to their needs, opinions, and very beings by the men appointed over them.

If it is true that the man who subscribes to the bulk of the F-scale items is correspondingly inadequate in his social interactions, this would imply that he is ill-equipped to accept the needs of other individuals or to deal with them in a socially effective fashion.¹⁰ Even within the context of an essentially authoritarian institution, therefore, the “F-scale authoritarian” may be expected to draw only a minimal following for a position of leadership. Viewing the findings within this frame of reference probably serves best to explain the fact that whether nominators were “high” or “low” on the F scale, they tended to favor “low” scorers as military leaders.

In conclusion, then, we have seen that
acceptance of authoritarian attitudes may
make an individual less desirable as a military
leader from the standpoint of those who may
be placed in a position of followership, in this
case officer indoctrinees. Whether such free
choice of a leader renders that leader effective
in institutional terms remains an open ques-
tion to be studied further, although there
already exists some affirmative evidence (e.g.,
20, p. 157; 21, p. 291).¹¹ In this vein, additional
study appears indicated as well with respect
to the relationship between the acceptance
of F-scale attitudes and the manifestation of
authoritarian behavior. Clearly, we are con-
fronted with the fact that there are a variety
of meanings to the descriptive term “authori-
tarian,” i.e., an ideology, a behavior, and an
institutional form. At this juncture, the reality
of the matter appears to be that these mean-
ings cannot be interchanged without attendant
confusion in our research efforts.

It should be reiterated that the relation-
ships found in this investigation are of a
relatively low magnitude and that their
utility for prediction is virtually nil. On the
positive side, however, there remains a
residual of evidence which indicates that
there are personal qualities relevant to social
interaction, measurable to some extent by the
F scale, which bear upon an individual's
acceptability as a military leader to potential
followers.

10 B. Jones, of the Naval School of Aviation
Medicine, has recently completed a study with two
independent samples of cadets in which he correlated
the Taylor Manifest Anxiety Scale of the MMPI with
[illegible]. The [correlations] for these groups were
+.21 (N = 166) and +.22 (N = 245) respectively; in
both cases the rs were significant beyond the .01
level. It would appear, therefore, that “high” scorers
on the F scale are inclined to have greater anxiety,
which undoubtedly [impairs] the adequacy of their
social interaction. This, [illegible] to accord well with
the points [illegible] the leadership acceptability of
such individuals.

11 A recent study by the author (10) indicates that
peer nominations on leadership, taken at the preflight
level, significantly predict the pass-fail criterion in
Naval Air Training. The validity coefficient for this
variable was equalled only by that for final preflight
average; other predictor variables from this level of
training, including superiors’ ratings on military
leadership qualities, showed no comparable predict-
ability.


SUMMARY


A study was undertaken to determine the
relationship between F-scale scores and
leadership status among officer indoctrinees.
The major hypothesis was that there would
be a significant positive relationship between
scores on a measure of authoritarianism and
leadership acceptance by peers in a military
setting.

Nine consecutive sections of cadets at the
Naval School of Pre-Flight, an N of 268
cases, were utilized as Ss. At the end of the
third month of training, Ss filled out a leader-
ship nomination form for their section on
which they named three cadets best qualified
for the hypothetical position of “student
commander” and three who were least quali-
fied. At about the same time, cadets completed
the F scale. Leadership nomination scores
and authoritarianism scores were then de-
rived for all cadets.

Correlational and chi-square analyses tested
the hypothesis and indicated a significant
negative relationship between authoritarian-
ism and leadership. The correlation of -.23
between these variables was significant beyond
the .01 level. A supplementary series of
analyses focused upon the relationship of a
number of other variables to the two main
study variables. Intelligence, as measured by
the ACE test, was found to be related signifi-
cantly to both authoritarianism, r = -.21,
and peer nominations for leadership, r = +.30.
The significance of the negative relationship
between authoritarianism and leadership
persisted, however, even when the effect of
intelligence was partialed. In addition to the
major finding, it was found that the leader-
ship nominees of those cadets “high” or “low”
on authoritarianism did not differ significantly
with respect to their authoritarianism scores.
In both cases, those nominated “highest”
on leadership had a significantly lower score
on authoritarianism than those nominated
“lowest” on leadership.

A number of tentative suggestions was 
offered to explain the relationships obtained. 
It was concluded that a high score on the 
F scale might be indicative of a lack of social 
intelligence or social perception. Accordingly, 
it was suggested that individuals who are 
"authoritarians" in the F-scale sense are 
unable to deal effectively with the needs of 
others and therefore tend to be rejected as 
leaders by potential followers. Further study 
aimed at pin-pointing the multiple usages of 
the term “authoritarian” was indicated as 
necessary. 
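
The summary's statement that the negative relationship persisted with intelligence partialed can be checked against the three coefficients it reports. The Python sketch below applies the standard first-order partial correlation formula. It assumes that all three correlations rest on the full N of 268, which the summary does not state, and the function names are illustrative only.

```python
import math

def partial_r(r_xy, r_xz, r_yz):
    """First-order partial correlation between x and y with z held constant."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))

def t_statistic(r, n, n_partialed=0):
    """t ratio for testing a (partial) correlation against zero, with its df."""
    df = n - 2 - n_partialed
    return r * math.sqrt(df / (1 - r ** 2)), df

# Coefficients reported in the summary; N = 268 is assumed for all three.
R_AUTH_LEAD = -0.23    # F scale with leadership nominations
R_AUTH_INTEL = -0.21   # F scale with ACE
R_LEAD_INTEL = 0.30    # leadership nominations with ACE
N = 268

r_partial = partial_r(R_AUTH_LEAD, R_AUTH_INTEL, R_LEAD_INTEL)
t_partial, df_partial = t_statistic(r_partial, N, n_partialed=1)

print(f"partial r = {r_partial:.3f}, t = {t_partial:.2f}, df = {df_partial}")
```

Under these assumptions the partial coefficient comes to about -.18, and its t ratio of roughly -3.0 on 265 df still exceeds the two-tailed .01 critical value (about 2.6 for samples of this size), which is consistent with the statement in the summary.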

REFERENCES 


1. Adorno, T. W., Frenkel-Brunswik, Else, Levin-
son, D. J., & Sanford, R. N. The authoritarian
personality. New York: Harper, 1950.

2. Bass, B. M., McGehee, C. R., Hawkins, W. C.,
Young, P. C., & Gebel, A. S. Personality vari-
ables related to leaderless group discussion be-
havior. J. abnorm. soc. Psychol., 1953, 48, 120-
128.

3. Carter, L. F., Haythorn, W., Meirowitz, B.,
& Lanzetta, J. The relation of categorizations
and ratings in the observation of group be-
havior. Hum. Relat., 1951, 4, 239-253.

4. Christie, R. Changes in authoritarianism as re-
lated to situational factors. Amer. Psychologist,
1952, 7, 307. (Abstract)

5. Crannell, C. W., & Mollenkopf, W. G. Combat
leadership. Chapter III in Army Air Forces
Aviation Psychology Program Research Reports.
Washington: U. S. Government Printing Office,
1947. (Rep. No. 14)

6. Gibb, C. A. The principles and traits of leadership.
J. abnorm. soc. Psychol., 1947, 42, 267-284.

7. Havron, D., Fay, R., & Goodacre, D. The effec-
tiveness of small military units. Washington:
Institute for Research in Human Relations,
1951.

8. Hemphill, J. K. Situational factors in leadership.
Ohio St. Univer. Stud., Bur. Educ. Res. Monogr.,
1949, No. 32.

9. Hollander, E. P. An investigation of the rela-
tionship between authoritarian attitudes and
leadership selection in a military setting. Un-
published doctor's dissertation, Columbia Uni-
ver., 1952.

10. Hollander, E. P. Peer nominations on leadership
as a predictor of the pass-fail criterion in Naval
Air Training. J. appl. Psychol., 1954, 38, 150-153.

11. Jenkins, W. O. A review of leadership studies
with particular reference to military problems.
Psychol. Bull., 1947, 44, 54-79.

12. Lewin, K., Lippitt, R., & White, R. K. Patterns
of aggressive behavior in experimentally created
“social climates.” J. soc. Psychol., 1939, 10, 271-
299.

13. Maslow, A. H. The authoritarian character struc-
ture. J. soc. Psychol., 1943, 18, 401-411.

14. McClure, G. E., Tupes, E. C., & Dailey, J. T.
Research on criteria of officer effectiveness.
USAF Hum. Resour. Res. Cent., Res. Bull., 1951,
No. 51-8.

15. Rohde, K. J. The relation of authoritarianism of
the air crew member to his acceptance by the
airplane commander. Amer. Psychologist, 1951,
6, 323. (Abstract)

16. Sanford, F. H. Authoritarianism and leadership.
Philadelphia: Institute for Research in Human
Relations, 1950.

17. Scodel, A., & Mussen, P. Social perceptions of
authoritarians and nonauthoritarians. J. ab-
norm. soc. Psychol., 1953, 48, 181-184.

18. Stogdill, R. M. Personal factors associated with
leadership: a survey of the literature. J. Psychol.,
1948, 25, 35-71.

19. Stouffer, S. A., Suchman, E. A., DeVinney,
L. C., Star, S. A., & Williams, R. M. Adjust-
ment during army life. Vol. 1. The American
soldier. Princeton: Princeton Univer. Press,
1949.

20. Wherry, R. J., & Fryer, D. H. Buddy ratings:
popularity contest or leadership criterion? Per-
sonnel Psychol., 1949, 2, 147-159.

21. Williams, S. B., & Leavitt, H. J. Group opinion
as a predictor of military leadership. J. consult.
Psychol., 1947, 11, 283-291.

22. Leadership identification and acceptance. Phila-
delphia: Institute for Research in Human Rela-
tions, February, 1952.


Received April 1, 1953. 




TEST ANXIETY AND RORSCHACH PERFORMANCE 


FRANCIS N. COX 
New England University College, Australia. 
AND SEYMOUR B. SARASON 
Yale University 


Recent studies (8, 9, 10, 11) have been
concerned with the analysis of
drives and responses elicited in the
test situation and with the experimental
evidence supporting hypotheses concerning
relationships between level of anxiety and
quality of performance under different condi-
tions of test administration. In light of these
studies it now seems possible to attempt to
extend the theoretical analysis to performance
in other test situations, both to determine the
generality of the hypotheses and to test the
validity of certain clinical tests of anxiety.
Projective techniques afford an obvious field
for such experimentation, and this study is an
investigation of the effects of the above vari-
ables upon Rorschach performance.

Since Benton (2) commented upon the lack 
of adequate experimental investigations of 
Rorschach anxiety indices, Eichler (4) has 
reported results of a carefully controlled study.
His procedure was to compare the performance
of two matched groups of college students on
fifteen alleged Rorschach anxiety signs before
and after exposure to stress conditions.

“‘Stress’ consisted of the administration of
electric shock of increasing intensity, the
implication by the experimenter of forthcoming
application of stronger shock, and the use of
certain equipment for suggestive effects” (4,
p. 354). These conditions had a significant
effect on four indices, and three others yielded
suggestive results.

In the present study the performances of
questionnaire-selected high and low anxious
subjects (Ss) were compared under two differ-
ent conditions; half of each group was given
standard instructions, and the other half was
told their intelligence was being tested. Those
of Eichler's anxiety signs which came out
significantly or nearly significantly were used.
Other predictions were derived from the
assumptions to be presented below. It should
be emphasized that one of our predictions is
the direct opposite of one of Eichler’s, namely,
that relative to low anxious Ss, high anxious
individuals will give fewer surface shading
(Fc, cF, and c) responses.¹


THEORETICAL CONSIDERATIONS
AND HYPOTHESES


The following is in part a summary from a 
more detailed presentation in a previous 
publication (8). 

Assumption 1. Anxiety is assumed to be a
learned drive with the characteristics of a
strong stimulus. When anxiety has been
learned as a response to situations involving
intellectual achievement, two incompatible
responses are likely to be evoked:

a. Responses which are not task-relevant
(designated as Ra). These responses are
assumed to be associated with subjective,
personalized, self-centered feelings.

b. Task-relevant responses which reduce
anxiety by leading to completion of the task.
These responses (designated as Rat) are held
to be functionally equivalent to other task-
relevant responses (Rt) evoked by cues in the
test situation.

Assumption 2. It has also been assumed 
that there is a positive relationship between 
level of anxiety and frequency of evocation of 
responses of the order Ra. 

Assumption 3. The stronger the drive the
stronger must be the defense against the drive
when the need for defense is evoked. The
stronger the drive the more variable will be
the inhibiting effects of the defensive reaction.

1 It will be recalled that Eichler (4) reported an
increase in the frequency of weighted shading re-
sponses under stress conditions. Our assumption 1
that high anxiety is associated with subjective, per-
sonalized, self-centered responses implies that high
anxious individuals will be so preoccupied with them-
selves in giving a response that they will tend to pay
less attention to the stimulus material than will less
self-centered, low anxious individuals. Since responding
to surface shading seems likely to require paying con-
siderable attention to the stimulus material, we pre-
dicted that low anxious individuals would yield pro-
portionally more Fc, cF, and c responses than high
anxious Ss. Like Eichler, however, we predicted the
opposite for responsiveness to diffuse shading (FK
and K).

The following hypotheses are relevant to 
the present study: 

1. Individuals with a strong anxiety drive
(HA Ss) will yield higher scores than Ss with
a low anxiety drive (LA Ss) on Rorschach in-
dices which can be presumed to measure sub-
jective response tendencies, because of evoca-
tion of more Ra responses.

2. HA Ss will yield more variable scores 
than LA Ss on Rorschach indices which can 
be presumed to measure response readiness, 
both because of evocation of more Ra re- 
sponses and of defenses against making such 
responses. 

3. The above differences between HA and
LA Ss will be gradually reduced as the test
progresses because of differential reinforce-
ment of Rat responses and a concomitant
extinction both of Ra responses and of defenses
against making such responses. It should be
stressed that it is not predicted that the indices
will become identical (and certainly not
reversed), since it seems unlikely that Rat
responses will be as strongly reinforced in this
situation as in the Kohs Block Design test
(8), objective success and observable im-
provement being absent from the Rorschach.
It is predicted, however, that other reinforce-
ments² will result in a significant increase in
the frequency of Rat responses and a cor-
responding decrease in the incidence of Ra
responses among HA Ss. Consequently, as
the test progresses, there should be a decrease
in the difference between HA and LA Ss in the
magnitude and variability of anxiety indices.

4. When a stimulus situation contains ele-
ments which specifically arouse achievement
anxiety, the increase in anxiety drive and
in defense against this drive should lead to a
greater disparity between HA and LA Ss
than when anxiety-arousing cues are mini-
mized.


2 Other possible sources of reinforcement which 
could reduce anxiety might be responses by the subject 
to subtle cues from the experimenter which seem to 
imply acceptance of his performance, or anticipated 
social approval because of his ability to produce some 
responses. 




METHOD 
Section I 


The test anxiety questionnaire described in pre- 
vious articles (11, 12) was administered to 366 students 
in an introductory psychology course at Yale College, 
and Ss with the highest and lowest anxiety scores were 
selected. This sample consisted of 40 Ss: 20 who scored
in the upper seventeen percentiles (designated as HA),
and 20 in the lower fifteen percentiles (LA) of the dis-
tribution. Each of these two groups was randomly
divided into two subgroups, one of which received “ego-
involving,” the other standard Rorschach instructions.
Approximately two months after the questionnaire was 
administered, these students were asked to participate 
in a psychological experiment. Apparently none of the 
Ss was aware of any connection between the question- 
naire and the experiment. 
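
The selection and assignment procedure described in this section can be sketched in Python as follows. This is only an illustration under stated assumptions: the text does not say how the 20 Ss were drawn from each tail of the distribution, so random sampling within the tail is assumed here, and the questionnaire scores, the seed, and the function name `assign_subjects` are invented for the example.

```python
import random

def assign_subjects(scores, upper_pct=17, lower_pct=15, per_group=20, seed=0):
    """Pick high- and low-anxious Ss from the tails and split each group in half.

    scores maps a student identifier to a questionnaire score.
    """
    rng = random.Random(seed)
    ranked = sorted(scores, key=scores.get)            # lowest score first
    n = len(ranked)
    low_pool = ranked[: n * lower_pct // 100]          # lower fifteen percentiles
    high_pool = ranked[n - n * upper_pct // 100:]      # upper seventeen percentiles

    design = {}
    for label, pool in (("HA", high_pool), ("LA", low_pool)):
        chosen = rng.sample(pool, per_group)           # random order, no repeats
        half = per_group // 2
        design[(label, "standard")] = chosen[:half]
        design[(label, "ego-involving")] = chosen[half:]
    return design

# Invented questionnaire scores for 366 students.
score_rng = random.Random(1)
scores = {student: score_rng.gauss(50, 10) for student in range(366)}
design = assign_subjects(scores)
for cell, members in sorted(design.items()):
    print(cell, len(members))
```

Each of the four cells ends up with 10 Ss, matching the two-by-two arrangement of anxiety level and type of instructions used in the analyses that follow.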


Section II


The experimenter (E) had no knowledge of the 
anxiety classifications, merely knowing the type of 
instructions he had to administer to each S. The in- 
structions were: 

a. Standard: After spending five minutes establish- 
ing rapport in a predetermined way, E gave each of this 
group of Ss the following instructions: “I'll tell you
what we are going to do. You know you can drop ink
on a sheet of paper, fold it, squeeze it, and when you
open it up get a series of differently shaped things in
such inkblots: some see this, some that. There are no
‘right’ or ‘wrong’ answers because we all see different
things. Well, I have got some cards made in exactly the
same way, and I am going to give them to you and ask
you to tell me what different things you see in them.
Remember, there are no ‘right’ or ‘wrong’ answers.”
In the inquiry S was asked to trace each response after
responding to the question: “What was it in the blot
that made you think of...?” (Verbatim reading of
response.)

b. “Ego-involved”: Under these conditions there
was no “rapport establishment” period, and E promptly
gave the following instructions in a curt and incisive
voice: “I want you to do a new kind of test, one which
is designed to measure the range of your imagination,
the quality of your intelligence. In essence, the aim of
this experiment is to see how skillful you are in handling
strange and novel material.” The remainder of the
instructions were as in the standard group except for
the inclusion of the statement, “what people see de-
pends upon their imagination, their creative capacity.”

The E presented the first four cards making no fur-
ther comments, and recorded the same data as under
standard conditions. After the fourth card, however,
he remarked: “Remember, this is a test of your imagi-
nation, of how creative you are. It is just because this
test is strange to you that you are in a position to [show]
your intelligence.” No further comments were made
until after the seventh card when the above [suggestions]
were repeated. From there on, the procedure was iden-
tical with that used under standard conditions.




Section III


When all tests had been administered, E scored the
records, still being unaware of the anxiety classifica-
tions. Scoring was after Klopfer and Kelley (7), with
these modifications:

a. Responses in which no form was intended were
given only the label of the appropriate determinant
(F, CF, K, etc.). Responses so treated were rocks,
clouds, fire, or sticks.

b. All other responses were given a plus or minus
form level rating on the basis of Hertz's (6) norms.³

c. Popularity ratings were made from a combina-
tion of Klopfer and Kelley’s (7) and Beck’s (1) lists of
P responses.

d. If more than one determinant was scored for a 
particular response, then a predetermined order was 
adhered to throughout: human or animal movement, 
color, surface or diffuse shading, and inanimate move- 
ment. 

Finally, 63 responses were selected for an independ- 
ent reliability check in such a way that one response 
came from each record and, on the average, six from 
each card. 


RESULTS 


The reliability check yielded complete 
agreement for scoring of main determinants 
in 59 of the 63 responses (93.6% agreement). 
In view of this high degree of compatibility 
and because of the necessarily arbitrary char- 
acter of any system of differential weighting 
of Secondary determinants, it was decided to 
restrict analysis to main determinants.⁴

3 The E made his own evaluation of form level
[before] referring to Hertz's norms, and his ratings agreed
with hers in all cases where she had listed the response.
[Number illegible] responses not tabulated in her charts were
[rated] independently by the writers, and 11 of their as-
sessments were identical.

4 The significance of the results seems to justify
this procedure. Further, the frequency of secondary
[illegible] K responses was computed, and the signifi-
cance of the difference between HA and LA Ss was vir-
tually unchanged (p = .40).

After homogeneity of variance was estab-
lished, the t test was used to determine the
significance of difference in mean numbers of
responses. Since, as is well known, the fre-
quency of determinants is positively cor-
related with the number of responses, all other
results were analyzed in the following way.
First, each S’s score on every determinant
was expressed as a percentage of his total
number of responses. However, when these
individual results were combined, it was
found that the distributions for most indices
were markedly skewed and that very few
individual scores reached 50 per cent, so it
was necessary to transform these individual
percentages by means of the angular trans-
formation table (5, pp. 13 and 56; 13, p. 316).
To determine the extent to which these trans-
formations had normalized the scores, the
cumulative percentage distributions of each
of the transformed variables were plotted upon
normal-probability paper. Since the results
closely approximated straight lines, it was
possible to apply t tests to calculate the
significance of the mean differences between
high and low anxious Ss on these indices.
One-tailed distributions were used only when
the mean difference was in the predicted
direction.
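
The transformation step just described can be illustrated with a short Python sketch. It assumes that the cited tables give the usual angular value, the arcsine of the square root of the proportion expressed in degrees, and that the t test pooled the two group variances; the percentages below are invented for illustration and are not the study's data.

```python
import math
from statistics import mean, variance

def angular(percent):
    """Angular (arcsine square-root) transformation of a percentage, in degrees."""
    return math.degrees(math.asin(math.sqrt(percent / 100.0)))

def pooled_t(group_a, group_b):
    """Student's t for two independent groups with a pooled variance estimate."""
    n_a, n_b = len(group_a), len(group_b)
    df = n_a + n_b - 2
    pooled = ((n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)) / df
    return (mean(group_a) - mean(group_b)) / math.sqrt(pooled * (1 / n_a + 1 / n_b)), df

# Hypothetical percentages of movement responses for a few Ss in each group.
ha_percent = [40.0, 35.0, 50.0, 30.0, 45.0]
la_percent = [25.0, 30.0, 20.0, 35.0, 15.0]

ha_scores = [angular(p) for p in ha_percent]
la_scores = [angular(p) for p in la_percent]
t_value, df = pooled_t(ha_scores, la_scores)
print(f"t = {t_value:.2f} on {df} df")
```

On this scale a score of 25 per cent becomes 30 and a score of 50 per cent becomes 45, so the transformation stretches the lower end of the range, where most of the individual percentages fell.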

The following predictions arise out of the
assumptions and hypotheses stated previously
and from Eichler’s (4) findings:

Prediction 1. From Eichler's data it is 
predicted that, under standard conditions, 
the high anxiety group (HA) will yield more 
card rejections, fewer total responses, fewer 
whole responses, fewer popular responses, 
and fewer weighted color responses than the 
low anxiety group (LA). 

Prediction 2 (Hypothesis 1). From assump- 
tions 1 and 2, it is predicted that, under 
standard conditions, the HA group will yield 
more subjective, personalized, self-centered 
responses than the LA group. More specifi- 
cally, relative to the LA Ss, HA individuals 
will yield more irrelevant verbalizations 
(defined operationally in terms of extraneous 
remarks and qualifying remarks said by the 
subject during the actual test); more move- 
ment (FM, M, and m) responses; more 
intended vague form responses; more poor 
form responses; more personal, bodily re- 
sponses (sexual and anatomical content); 
more diffuse shading responses (FK and K); 
more form secondary responses (K, C'F, C',
c, cF, CF, and C); fewer form primary re-
sponses (FK, FC', Fc, and FC); more repeti-
tive, perseverative responses (defined opera-
tionally as a response given more than twice
in one record); and finally, fewer surface
shading responses (Fc, cF, and c). The logic
of the derivation of the surface shading predic-
tion has already been discussed (see footnote
1), and the other expectations followed from
this self vs. card orientation.⁵

Prediction 3 (Hypothesis 2). From assump- 
tions 1, 2, and 3, it is predicted that, under 
standard conditions, HA Ss will display more 
variability in their records than will LA Ss. 
More specifically, relative to LA Ss, HA indi-
viduals will yield more extreme results on the
following indices: total number of responses;
number of locations used per card; reaction
times for each response; total time for each
card; and finally, in the way they respond to
the stimulus material (defined operationally
in terms of frequency of movement of cards).
All these predictions involved the assumption
that these indices are likely to be reliable indica-
tors of “response readiness” (see Hypothesis 2).

The validity of these predictions (and hence 
of these Rorschach indices) was tested by 
comparing the results of HA and LA Ss under 
standard conditions. However, as will be 
shown later, similar results were obtained 
under “ego-involving” conditions. Since type
of instructions did not have a significant effect
upon performance, it was decided to combine
the data from standard and “ego-involving”
conditions, and these results are presented in
Table 1 where the horizontal lines divide the
three groups of predictions. It must be em-
phasized that all results are in the same direc-
tion when analysis is restricted to comparison
of Ss who received standard or “ego-involving”
instructions.

The results relevant to Predictions 1, 2, and 
3 were: 

Prediction 1. Three of Eichler's predictions
(card rejections, weighted color, and total 
number of responses) are not borne out. The 
inverse relationships between level of anxiety 
and frequency of whole and popular responses 
confirm Eichler's findings. 

Prediction 2 is strongly supported: all ten
results are in the expected direction. Since
seven differences are statistically significant,
it seems reasonable to conclude that there is a
positive relationship between level of anxiety
and frequency of evocation of subjective,
personalized, self-centered response tendencies
on the Rorschach.

5 Some of the intermediary steps between the as-
sumptions and these specific predictions have been
omitted. This was because detailed consideration of
each prediction would have involved problems beyond
the scope of an article of this kind. However, in a later
publication, one of the writers (S.B.S.) intends to dis-
cuss the intermediary steps.


TABLE 1
COMPARISON OF HIGH AND LOW ANXIOUS SUBJECTS
UNDER STANDARD AND “EGO-INVOLVING” CONDITIONS

                                       HIGH ANXIETY     LOW ANXIETY
                                       GROUP (N = 20)   GROUP (N = 20)
                                       MEAN OF          MEAN OF
                                       TRANSFORMED      TRANSFORMED
VARIABLE                               PERCENTAGES      PERCENTAGES      p
--------------------------------------------------------------------------------
No. of card rejections                      1.30             2.40        <.50
No. of W responses                         19.60            27.80        [illegible]
No. of cut-off W responses                 23.28            26.50        .10
No. of W and cut-off W responses           32.24            42.68        [illegible]
No. of P responses                         34.20            39.70        .02
No. of weighted color responses            14.40            15.10        <.50
--------------------------------------------------------------------------------
No. of irrelevant verbalizations           32.84            31.42        .30
No. of FM, M, and m responses              39.42            31.13        [illegible]
No. of “no form” responses                  7.90             2.14        .02
No. of poor form responses                 18.70            10.50        [illegible]
No. of form primary responses              17.12            22.84        .02
No. of form secondary responses            18.68            14.40        .05
No. of sex and anat. responses             15.52            14.20        .30
No. of FK and K responses                  10.32             8.94        .30
No. of perseverative responses             10.52             6.00        [illegible]
No. of Fc, cF, and c responses             10.32            15.94        [illegible]
--------------------------------------------------------------------------------
No. of cards given only one location       32.42            31.48        [illegible]
No. of cards given more than three
  locations                                12.82             8.24        [illegible]
Reaction times under 5 seconds             43.68            38.34        [illegible]
Reaction times over 15 seconds             17.52             6.48        [illegible]
Total times under 25 seconds               32.58            31.16        [illegible]
Total times over 75 seconds                12.98             9.62        [illegible]
No. of times moved cards                   29.52            28.12        [illegible]

Note. The respective means and standard deviations [of total
number of responses] were: HA: Mean 25.6 responses, SD 11.9
responses; LA: Mean [illegible] responses, SD 13.3 responses.
Neither the difference between means nor the difference between
deviations is significant (p > [illegible]).

Prediction 3 is also well supported by the
data. With but one exception (variability in
number of responses) the results are in the
predicted direction. Since three of the seven
differences are statistically significant, and
one other difference would occur only ten
times in a hundred by chance, it can be con-
cluded that there is a positive relationship
between level of anxiety and variability of
performance on certain Rorschach indices.
The evidence is also consistent with the as-
sumption that this variability in performance
can be deduced from a postulated positive
relationship between level of anxiety and
strength of defense against this anxiety when
the need for defense is evoked.

Two additional predictions were made: 

Prediction 4 (Hypothesis 3). From assump- 
tions 1, 2, and 3, it is predicted that, under 
standard conditions, the differences between
HA and LA Ss cited in Predictions 1, 2, and 3
will be significant for responses to the first
four Rorschach cards, but these differences
are expected to be reduced on the last six
cards.

This prediction was tested by comparing 
performances of HA and LA Ss on these two 
groups of cards in the manner described above. 
The results can be summarized as follows: 

a. There were insufficient cases to consider
results on four indices: no form intended
responses; number of cards given more than
three locations; reaction times over 15
seconds; and number of card rejections.

b. If results from all other indices are con-
sidered, then 15 of the 19 are in the expected
direction.

c. If analysis is confined to those indices
that did discriminate significantly between
the groups in the validity study, then all ten
are in the predicted direction.

d. While eight of the changes were as pre-
dicted, from a significant to a nonsignificant
difference, in no case was this change itself
statistically reliable.

Prediction 5 (Hypothesis 4). From assump-
tions 1, 2, and 3, it is predicted that, under
“ego-involving” conditions, the differences
between HA and LA Ss cited in Predictions
1, 2, and 3 should be significantly increased.

This prediction was tested by comparing
the results of the four subgroups on all indices.
Since inspection of these data showed that,
in comparison with level of anxiety, the type
of instructions had relatively little effect
upon performance, it was not necessary to
analyze results for all indices. While selection
of particular indices is necessarily arbitrary,
the most obvious choice is to take two extreme
cases: one in which inspection of the data
indicates the instructions apparently had
little effect, and one in which performance
seems to have varied under different conditions
of administration. Two other criteria were
used in our selection of indices: an insistence
on high scores so that the role of chance
fluctuations would be minimized, and a
demand that the indices discriminate signifi-
cantly between the two groups under standard
conditions; that is, they had to be valid
measures.

The two anxiety indicators selected were 
number of whole and cut-off whole responses, 
and number of movement (FM, M, and m) 
responses. Since the distributions of individual 
percentage scores on these indices were skewed 
and since very few individual scores reached 
50 per cent, it was again necessary to transform 
these percentages in the manner described 
above. Analysis of variance of these data 
revealed that in neither case was there a 
significant interaction between level of anxiety 
and type of instructions (with W and cut-off W
responses F was 3.14 with 1 df, and with 
FM, M, and m responses F was 0.29 with 
1 df). Consequently, it seems reasonable to 
conclude that the type of instructions did not 
have a significant effect upon performance. 
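
The interaction test reported above can be sketched for a balanced two-by-two design as follows. This Python fragment is an illustration only: it assumes equal numbers of Ss in every cell, the scores are invented, and the function name `interaction_f` is hypothetical rather than anything used by the authors.

```python
from statistics import mean

def interaction_f(cells):
    """F ratio for the A x B interaction in a balanced two-way design.

    cells maps (anxiety level, instruction type) to a list of scores;
    every cell must hold the same number of scores.
    """
    a_levels = sorted({a for a, _ in cells})
    b_levels = sorted({b for _, b in cells})
    n = len(next(iter(cells.values())))
    if any(len(scores) != n for scores in cells.values()):
        raise ValueError("balanced design required")

    cell_mean = {key: mean(scores) for key, scores in cells.items()}
    a_mean = {a: mean([cell_mean[(a, b)] for b in b_levels]) for a in a_levels}
    b_mean = {b: mean([cell_mean[(a, b)] for a in a_levels]) for b in b_levels}
    grand = mean(list(cell_mean.values()))

    ss_inter = n * sum(
        (cell_mean[(a, b)] - a_mean[a] - b_mean[b] + grand) ** 2
        for a in a_levels for b in b_levels
    )
    ss_within = sum(
        (x - cell_mean[key]) ** 2 for key, scores in cells.items() for x in scores
    )
    df_inter = (len(a_levels) - 1) * (len(b_levels) - 1)
    df_within = len(cells) * (n - 1)
    return (ss_inter / df_inter) / (ss_within / df_within), df_inter, df_within

# Invented transformed scores, two Ss per cell, purely for illustration.
example = {
    ("HA", "standard"): [1.0, 3.0],
    ("HA", "ego-involving"): [1.0, 3.0],
    ("LA", "standard"): [1.0, 3.0],
    ("LA", "ego-involving"): [5.0, 7.0],
}
f_ratio, df_num, df_den = interaction_f(example)
print(f"F = {f_ratio:.2f} with {df_num} and {df_den} df")
```

With 10 Ss per cell, as the halving of each group of 20 implies, the error term would carry 36 df, and an F of 3.14 falls short of the .05 critical value of roughly 4.1.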


DISCUSSION 


From the viewpoint of the efficacy of these 
Rorschach anxiety signs, both the known 
reliability and validity of the questionnaire 
(8, 10, 11, 12) and the internal consistency of 
our data afford encouraging evidence of the 
diagnostic efficiency of certain indices. Of 
particular interest are two controversial 
findings: the megative relationship between 
level of anxiety and frequency of surface 
shading responses (Fc, cF, and c), and the 
form secondary results. The former finding is 


' congruent with our analysis (see footnote 1) 


and at variance with Eichler's data. While 
further research is clearly indicated, it is 
interesting to note that, while 16 of the 20 
LA Ss obtained higher Fe scores than the 
median percentage, only four of the HA Ss 
did, and a chi-squáre test (corrected for 
continuity) yields p values between .01 and 
.02. Exactly the same number (16 vs. 4) of 




Ss scored above and below the median per- 
centage with the poor form index, although 
here the direction of the difference was re-
versed. Other measures yielding results which
suggest that they may be applicable to in- 
dividual diagnosis were number of whole and 
cut-off whole responses, number of movement 
(FM, M, and m) responses, and proportions 
of reaction times under five seconds; in all of 
these cases 15 of the HA, and only 5 of the LA 
Ss deviated from the median percentage in the 
direction predicted for the HA group. In all 
these instances a chi-square test, corrected for 
continuity, gave p values between .02 and .05. 
While the group results for certain other in- 
dices yielded significant differences (see Table 
1), more overlapping in individual scores re- 
duced the clinical significance of these findings. 
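The p ranges quoted above can be checked with a short calculation. The text does not state how the chi-square tests were set up, so the sketch below rests on an assumption: that each observed split within one group of 20 Ss (16 vs. 4, or 15 vs. 5) was compared with an even 10:10 split, using the continuity correction and 1 df. The function names are illustrative only.

```python
import math

def chi_square_even_split(above, below):
    """Continuity-corrected chi-square for an observed above/below split,
    tested against an even split (an assumed reading of the text)."""
    expected = (above + below) / 2.0
    return sum((abs(obs - expected) - 0.5) ** 2 / expected
               for obs in (above, below))

def p_value_1df(chi2):
    """Upper-tail probability of a chi-square variate with 1 df."""
    return math.erfc(math.sqrt(chi2 / 2.0))

for split in [(16, 4), (15, 5)]:
    chi2 = chi_square_even_split(*split)
    print(split, round(chi2, 2), round(p_value_1df(chi2), 3))
```

Under this assumption the 16 vs. 4 split gives a chi-square of 6.05 (p about .014) and the 15 vs. 5 split gives 4.05 (p about .044), which fall inside the ranges reported in the text. Other set-ups are possible, so the agreement is suggestive rather than conclusive.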

The other controversial finding was ob- 
tained with the form secondary (K, C'F, C’, 
CF, c, C, and CF) index. While the group 
results with this measure were significant, if 
Wittenborn's (14) formula is used, precisely 
the opposite result is obtained (p = .002, 
with a two-tailed distribution). The discrep- 
ancy between these results is due to the great 
disparity in the number of whole responses 
given by our HA and LA Ss. Further work is 
needed before conclusive evidence can be 
obtained as to the relative diagnostic efficacy 
of these two indices. 

The following picture of the effects of a 
specific anxious experience upon functioning 
seems to emerge: self-oriented or self-centered 
responses are evoked, awareness of the nature 
of the objective stimulus is impaired, and 
effectiveness of responsiveness to the external 
stimulus is reduced. 

Apart from supporting the assumptions, the 
results have once more demonstrated the 
precision which can be obtained in personality 
experimentation if preselected groups of Ss 


are used. The present writers are convinced


that attempts to vary motivation in randomly 
chosen Ss not only yield less precise data, 
but also, that such results are extremely 
difficult to interpret. For instance it is highly 
improbable that all Ss respond in the same 
way to situational pressures and, more signifi- 
cantly, it is virtually impossible, at least at 
the present time, to obtain any accurate 
estimate of the degree of efficacy of such 
experimentally induced states. If “ego- 


FRANCIS N. COX AND SEYMOUR B. SARASON


involved" instructions are given to randomly 

chosen Ss, and if, as is quite likely, both high
and low anxious individuals are selected, then
their different reactions to such instructions
(10) may easily result in a “cancelling out”
of a psychologically significant difference.
Finally, if induced motivations are used in
attempts to test the validity of projective
tests (4), it is only too apparent that the
conditions created can be so different from
those under which clinical tests are normally
administered that considerable doubt is cast
upon the generality of such experimental
findings.

SUMMARY


1. This study was designed to test the 
validity of certain Rorschach anxiety indices 
which have been shown (4) to reflect the 
effects of exposure to a stressful situation, and 
to examine the generality of certain hypotheses 
derived from a theoretical framework. 

2. Two groups of 20 high and 20 low anxious 
college students were selected by means of a
validated and reliable (8, 10, 11, 12) test
anxiety questionnaire. Half of the subjects
from each anxiety group were given the
Rorschach test under standard conditions,
and half were given allegedly “ego-involving”
instructions.

3. The results provide encouraging evidence 
as to the validity of certain anxiety indicators 
and support the theoretical assumptions from
which most of them were derived. The data
are also consistent with a hypothesis about
the effects of “learning” (adaptation to the
test situation) on Rorschach performance,
but these results lack statistical reliability.
Similarly, although there are some differences
in performance under standard and allegedly
“ego-involving” conditions, these results are
not significant.

4. Finally, it seems reasonable to argue 
that these data are sufficiently consistent to
justify further research on the Rorschach
and other projective tests with preselected
groups of high and low anxious subjects. 




REFERENCES 


1. Beck, S. J. Rorschach’s test. Vol. I. Basic processes.
New York: Grune & Stratton, 1944.

2. Benton, A. L. The experimental validation of the
Rorschach test. Brit. J. med. Psychol, 1 
22, 45-58. 


TEST ANXIETY AND RORSCHACH PERFORMANCE


3. Cronbach, L. J. Statistical methods applied to
Rorschach scores: a review. Psychol. Bull., 1949,
46, 393-429.

4. Eichler, R. M. Experimental stress and alleged
Rorschach indices of anxiety. J. abnorm. soc.
Psychol., 1951, 46, 344-355.

5. Fisher, R. A., & Yates, F. Statistical tables for
biological, agricultural and medical research. (3rd
Ed.) New York: Hafner, 1948.

6. Hertz, Marguerite R. Frequency tables for scoring
responses to the Rorschach inkblot test. (3rd Ed.)
Cleveland, Ohio: Western Reserve Univer.
Press, 1951.

7. Klopfer, B., & Kelley, D. M. The Rorschach
technique. Yonkers: World Book Co., 1942.

8. Mandler, G., & Sarason, S. B. A study of anxiety
and learning. J. abnorm. soc. Psychol., 1952, 47,
161-173.




9. Sarason, S. B. The test situation and the problem 
of prediction. J. clin. Psychol., 1950, 6, 387-392. 

10. Sarason, S. B., Mandler, G., & Craighill, P.
G. The effect of differential instructions on
anxiety and learning. J. abnorm. soc. Psychol.,
1952, 47, 561-565.

11. Sarason, S. B., & Mandler, G. Some correlates
of test anxiety. J. abnorm. soc. Psychol., 1952,
47, 810-817.

12. Sarason, S. B., & Gordon, E. M. The test anxiety
questionnaire: scoring norms. J. abnorm. soc.
Psychol., 1953, 48, 447-448.

13. Snedecor, G. W. Statistical methods. (4th Ed.)
Ames, Ia.: Iowa State Coll. Press, 1946.

14. Wittenborn, J. R., & Mettler, F. A. A lack of
perceptual control score for the Rorschach test.
J. clin. Psychol., 1951, 7, 331-334.


Received April 2, 1953. 


RELIGIOUS BELIEFS OF CATHOLIC COLLEGE STUDENTS AND
THEIR ATTITUDES TOWARD MINORITIES


CHARLES T. O'REILLY 
University of Notre Dame 


The relationship between religion and
prejudice is not clear. Some investiga-
tors, among them Rosenblith (11), Turbeville
and Hyde (14), Brown and Lowe (3),
and Gough (7), have reported that people who 
acknowledge church membership tend to be 
less tolerant than those who do not. Other in- 
vestigators like Parry (10), Harlan (8), Bet- 
telheim and Janowitz (2), and others have 
found either slight differences or no differences 
between the attitudes of members of different 
religions. Most studies can only relate attitudes 
to church membership because they only secure 
information about the subject's religious pref- 
erence, and by itself this tells us nothing about 
the subject's “religiosity.” 

Evans (4) found that religion played an im-
portant role in the lives of those who fell into
the high or the low quartile of the Levinson and
Sanford (9) scale for the measurement of
anti-Semitism. His suggested explanation for
this was that the low in prejudice adhere to a
humanitarian, positive religion, while the high
in prejudice follow a narrow religion which
tends to foster ethnocentrism. This explanation
may do something to clarify the relationship
between religion and prejudice and help to
explain why people who admit church member-
ship often reveal themselves as prejudiced,
thus calling into question the traditional con-
ception of religion as fostering values like
brotherhood and tolerance.

The study reported here does not attempt to 
distinguish between the humanitarian and the 
narrow types of religion. It is concerned with 
the attitudes toward Jews and Negroes held 
by Catholic college students, and the relation- 
ship between these attitudes and their attitudes 
toward certain teachings of the Catholic 
Church. Important differences were found in 
the religious beliefs of people who acknowl- 
edged membership in the Catholic Church, as 
well as the fact that these differences in beliefs 
were related to differences in attitudes toward 
minorities. It follows, then, that the mere 
statement of church preference is not enough 
in order to study the relationship between re-
ligion and attitude.




AND 


EDWARD J. O’REILLY 
DePaul University 


METHOD 


As part of an investigation of the attitudes of Catho- 
lic students, students in Catholic colleges in three sec- 
tions of the United States answered a questionnaire 
which contained, among other things, the 52-item 
Levinson and Sanford (9) anti-Semitism scale; a 16- 
item scale to measure attitude toward the Negro; and 
a 15-item scale to measure the extent to which the 
subjects (Ss) agreed with the Catholic Church on 
certain social, moral, and religious questions. This
religion scale was adapted from one developed by 
Fichter (6). Material obtained from two of the groups 
studied is reported here. 

The M group included 92 white males attending a
Catholic, liberal arts college for men in a large southern 
city. All these Ss were seniors. The O group included 
120 white females attending a Catholic, liberal arts 
college for women in the same city. They were almost 
evenly divided into juniors and seniors. 

Two men did not answer questions about attendance 
at Mass and about reception of Holy Communion dur- 
ing the Easter season, which were designed to deter- 
mine whether the Ss were practicing Catholics. One 
woman answered these questions negatively. All Ss,
however, identified themselves as Catholics, and, with 
the above exceptions, all of them regularly attended 
church services. 

The anti-Semitism scale had a split-half reliability 
of .92 raised to .96 by the Spearman-Brown formula. 
The anti-Negro scale had a split-half reliability of .93
raised to .96, and the religion scale a reliability of .90
raised to .95. When tested with another group, the
religion scale had a correlation of .60 with the Thur-
stone and Chave (13) scale for measuring attitude
toward the church. The correlation between the anti-
Semitism and anti-Negro scale was .69 for the women
and .68 for the men; and the correlation between the 
anti-Semitism and religion scale was .39 for the women 
and .31 for the men. 
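The stepped-up reliabilities quoted above follow from the Spearman-Brown formula for a test of doubled length, r' = 2r / (1 + r). A minimal sketch of the arithmetic is given here; the function name and the `length_factor` parameter are illustrative and are not taken from the original report.

```python
def spearman_brown(split_half_r, length_factor=2.0):
    """Estimate the reliability of a test `length_factor` times as long
    as the part on which `split_half_r` was computed."""
    return (length_factor * split_half_r
            / (1.0 + (length_factor - 1.0) * split_half_r))

for scale, r in [("anti-Semitism", 0.92), ("anti-Negro", 0.93), ("religion", 0.90)]:
    print(scale, round(spearman_brown(r), 2))
```

Split-half values of .92, .93, and .90 come out as .96, .96, and .95, matching the corrected figures reported for the three scales.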


RESULTS


Mean scale scores, etc. are shown in Table
1. The Ss were divided into those whose scores
fell into the high and the low halves of the
religion scale. Seventy-one per cent of those
scoring high on the religion scale also were in
the high scoring half of the anti-Negro scale;
while 71 per cent of those who scored low on
the religion scale also scored low on the anti-
Negro scale (χ² = 39.92; p < .01). Sixty per
cent of those scoring high on the religion scale
also scored high on the anti-Semitism scale;
while 61 per cent of those scoring low on the


religion scale also scored low on the anti-Semi-
tism scale (χ² = 9.99; p < .01).


TABLE 1
MEAN ANTI-SEMITISM, ANTI-NEGRO, AND
RELIGION SCORES, AND SD’s, O AND
M GROUPS*

               ANTI-SEMITISM     ANTI-NEGRO       RELIGION
GROUP    N     MEAN      SD      MEAN    SD      MEAN    SD
O       120   143.12    44.8     49.3   21.3     41.9   12.9
M        92   161.57    45.4     57.9   20.5     48.5   13.6

* All statements in each scale were negative. Com-
plete agreement with a statement indicated an unfavor-
able attitude toward a minority or toward the teachings
of the church, and received a score of 7.0. Complete
disagreement with a statement indicated a favorable
attitude and received a score of 1.0. Hence, low scores
indicate a generally favorable or unprejudiced attitude
toward a minority, or a favorable attitude toward the
Church, and vice versa. Possible range in the anti-
Semitism scale was 52-364. The O range, 56-278; M
range, 64-281. The possible anti-Negro range was
16-112. The O range, 16-107; M range, 16-95. The
possible religion scale range was 15-105. The O range,
18-78; M range, 15-86.


Nearly all the Ss said that religion had either
a great or a moderate influence upon their up-
bringing. In an Allport and Kramer (1) study,
28 per cent, and in Rosenblith’s (11) study, 34
per cent of the Ss said that religion had a very
marked influence upon their upbringing. The
“very marked” category in these two studies
corresponds to the “great” category in the
present study, which accounted for 74.5 per
cent of the Ss. The higher percentage of Catho-
lics reporting a great or very marked influence
of religion upon their upbringing seems to
confirm the findings of Ferguson (5), Thurstone
and Chave (12), and others who found that
Catholics were more influenced by religion than
most Protestants and Jews.

When Allport and Kramer combined the A
and B categories found in Table 2, they found
that a significantly (p < .01) larger number of
those in the combined group fell into the more
prejudiced group. Rosenblith did not find such
a tendency in her study, but she did find
that the number who reported no religious in-
fluence was significantly more in the less prej-
udiced, or lower scoring half in regard to anti-
Semitism. Only about 2 per cent of the Ss in
the present study fell into the C and D cate-
gories, and no such tendency was observable.


TABLE 2
PERCENTAGES OF SUBJECTS, VARIOUS STUDIES,
BY DEGREES OF INFLUENCE OF
RELIGION ON UPBRINGING

DEGREE OF          ALLPORT &       ROSEN-        O AND
INFLUENCE          KRAMER (1)      BLITH (11)    M
A. Very marked       28.0            34.0         74.5
B. Moderate          41.0            47.0         23.1
C. Slight            21.0            13.0          1.8
D. None              10.0             5.0           .6


Religion had greatly influenced 74.5 per cent


of the Ss, and 47 per cent of these people fell 
into the high scoring half of the religion scale, 
while 59 per cent of those who said that they 
were moderately influenced by religion fell into 
the high scoring half of the religion scale (χ² =
2.28; p > .05). Those who reported that re-
ligion had greatly influenced their upbringing
had 53 per cent of their number in the more 
prejudiced half of the anti-Negro scale and 52.5 
per cent in the more prejudiced half of the 
anti-Semitism scale (not significant). 

When asked whether Negro Catholics should 
have their own parishes, 46 per cent of the 
men and 56 per cent of the women opposed 
segregation. Those who believed that Ne- 
groes should have their own parishes had 76.5 
per cent of their number in the high scoring 
half of the religion scale; while those who were 
against segregation had 74.5 per cent of their 
number in the low scoring half of the religion 
scale (χ² = 55.09; p < .01). Those favoring
segregation had 66.6 per cent of their number 
in the more prejudiced half of the anti-Negro 
scale; while 65.4 per cent of those opposing 
segregation were in the less prejudiced half of 
the anti-Negro scale (χ² = 21.82; p < .01).


SUMMARY AND CONCLUSIONS 


The anti-Semitism and anti-Negro scores of 
92 white, Catholic, male college students and 
120 white, Catholic, female college students, 
who were attending Catholic colleges in a 
southern city, were related to their scores on a 
religion scale. The Ss’ estimation of the in- 
fluence of religion upon their upbringing was 
not significantly related to scores on the re- 
ligion scale, or to anti-Semitism and anti- 


1 In two middle western groups, 82 per cent of the
men and 69 per cent of the women opposed segregation 
in parishes, indicating a possible regional difference. 




Negro scores. There was a very significant 
tendency, however, for those scoring high on the 
religion scale to be less favorable toward Jews 
and Negroes and to favor the segregation of 
Negroes in their own parishes, while those who 
scored low on the religion scale were signifi- 
cantly less prejudiced and were opposed to 
segregation. 

In view of the differences in religious atti- 
tudes among these students, all of whom listed 
themselves as Catholics, and because of the 
relationship between their religious attitudes 
and their attitudes toward minorities, it is 
evident that a mere statement of religious 
preference or church membership is not an 
adequate criterion to use when investigating 
attitudes toward minorities. 


REFERENCES 


1. Allport, G. W., & Kramer, B. M. Some roots of
prejudice. J. Psychol., 1946, 22, 9-39.

2. Bettelheim, B., & Janowitz, M. Dynamics of
prejudice. New York: Harper, 1950.

3. Brown, D. C., & Lowe, W. L. Religious beliefs
and personality characteristics of college stu-
dents. J. soc. Psychol., 1951, 33, 103-129.

4. Evans, R. I. Personal values as factors in anti-
Semitism. J. abnorm. soc. Psychol., 1952, 47,
749-756.

5. Ferguson, L. W. Socio-psychological correlates of
the primary attitude scales: I. Religionism. J.
soc. Psychol., 1944, 19, 81-98.

6. Fichter, J. H. Southern parish. Vol. I. Dynamics of
a city parish. Chicago: Univer. of Chicago
Press, 1951.

7. Gough, H. G. Studies in social intolerance: IV.
Related social attitudes. J. soc. Psychol., 1951,
33, 263-269.

8. Harlan, H. H. Some factors affecting attitude
toward Jews. Amer. sociol. Rev., 1942, 7, 816-821.

9. Levinson, D. J., & Sanford, R. N. A scale for
the measurement of anti-Semitism. J. Psychol.,
1944, 17, 339-370.

10. Parry, H. J. Protestants, Catholics and prejudice.
Int. J. Opin. Attit. Res., 1949, 8, 205-213.

11. Rosenblith, Judy F. A replication of “Some roots
of prejudice.” J. abnorm. soc. Psychol., 1949, 44,
470-489.

12. Thurstone, L. L., & Chave, E. J. The measure-
ment of attitude. Chicago: Univer. of Chicago
Press, 1929.

13. Thurstone, L. L., & Chave, E. J. Instructions for
using the attitude scale. Chicago: Univer. of
Chicago Press, 1930.

14. Turbeville, G., & Hyde, R. E. A selected sample
of attitudes of Louisiana State University stu-
dents toward the Negro: A study in public
opinion. Soc. Forces, 1946, 24, 447-450.


Received April 11, 1953. 


ASSUMED SIMILARITY MEASURES AS PREDICTORS OF
TEAM EFFECTIVENESS¹


FRED E. FIEDLER 
University of Illinois 


Small groups of individuals, working as
teams, committees, or crews, conduct
the overwhelming proportion of the
nation’s civic, industrial, and military business.


The effectiveness with which these groups 
function is thus of practical as well as of 
theoretical concern in our society. 

This paper reports a series of related in- 
vestigations which deal with interpersonal rela- 
tionships within a team as correlates of the 
team’s total effectiveness. 

Certain aspects of interpersonal perception 
have been investigated in previously published 
studies on therapeutic relationships (3) and 
interpersonal relations in a social group (5). 
These studies present techniques for obtaining 
the so-called “Assumed Similarity (AS) 
Scores,” which appear to be correlates of 
liking and warmth in interpersonal relation- 
ships. The present series of investigations 
relates these interpersonal perception vari-
ables to team effectiveness.

A first, frankly exploratory study used 14 
high school basketball teams. The most 


promising measures which emerged from this
study were then validated on a second group
of 12 high school basketball teams (6). The
hypotheses derived from both studies were
then tested on teams which differed markedly
from the first group, namely, three- and four-
man surveying parties (4).


THE FIRST BASKETBALL STUDY


High school basketball squads are composed
of from 9 to 18 players. These are chosen by
the coach from a larger pool of interested
students competing for places on the first team.
This study was conducted in the Midwest,


1 The research reported here was carried out under
Contract N6ori-07135 between the University of Illi-
nois and the Office of Naval Research. The investiga-
tions were conducted by a group in which several
members of the project staff played a part during
various phases. In particular, the writer wishes to
express his appreciation to Dr. L. J. Cronbach, then
Director of the project, and to Messrs. Walter Hart-
mann and S. A. Rudin, Mrs. Mary E. Ehart, Miss Vivian
McCraven, and Dr. W. G. Warrington.




where basketball is of considerable importance 
to high school players and coaches and where 
a large number of teams can be found. 


The Instrument 


The tests used in the basketball study were forced- 
choice questionnaires. These questionnaires consisted 
of 100 descriptive statements grouped into 20 blocks 
of five statements each. An attempt was made to 
construct the blocks so that, in the main, statements 
within each block would be equally acceptable to the 
subjects (Ss), but descriptive of different personality 
dimensions. 

One block of statements is given as an example. 
                                            Most   Least
1a. I find it easy to understand what
    others are trying to tell me ........
 b. People think I am a hard worker .....
 c. I don’t mind losing my temper
    when provoked .......................
 d. I like people who don’t worry about


In a self-description S would answer these statements 
the left square opposite the state- 
ment which S considered most characteristic of him- 


well will describe himself.’ 3 

In addition to these interpersonal perception tests, 
players named the three team members—not neces- 
sarily the best players—with whom they could co- 
operate best, and the three with whom they could 


cooperate least well during games.²


Interpersonal Perception Scores 

Conventional tests are scored by comparing S's 
response with the “right” response of a key. Our
scores are obtained by comparing two questionnaires 
of the same person. This comparison yields the so- 


2 This paper discusses only procedures pertinent to 
the present hypothesis. A detailed account of the 
studies is presented in (2, 6, 7, 9). 




TABLE 1
CORRELATIONS BETWEEN THE DEC. 31, 1951 CRITERION
AND INTERPERSONAL PERCEPTION SCORES IN 14
BASKETBALL TEAMS

                      CORRELATION (rho)
             MEDIAN SCORE IN       SCORE OF MOST PREFERRED
MEASURE      TEAM AND CRITERION    CO-WORKER AND CRITERION
ASp                .25                    -.63*
ASo               -.03                    -.69**

* p < .05 according to Olds's tables (11).
** p < .01 according to Olds's tables (11).
The large number of exploratory tests run on this first sample
does not allow interpretation of significance levels. These are
here given only as a point of reference.


called “Assumed Similarity” measures listed imme- 
diately below. Tentative interpretations are provided 
which are based in part on evidence from previous 
studies. 


Assumed Similarity Scores 


1. ASp—a measure of assumed similarity obtained 
by comparing (correlating) the S's self-description 
with his prediction of the self-description of his positive 
choice, the best co-worker. High ASp appears to be 
related to personal liking and perhaps warmth for the 
chosen person (3, 5). 

2. ASn—a measure obtained by comparing the S's 
self-description with his prediction of his negative 
choice. A high ASn score may, on the basis of the inter- 
pretation above, indicate a feeling of personal closeness 
and warmth for the negative choice. 

3. ASo—a measure obtained by comparing .S’s 
prediction for his positive choice with his prediction 
for his negative choice. This measure is interpreted as 
“set” to differentiate between people. Since ASn and 
ASo are highly correlated, we will concern ourselves 
here only with the scores ASp and ASo. 
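The three scores defined above can be illustrated with a short computation. The sketch below is a minimal example under stated assumptions: each questionnaire is reduced to a list of numeric item scores, the correlational form of AS is taken to be a Pearson r, and the D form is taken to be the root of the summed squared item differences. The data and all function names are hypothetical.

```python
import math

def pearson_r(x, y):
    """Product-moment correlation between two equally long score lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def d_score(x, y):
    """Distance between two descriptions (assumed form of the D statistic).
    Small D corresponds to high assumed similarity."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

def assumed_similarity(self_desc, pred_positive, pred_negative, measure=pearson_r):
    """Return ASp, ASn, and ASo for one S from his three questionnaires."""
    return {
        "ASp": measure(self_desc, pred_positive),      # self vs. preferred co-worker
        "ASn": measure(self_desc, pred_negative),      # self vs. rejected co-worker
        "ASo": measure(pred_positive, pred_negative),  # preferred vs. rejected
    }

# Hypothetical item scores for one S
self_desc = [5, 3, 4, 1, 2, 4]
pred_positive = [5, 2, 4, 1, 3, 4]
pred_negative = [1, 4, 2, 5, 3, 2]

print(assumed_similarity(self_desc, pred_positive, pred_negative))
print(assumed_similarity(self_desc, pred_positive, pred_negative, measure=d_score))
```

Passing `d_score` as the measure reverses the direction of the scale, so a high correlation and a small D both indicate high assumed similarity.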


The Criterion 

Group effectiveness was measured by propor- 
tion of games a team had won. The date 
used here was December 31, 1951,⁴ before


3 AS scores can be computed as correlations or as 
difference scores. To avoid possible confusion we will 
in this paper speak of high AS as meaning high as- 
sumed similarity, i.e., a high correlation between either 
the self and the predicted self of another person, or 
between the predicted self-descriptions of two other 
persons. In terms of D scores, this would imply small 
differences between the two descriptions on which AS
is based.

4 The original report (5) deals with December 15 as 
the criterion date. Since some teams had at that time 
played fewer than 4 games, the December 31 date 
appears to be a better criterion estimate. 




many changes in team personnel had taken 
place and by which time each team had 
played from 8 to 12 games.

In general, small schools are handicapped
by having relatively few eligible students.
However, teams generally compete with
other neighboring schools of comparable size,
thus equalizing some of the differences which
would favor large schools. The criterion
reliability was estimated by comparing the
proportion of games a team had won during
the first and second halves of the season. At
this time it was possible only to estimate
reliability for the second sample which was
tested with end-of-season criteria. The cor-
rected criterion reliability estimate for these
12 teams was .88. The corresponding reliability 
for the December 31 date could not be com- 
puted because too few games had been played, 
but it is undoubtedly lower (7). 


Relation of Assumed Similarity Measures to 
Basketball Criteria 


Our hypothesis states that interpersonal 
perception scores are related to the proportion 
of games a team wins. We tested whether 
team members of an effective team will, on
the average, perceive each other differently
from members of ineffective teams. Here we
correlated the team median of the AS scores
with the criterion. As can be seen from the
second column in Table 1, correlations between
the criterion and median scores are generally
near zero.

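The team-level analysis described above can be sketched as follows. The example takes the median AS score within each team, expresses the criterion as the proportion of games won, and computes a rank-order correlation (rho) across teams. The team data are hypothetical, the handling of ties by average ranks is an assumption, and the function names are illustrative.

```python
import math
from statistics import median

def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2.0 + 1.0
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks

def spearman_rho(x, y):
    """Rank-order correlation: a product-moment r computed on the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical teams: (members' AS scores, games won, games played)
teams = [
    ([0.40, 0.26, 0.10, 0.36], 9, 10),
    ([0.15, 0.30, 0.20], 6, 12),
    ([0.05, 0.45, 0.50, 0.20, 0.30], 3, 9),
    ([0.10, 0.12, 0.33], 4, 8),
]
team_medians = [median(scores) for scores, _, _ in teams]
win_proportions = [won / played for _, won, played in teams]
print(round(spearman_rho(team_medians, win_proportions), 2))
```

The same function applies to the second analysis: the list of team medians is simply replaced by the AS score of each team's most preferred co-worker.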
The group may, however, also express its 
attitude by the type of person whom most 
members of a team choose as their best co- 
worker. In order to get at this attitude we 
considered only the AS scores of those mem-
bers of the various teams which had received
the greatest number of “best co-worker”
votes. As can be seen in Table 1, this procedure
suggests that the AS scores of the teams’
“most preferred co-worker” correlate with
the criterion in the negative direction.

In previous studies, high ASp seemed to be
related to warm, empathic interpersonal
relationships. We expected to find such
relationships, and hence also high AS, to be
prevalent in effective teams. The present
findings are thus in the direction opposite
to that which was anticipated. They suggest
that the most preferred cooperators in effec-
tive teams tend to be somewhat less warm
and emotionally less involved with persons
whom they choose as work companions than
is the case of keymen in less effective teams.


TABLE 2
SECOND STUDY: POINT BISERIAL CORRELATIONS
BETWEEN THE CRITERION AND ASSUMED SIMILARITY
SCORES OF MOST PREFERRED CO-WORKER

ASSUMED SIMILARITY
MEASURE                r pt. bis.      t       p
ASp                      -.20         .58
ASo                      -.58        2.20     .03


The correlations in Table 1 are, of course, 
based on a small sample of teams, and on only 
moderately reliable scores, .62 and .61 for 
ASp and ASo respectively (9). In addition, 
they are the survivors of a considerable 
number of exploratory measures. A valida-
tion attempt, therefore, became essential.


VALIDATION STUDY ON BASKETBALL TEAMS


The second sample of teams was studied
solely for the purpose of testing relationships
which were significant at the .05 level in the
first study, i.e., on measures ASp and ASo of
the most preferred co-worker.

Our major modifications were in the
method of choosing teams and testing signifi-
cance. We selected, toward the end of the
season, 9 teams which had had a predomi-
nantly winning season and 9 teams which had
had a predominantly losing season, and re-
quested their cooperation. These came from
the upper and lower third of a roster of over
[…] teams. We tested 7 “good” teams and 5
“poor” teams which agreed to cooperate.
Since the teams were dichotomously selected,
point biserial correlations were here used to
estimate the degree of the relationship. The
significance of the difference between the
“good” and “poor” teams was tested
by the t test; inspection of the data
indicated that the conditions for applying a
t test were not violated. The small samples
and the not very high reliability of the scores
suggest caution in interpreting these data.

As can be seen from Table 2, the point
biserial correlation between the criterion and

5 The writer gratefully acknowledges the invaluable
assistance received from Clyde Knapp and Harry A.
Combes of the University of Illinois.




[Figure: ASp and ASo scores (Q correlations) of the most preferred co-workers, plotted separately for good and poor teams]

FIG. 1. ASo AND ASp OF MOST PREFERRED CO-
WORKER PLOTTED AGAINST THE CRITERION IN THE
SECOND SAMPLE


ASo of the team's most preferred co-worker 
is -.58. We attempted to validate two
measures. Only one of these (ASo) reached 
significance. We are therefore not justified 
to consider the relation of ASo to team effec-
tiveness in basketball as established. ASp
of the teams’ most preferred co-workers did 
not correlate significantly with the criterion 
even though the correlation is in the antici- 
pated direction. We have plotted the measures 
ASp and ASo of the most preferred team 
members from good and poor teams. (See 
Fig. 1.) 
In addition, we also computed validities 
for the end of the season when all league 
games had been played. These validities are
generally lower than those of December 31.
It is clear from these data that ASp in con-
trast to ASo is not a promising predictor of
team effectiveness (7). 
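The validation analysis rests on two standard computations: a point biserial correlation between the good/poor dichotomy and the AS score, and a t value for that correlation. A minimal sketch follows; the AS scores are hypothetical (the individual team values are not given in the text), the conversion t = r√(n − 2) / √(1 − r²) is the usual textbook form and is assumed rather than quoted from the report, and the function names are illustrative.

```python
import math

def point_biserial(good_scores, poor_scores):
    """Correlation between group membership (good = 1, poor = 0)
    and a continuous score."""
    n_good, n_poor = len(good_scores), len(poor_scores)
    n = n_good + n_poor
    pooled = list(good_scores) + list(poor_scores)
    grand_mean = sum(pooled) / n
    sd = math.sqrt(sum((x - grand_mean) ** 2 for x in pooled) / n)
    mean_good = sum(good_scores) / n_good
    mean_poor = sum(poor_scores) / n_poor
    return (mean_good - mean_poor) / sd * math.sqrt(n_good * n_poor) / n

def t_from_r(r, n):
    """t value with n - 2 df for a correlation r based on n cases."""
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)

# Hypothetical ASo scores of the most preferred co-worker in each team
good = [0.10, 0.25, 0.05, 0.20, 0.15, 0.25, 0.20]  # 7 "good" teams
poor = [0.45, 0.35, 0.25, 0.40, 0.15]              # 5 "poor" teams

r_pb = point_biserial(good, poor)
print(round(r_pb, 2), round(t_from_r(r_pb, len(good) + len(poor)), 2))
print(round(t_from_r(-0.58, 12), 2))
```

With r = -.58 and 12 teams the formula gives a t of about 2.25 in absolute value, close to the 2.20 reported in Table 2; the small gap may reflect rounding of r, although the text does not say.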


STUDY ON SURVEYOR TEAMS⁶


The Hypothesis


The basketball team studies led to one
major hypothesis: Members of effective teams
will prefer co-workers who assume relatively
little similarity between the persons whom they
choose and those whom they reject as their own
co-workers.

6 The writer is indebted to Prof. M. O. Schmidt,
Civil Engineering Department, University of Illinois,
whose interest and cooperation made this phase of
the study possible.




Since we interpret high AS to be indicative 
of warmth toward, and acceptance of, others 
we also explored whether team effectiveness 
and congeniality are negatively related. 

The following study was designed to test 
this hypothesis, and to obtain evidence
regarding the additional question. 


Selection of Groups 


Student surveying parties work in teams of three to 
four men. The Ss were 71 civil engineering students tak-
ing a required course in surveying. The course consists 
of two parts. The first part is taught on the university 
campus on a full-day basis, lasting three weeks; the 
second part covers five weeks. This is offered at a 
university-operated surveying camp in northern Min- 
nesota where students concentrate on field problems 
in relatively difficult terrain. The camp is almost com- 
pletely isolated and self-contained. Students as well as 
faculty members eat, sleep, and work there, and stu- 
dents are under practically continuous supervision of 
their instructors. 

Organisation of the course at camp. While at camp, 
the students were divided into six sections, one instruc- 
tor remaining in charge of each of the sections through- 
out the camp period. Each section consisted of three or 
four parties, and each of the parties consisted of three 
or four men. A total of 22 surveying parties was formed. 

Differences between basketball teams and Surveying 
parties. Obviously basketball teams differ in many re- 
spects from surveying parties. The differences which 
we considered among the most important are the fol- 
lowing: 

a. Basketball teams require physical coordination,
relatively little verbal interaction. Surveying is pri- 
marily an intellectual task requiring frequent verbal 
communication. 

b. While basketball squads consist of 9-18 members, 
the surveying parties in our study were no larger than 
four men. 

c. Basketball teams work under considerable time
pressure. Speed in surveying is only of secondary im-
portance.

d. Members of basketball teams are highly identi- 
fied with the team, and personally involved with their 
team’s success. This identification and involvement is 
almost completely absent in surveying teams. The 
students were graded individually, and no benefits 
were derived from being in a “good” surveying party.
This is shown by the fact that none of the students were 
interested in their instructor's opinion of their team. 


The Instrument 


As in the basketball studies, Ss responded to three
identical questionnaires: describing themselves, predicting
a preferred, and predicting a rejected co-worker. Unlike the
forced-choice questionnaires used in the basketball study, the
surveyor tests consisted of 60 statements, each of which 
was to be marked on a seven-point scale ranging from 
definitely true to definitely untrue. The statements were
pretested on a 180-item questionnaire.⁷ Statements
were selected on the basis of an item analysis to obtain
items with large variances on self-descriptions. State-
ments such as “I am very discriminating in my choice
of friends,” “I am not likely to admit defeat,” or
“When a person is a failure it is his own fault,” were
used. The instrument presents a considerable improve-
ment over the tests used in the basketball studies. The
reliabilities for ASp and ASo are .83 and .93, respec-
tively (2). In addition, the tests require less time for
administration.

By comparing any two tests by means of the statis-
tic D (1), it is possible to obtain a score indicating how
similarly the two questionnaires have been
marked.
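
The comparison of two questionnaires can be sketched as follows. This is a minimal illustration, assuming that D is the square root of the summed squared differences between the two sets of item ratings, in the manner of the profile-distance measure of reference (1). The function name and the sample ratings are hypothetical and are not data from the study.

```python
import math


def d_statistic(ratings_a, ratings_b):
    """Distance D between two questionnaires marked on the same items.

    Assumption: D = sqrt(sum of squared item differences). A high D means
    the two questionnaires were marked differently, i.e., low assumed
    similarity.
    """
    if len(ratings_a) != len(ratings_b):
        raise ValueError("both questionnaires must cover the same items")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(ratings_a, ratings_b)))


# Hypothetical seven-point ratings on three items.
preferred = [1, 2, 3]
rejected = [4, 6, 3]
print(d_statistic(preferred, rejected))
```

Under this reading, ASo would be the D between the predictions for the preferred and the rejected co-worker, and ASp the D between the self-description and the prediction for the preferred co-worker.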

Test procedure and instructions. The instructions and 
administration of questionnaires followed those of the 
basketball studies, with a few, relatively minor, ex- 
ceptions. The Ss could predict their preferred and least 
preferred persons from among those with whom they
had previously worked. These did not have to be per- 
sons at the camp. The Ss again completed sociometrics 
regarding the three persons within their section (10- 
15 Ss) whom they personally liked most and liked 
least. They similarly named three Ss whom they pre-
ferred most, and the three whom they preferred least
as co-workers.


The Criteria of Effectiveness 


Instructors were asked to rank all teams in 
their section in terms of the following: 

a. Accuracy with which surveying jobs were 
done by various parties. 

b. Speed with which the jobs were done. 

c. Congeniality of the teams in terms of 
lack of conflict and smooth-running field 
operations. 

In addition, students in all sections were
asked to “rank all parties in the section from
best to poorest.” This constitutes our student
ratings criterion.⁸

Accuracy is the main criterion in sur-
veying. It was, therefore, the only criterion
on which we attempted to validate the hy-
pothesis derived from the basketball study.
The one-tailed test of significance was applied,
therefore, to the accuracy criterion only.
Tests relating to other criteria were explora-
tory.

7 We are indebted to Col. R. W. Faubion, Com-
manding Officer, Det. #3, Human Resources Research
Center, Chanute Air Force Base, for permission to
pretest this instrument.

8 Students’ ratings of Section V could not be used.
The Ss in that section had been in more than one
surveying party, and a number of students rated teams
other than the main teams rated by the instructor of
that section.




Each instructor could rank only the three 
or four surveying parties in his own section. 
Ranking of parties from different sections 
was standardized to permit comparison of all 
parties. AS scores for all 22 teams could then 
be correlated with the various criteria. 

The fact that instructors’ frames of refer-
ence differ decreases to some extent the
criterion reliability. This would tend to obscure
any relationships present, and it would in-
crease the probability of accepting the null
hypothesis when a true difference exists.
Table 3 presents the intercorrelations of the
four criteria used in this study for three- and
four-man parties.


Relation of Assumed Similarity Measures to Criteria in Surveying


The accuracy criterion. Our major hypothesis
states that the assumed similarity of most
preferred co-workers in good teams will be
relatively low. The preferred co-workers in
relatively poor teams will have high ASo scores.

Our population of teams consists of 22 sur-
veying parties, divided into six different sec-
tions. This division presents difficulties in
statistical treatment of the data since no
criterion was available for comparing teams
from different sections. We have here tested
the hypothesis by two methods.

a. We compare the best and the poorest
teams within each of the six sections. We can
then ask whether the AS scores of the pre-
ferred co-worker in the best team from each
section are lower than the AS scores of the
preferred co-worker in the poorest team in
each section. Since the two teams for each
section are evaluated by the same instructor,
the matched t test can here be used. This does,
however, reduce to 12 the number of teams
(cases) used in the analysis. (See Table 4.)
As can be seen in Table 4, ASo differences are
significant. ASp shows only a negligible dif-
ference.

b. A somewhat more satisfactory indication
of the degree of relation comes from a second
analysis. Criterion ratings were converted to
standard scores and then correlated with the most
preferred co-workers’ AS scores, both ASp
and ASo. Table 4 also presents the r’s between the
primary criterion, accuracy, and these two




TABLE 3

INTERCORRELATIONS (r) OF CRITERIA FOR THREE-
AND FOUR-MAN SURVEYOR TEAMS*

                                              CONGENIALITY
                        ACCURACY    SPEED     RATINGS
Speed                     .79
Congeniality ratings      .15        .52
Student ratings          -.34        [?]        .39

* (See footnote 8.) Based on N's of 22, except correlations with
the student rating criterion, in which N's = 18. The entry marked
[?] is illegible in the source.


TABLE 4

COMPARISON OF AS OF PREFERRED CO-WORKERS IN
TEAMS RATED HIGHEST AND LOWEST IN ACCURACY

                            (N = 12)
INTERPERSONAL    MEAN OF     MEAN OF                        r
PERCEPTION       HIGHEST     POOREST       t       p     (N = 22)
SCORES           TEAMS*      TEAMS
ASp              12.96       12.24        .56
ASo              20.61       15.32       3.30     .025     [?]

* In terms of D’s. A high score indicates low assumed similarity.
The entry marked [?] is illegible in the source.


AS scores. As can be seen, the hypothesized 
relationship has been found between the cri- 
terion and ASo of the most preferred co- 
worker. Hence, persons chosen as most pre-
ferred co-workers in effective (i.e., accurate)
teams perceive a greater difference between 
those whom they prefer and those whom they 
reject as co-workers than keymen in less ac- 
curate teams. The findings thus are consistent 
with the hypothesis induced from the results 
which were obtained in the study on basket- 
ball teams. 
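
The matched t test used for the best-versus-poorest comparison within sections can be sketched as follows. This is a minimal illustration of the standard paired-difference t; the function name is hypothetical and the scores in the usage lines are invented for illustration, not taken from Table 4.

```python
import math


def matched_t(best_scores, poorest_scores):
    """Paired t for one best and one poorest team per section.

    Returns (t, degrees of freedom). Each pair comes from the same
    section, so only within-section differences enter the test.
    """
    if len(best_scores) != len(poorest_scores) or len(best_scores) < 2:
        raise ValueError("need at least two matched pairs of equal length")
    diffs = [b - p for b, p in zip(best_scores, poorest_scores)]
    n = len(diffs)
    mean_diff = sum(diffs) / n
    variance = sum((d - mean_diff) ** 2 for d in diffs) / (n - 1)
    if variance == 0:
        raise ValueError("all differences are identical; t is undefined")
    return mean_diff / math.sqrt(variance / n), n - 1


# Hypothetical D scores for three sections.
t, df = matched_t([3, 5, 7], [1, 2, 3])
print(round(t, 3), df)
```

With six sections the test has five degrees of freedom, which is why a t of about 2.57 is needed for two-tailed significance at the .05 level.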


The Secondary Criteria 

No significant relation was found between
AS scores and secondary criteria. The relation
between AS scores and students’ ratings is,
however, in the opposite direction from that
found for accuracy and other criteria based on
instructors’ ratings. In other words, the stu-
dents tended to rate those teams as better in
which the preferred co-worker assumed rela-
tively high similarity to his negative choice.
This appears to lend some support to the inter-
pretation that preference for a person with
high AS is related positively to congeniality
within the team. (See Table 3.)



The relation of congeniality and effective- 
ness was explored by one further step. An 
Intrateam Preference Index (IPI), defined 
as an index of congeniality, was devised and 
correlated with each of our four criterion 
ratings. 

This measure of congeniality is based on the 
following considerations: 

a. Each person had rated the other 10-15 
members of his section in terms of how well 
he liked them as co-workers. 

b. A subject worked in a three- or four-man 
team. He could choose his preferred co-workers 
within his own three- or four-man team, or he 
could prefer others in his section who were 
not in the team. 

c. We assume that a team whose members 
choose one another is more congenial than one 
whose members reject one another or choose 
outsiders.

The measure is computed by the formula:

$$\mathrm{IPI} = \frac{(\text{choices within} + \text{rejections without}) - (\text{choices without} + \text{rejections within})}{nk - n}$$

where

n = number of men within the team
k = number of choices made by each individual
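
The index can be sketched in code as follows. The printed formula is damaged in the source, so this sketch rests on an assumed reading: the numerator is (choices within + rejections without) minus (choices without + rejections within), and the denominator is nk - n. The function and argument names are hypothetical.

```python
def intrateam_preference_index(choices_within, rejections_without,
                               choices_without, rejections_within, n, k):
    """Intrateam Preference Index under the assumed reading of the formula.

    n is the number of men in the team, k the number of choices made by
    each individual. Choices of teammates and rejections of outsiders
    raise the index; choices of outsiders and rejections of teammates
    lower it.
    """
    denominator = n * k - n
    if denominator <= 0:
        raise ValueError("n * k - n must be positive")
    numerator = ((choices_within + rejections_without)
                 - (choices_without + rejections_within))
    return numerator / denominator


# Hypothetical four-man team, three choices per man.
print(intrateam_preference_index(6, 4, 2, 0, n=4, k=3))
```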


When this intrateam preference index (IPI) 
was correlated with our four criterion ratings, 
the following relationships appeared: The 
correlation between accuracy and the IPI 
was negative (—.23), while other criterion 
ratings correlated positively with the IPI. 
The highest correlation was found between 
students’ ratings and the IPI (.37). When we
compare the most preferred co-worker’s ASo 
scores of teams considered best and those con- 
sidered poorest by students, we find higher 
ASo for teams rated high by students and 
lower ASo scores (greater perceived difference 
between most and least preferred work com- 
panions) for teams which students rated as 
poor. This relationship is not statistically 
significant (t = 2.24; t = 2.57 is required for
significance at the .05 level). The direction would
tend to indicate, however, along with our other
data, that effectiveness and congeniality may
be inversely related in informal teams. This
finding is in accord with the findings in the pre-
ceding basketball study. Moreover, it is sup-
ported by the results in Halpin’s recent study
of air crew leaders (8) and a paper by Schach-
ter, Ellertson, and McBride on experimentally
assembled student groups (10). The results are
not in agreement with Van Zelst’s study on
construction workers (11). All in all, these data
suggest that further study is needed to deter-
mine whether or not effectiveness and con-
geniality are inversely related.


DISCUSSION


Two interpersonal perception scores on
assumed similarity were correlated with the
criterion in each study. One relation reaches
the prescribed significance level every time.
The studies thus support the hypothesis that
the interpersonal perception variable ASo
plays a part in group effectiveness.

We feel that the present findings serve
primarily to emphasize that research on inter-
personal perception in task groups is a fruitful
area for continued efforts. This discussion will,
therefore, be largely concerned with the impli-
cations of these findings for further research.

Let us first examine the measure which
yielded significant results.

ASo. This score was obtained by comparing
S's prediction of his positive choice for work
companion with his prediction of his negative
choice. It is thus the similarity which S as-
sumes to exist between the person with whom
he says he can, and the one with whom he says
he cannot cooperate. According to our data,
the most preferred co-worker in effective teams
tends to perceive these two persons as rela-
tively dissimilar. On the other hand, the most
preferred co-worker in ineffective teams tends
to perceive these persons as relatively similar.
Low ASo (i.e., large perceived difference be-
tween most and least preferred work com-
panions) may, therefore, reflect an evaluative,
critical attitude toward others, as contrasted
with warm, empathic interpersonal relations.
(Further research is needed to clarify the mean-
ing of this measure.) We have found that AS
tends to correlate positively with reputed
therapeutic competence (3); in subsequent
studies it was shown that Ss assume signifi-
cantly more similarity to a person who is
liked than to someone who is relatively dis-
liked (5, 6). Since ASo and ASn (AS to the
least preferred) are highly correlated, we be-
lieve that the person with low ASo is relatively
unaccepting, and perhaps rejecting, to the
person who is not a good work companion.

The most preferred co-worker. While ASo 
in key persons appears to measure relevant 
factors in team effectiveness, it also points 
to a phenomenon which may be of more 
general theoretical importance. Only the 
scores of the most preferred co-workers corre- 
late with the criterion. When we correlated 
the team’s median ASo with the criterion, no 
significant relationships were found. At present 
we are inclined to take these results as an indi- 
cation that members of effective teams use a 
basis different from that of members of in- 
effective teams for choosing and rejecting 
others as cooperators. This interpretation is 
supported by the positive correlations be-
tween ASo of the most and of the second most
preferred co-workers in basketball teams (.63
and .27 for the first and second samples, re-
spectively). In light of our current interpreta-
tion of ASo, this would mean that members
of effective teams prefer highly task-oriented
persons as co-workers. Members of relatively
ineffective teams list as their most preferred
co-workers the more accepting, relationship-
oriented team members. ASo in the most pre-
ferred team worker is thus possibly an indica-
tion of the entire team’s attitude toward the
task, e.g., an index of the team’s morale.
Whether or not these relations hold in radically
different teams, e.g., formally structured
groups, permanent crews, etc., remains to be
established.


SUMMARY AND CONCLUSIONS


The present investigations test the hypothe-
sis that group effectiveness is related to the
interpersonal perceptions which members of
the group have toward one another.
Interpersonal perceptions were measured by
correlating identical questionnaires which sub-
jects were instructed to fill out (a) describing
themselves, (b) predicting the responses of
their preferred co-worker, and (c) predicting
the responses of their rejected co-worker.
The first study used 14 high school basket-
ball teams, tested at the beginning of the
season. A second sample of 7 “good” and 5
“poor” teams was collected toward the end
of the season for the purpose of verifying rela-
tions identified in the first study. A third
sample consisted of 22 surveying teams.




The criterion of basketball team effective- 
ness was the proportion of games the teams 
had won (at midseason in the first sample, two 
weeks before the end of the season in the 
second sample). The criterion in the surveyor 
study was the instructor rating on accuracy.
There was no correlation between the criterion
and the median of any assumed similarity
score within a team. The assumed similarity
score, ASo, of the team’s most preferred work
companion was negatively correlated with the
criterion in all three samples. The finding
supports the hypothesis that ASo of the most
preferred co-worker in surveying, and pos-
sibly also in basketball, is related to team
effectiveness.

The interpersonal perception scores of the 
chosen person are believed to reflect his out- 
look on other persons and on the task. Low 
ASo is thought to reflect lack of emotional 
involvement with teammates and task- 
oriented attitudes. The group which chooses 
a differentiating person as preferred co-worker 
is thus likely to be more concerned with ef- 
fective task performance, and correspondingly 
more successful. Some evidence suggests that 
the more effective surveying teams tend to 
be less congenial than relatively ineffective 
teams. 

As in previous studies, we found that Ss 
assumed greater similarity between themselves 
and their positive, than between themselves 
and their negative choices. 


REFERENCES 


1. CRONBACH, L. J., & GLESER, GOLDINE C. Similar-
ity between persons and related problems of profile
analysis. Urbana, Ill.: Bureau of Research and
Service, Univer. of Illinois, 1952. (Mimeo.) (Tech.
Rep. No. 2, Contract N6ori-07135.)

2. CRONBACH, L. J., HARTMANN, W., & EHART,
MARY E. Investigation of the character and
properties of assumed similarity measures. Urbana,
Ill.: Bureau of Research and Service, Univer. of
Illinois, 1953. (Mimeo.) (Tech. Rep. No. 7, Con-
tract N6ori-07135.)

3. FIEDLER, F. E. A method of objective quantifica-
tion of certain counter-transference attitudes.
J. clin. Psychol., 1951, 7, 101-107.

4. FIEDLER, F. E. Assumed similarity measures as
predictors of team effectiveness in surveying.
Urbana, Ill.: Bureau of Research and Service,
Univer. of Illinois, 1953. (Mimeo.) (Tech. Rep.
No. 6, Contract N6ori-07135.)

5. FIEDLER, F. E., BLAISDELL, F. J., & WARRING-
TON, W. G. Unconscious attitudes as correlates
of sociometric choice in a social group. J. ab-
norm. soc. Psychol., 1952, 47, 790-796.

6. FIEDLER, F. E., HARTMANN, W., & RUDIN, S. A.
The relationship of interpersonal perception to
effectiveness in basketball teams. Urbana, Ill.:
Bureau of Research and Service, Univer. of Illi-
nois, 1952. (Mimeo.) (Suppl., Tech. Rep. No. 3,
Contract N6ori-07135.)

7. FIEDLER, F. E., HARTMANN, W., & RUDIN, S. A.
Correction and extension of the relationship of
interpersonal perception to effectiveness in basket-
ball teams. Urbana, Ill.: Bureau of Research and
Service, Univer. of Illinois, 1953. (Mimeo.) (Tech.
Rep. No. 5, Contract N6ori-07135.)

8. HALPIN, A. W. The relation between the crew’s
perception of the leadership behavior of airplane
commanders and superiors' ratings of their com-
bat performance. Paper read at Amer. Psychol.
Ass., Washington, D. C., 1952.

9. RUDIN, S. A., LAZAR, I., EHART, MARY E., & CRON-
BACH, L. J. Some empirical studies of the reliability
of social perception scores. Urbana, Ill.: Bureau
of Research and Service, Univer. of Illinois, 1952.
(Mimeo.) (Tech. Rep. No. 4, Contract N6ori-
07135.)

10. SCHACHTER, S., ELLERTSON, N., MCBRIDE,
DOROTHY, & GREGORY, DORIS. An experimental
study of cohesiveness and productivity. Hum.
Relat., 1951, 4, 229-238.

11. VAN ZELST, R. H. Sociometrically selected work
teams increase production. Personnel Psychol.,
1952, 5, 175-185.


Received April 13, 1953. 


THE EFFECT OF REWARD ON ADULT IMITATIVE BEHAVIOR¹

EDGAR H. SCHEIN²
Army Medical Service Graduate School

IN THEIR book Social Learning and Imitation Miller and Dollard (6)
assert that imitative behavior is learned by the same principles as
other kinds of behavior, the principles of instrumental conditioning.
This assertion is supported by the findings of a series of experiments
carried out with animals and children. The study to be reported below
attempted to test the Miller and Dollard theory in a somewhat wider
context with adult human subjects. Specifically, the purpose of the
experiment was (a) to determine whether rewarding imitative responses
would make such responses more probable, and (b) to determine whether
such responses, once “learned,” would generalize to a new situation.


METHOD


The subjects (Ss) were 300 Army inductees at a reception center.³ They
participated in the experiment in groups of five shortly after they had
completed their preliminary processing. When they arrived at the
experimental room each of these groups was told that they were about to
take two tests which were somewhat different from the ones they had just
completed as part of their normal processing, but which were,
nonetheless, part of the regular testing procedure and just as
important.

Altogether, 60 groups (300 Ss) were run according to the experimental
design shown in Table 1.

Each row in the table represents the treatment given to a different set
of 10 groups. For the experimental groups Test 1 was a learning
situation in which imitation was systematically rewarded. In the control
groups all conditions were identical except that imitation was not
systematically rewarded. Test 2 was a test of generalization,
administered in an identical way to experimental and control groups.
There were three different types of generalization tests, varying in
presumed similarity to Test 1, but each set of experimental and control
Ss received only one type.

Throughout the experiment Ss sat next to each other facing E. Each S was
given one copy of a card which had five groups of four letters and
numbers on it.⁴ The Ss were told that the order and magnitude of the
letters and numbers in each of the groups was determined according to
some organizing principle, and that it was their job to discover what
these organizing principles were, by learning to match other sets of
letters and numbers to one of the sets on their card. The E held up 25
cards, one at a time, each with a group of three letters and numbers on
it.⁵ Each of these groups was ostensibly constructed by the same
principle as one of the five groups on Ss' cards. If the principle used
in making the match was correct, S would match correctly; if the
principle used was wrong, he would make the wrong match and have to try
a different principle.

1 The present study was facilitated by the Laboratory of Social
Relations, Harvard University. The author wishes to express his
appreciation to Professors Gordon W. Allport, Richard L. Solomon, and
Frederick [surname illegible] for their helpful advice in carrying out
the study.

2 First Lieutenant, MSC, USA.

3 The Ss were obtained at the Reception Center at Fort [name
illegible], Maryland. I wish to thank the staff of the Center for their
cooperation in making it possible to use inductees for the experiment.

The Ss gave their answers out loud by calling out 
the number of the group on their card which they 
chose as the correct match for the one held up by E. 
They always gave their answers in order, ostensibly 
to avoid confusing E, who was recording them. After 
all five men had given their answer for any particular 
problem, E would announce the number of the group 
which was the correct match, and then go on to the 
next problem.

Actually there were no principles underlying the 
construction of any of the groups of letters and num- 
bers. The groups were arbitrarily constructed by E to 
allow for any number of principles to be applicable. 
Any matching of groups could be “correct” or “in- 
correct,” since any of a large number of principles 
“might” be applicable. 

This situation made it possible for E to choose
arbitrarily any answer as the correct one without
arousing Ss' suspicion in any way. Imitation was then
“rewarded” by the following procedure: On items 6 to
25 of the first test, E chose as correct the answer that
the second man in the order of responding gave, no
matter what it was. Therefore, the only way a man
answering after the No. 2 man could get the correct
answer was by copying the answer of the No. 2 man.
If he did this, he would be rewarded by the acknowl-
edgement on the part of E that he had obtained the
correct answer.


4 The five groups were as follows:
1. ABCE    2. KAC9    3. ZF9A    4. B1H8    5. 7H3J

5 These cards read:
 1. BD7    10. B3A    18. FSK
 2. 4G8    11. BK2    19. 7E2
 3. TEC    12. FDC    20. 5A3
 4. C1E    13. GE3    21. 4BA
 5. RCS    14. KEG    22. 25B
 6. 436    15. 157    23. 287
 7. 023    16. K82    24. ATD
 8. 34C    17. ABE    25. 61K
 9. 7ED


TABLE 1
EXPERIMENTAL DESIGN

GROUPS          ANSWERS ANNOUNCED            ANSWERS NOT ANNOUNCED
Experimental    Test 1, No. 2 rewarded       Test 2a
                Test 1, No. 2 rewarded       Test 2b
                Test 1, No. 2 rewarded       Test 2c
Control         Test 1, random reward        Test 2a
                Test 1, random reward        Test 2b
                Test 1, random reward        Test 2c


On items 1 to 5, E chose as correct an answer which 
no one in the group had given if this was possible. In 
the control groups the entire procedure was identical 
except that the correct answers to the problems were 
assigned at random by E. 
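
The rule by which E announced the “correct” answer on Test 1 can be sketched as follows. This is an illustrative reading of the procedure, not E's actual protocol: the function name, the argument names, and the fallback used when every alternative has already been given on items 1 to 5 are assumptions.

```python
import random


def announced_answer(item_number, answers, alternatives, experimental, rng=None):
    """Answer that E announces as correct for one Test 1 item.

    answers holds the responses in order of responding, so answers[1]
    is the No. 2 man's answer. alternatives lists the group numbers on
    the Ss' card.
    """
    rng = rng or random.Random()
    if not experimental:
        # Control groups: correct answers assigned at random.
        return rng.choice(alternatives)
    if item_number >= 6:
        # Items 6 to 25: whatever the No. 2 man said is "correct".
        return answers[1]
    # Items 1 to 5: an answer nobody gave, if this is possible.
    unused = [a for a in alternatives if a not in answers]
    # Assumed fallback when all alternatives were used: any alternative.
    return rng.choice(unused) if unused else rng.choice(alternatives)


print(announced_answer(10, [1, 2, 3, 4, 4], [1, 2, 3, 4, 5], True))
```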

Each experimental or control group was given the
second test immediately following the first. The test
material assumed to be most similar to Test 1 consisted
of 25 multiple-choice perceptual problems presented on
slides. The problems consisted of series completions,
mirror images, and matrix completions.⁶ The first slide,
showing the problem, was flashed for 10 seconds; then
a second slide was flashed for 10 seconds with five
plausible looking alternatives on it, none of them the
correct answer. The test assumed to be intermediate
in similarity to Test 1 consisted of 25 dot numerosity
judgments. Each slide had from 100 to 300 dots on it
and was flashed for five seconds.⁷ The Ss were told
that the problem was one of estimating large numbers
of objects such as airplanes when these can only be
seen for short periods of time. The Ss simply reported
how many dots they thought were on each slide. The
test material assumed to be least similar to Test 1
consisted of 25 multiple-choice problems, on each of
which the “best” photograph from the standpoint of
technical proficiency had to be selected from a group
of five in 25 seconds.

Each of these tests again called for verbal responses, 
and Ss gave their answers out loud and in the same 
order as they had on the first test. However, on this 
second group of tests no announcement was made as 
to which answers were correct on any of the problems, 
thus removing any "reward" for imitation. The ex- 
perimental and control groups were treated in exactly 
the same way. At the end of the two tests each group 
of Ss was given a short questionnaire in an attempt to 
survey some of their attitudes toward the experimental 
procedure and toward each other. 


6 Items were selected from Cattell’s Culture Free
Intelligence Test with permission of the Institute for
Personality and Ability Testing, 1608 Coronado Drive,
Champaign, Illinois.

7 The actual numbers of dots on the 25 slides pre- 
sented were: 160, 160, 150, 140, 100, 300, 240, 220, 200, 
200, 180, 170, 160, 160, 150, 140, 120, 100, 300, 240, 
220, 200, 200, 180, 170. 




This procedure was designed to test the following
hypotheses: (a) that the No. 2 man in the experimental
groups would be imitated more than the No. 2 man in
the control groups on Test 1, and that the amount of
imitation would rise through the items of the test
(with increasing number of rewards); (b) that the No. 2
man in the experimental groups would be imitated
more than the No. 2 man in the control groups on Test
2, and that the difference between the groups would
decrease as the similarity between Test 1 and Test 2
decreased.

Advantages of the experimental situation. The use of
Army inductees in an Army setting had several meth-
odological advantages: (a) It was possible to reduce
uncontrolled prestige factors to a considerable extent.
In most cases Ss did not know each other personally,
or at most had an acquaintance of one or two days’
standing. Furthermore, since all the inductees were
dressed in fatigues, overt status differences were com-
pletely eliminated. All formal communication between
Ss which might have served as a basis for prestige
was prohibited during the course of the experiment.
(b) The 300 Ss constituted a representative cross sec-
tion of the population of young men with respect to
intelligence, education, personality, and social-cultural
background. (c) It was possible to present the experi-
ment as a routine test situation. The behavior that
was called out was therefore untainted by motives
such as “pleasing the experimenter,” “not trying hard
because it was only research,” “making the results
come out a certain way,” or “guarding against that
guy (the experimenter) finding out anything about
me."

Analysis of the data. An imitative response was de-
fined as any agreement between a man's answer and
the answer of some man who had responded before
him on that item. Since the No. 1 man in order of
responding could not imitate anyone, and the No. 2
man was chosen as the model, most of the results con-
cern the imitative behavior of the No. 3, No. 4, and
No. 5 men in each group. Since no systematic differ-
ence in the amount of imitation that these different
positions showed was observable, we averaged across
the positions in all our analyses.
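
The scoring just described can be sketched as follows. This is an illustration only; the function names and the sample answers are hypothetical, and each item is assumed to be recorded as a list of answers in order of responding.

```python
def is_imitative(answers, position):
    """True if the man at this 1-based position repeats an earlier answer."""
    return answers[position - 1] in answers[:position - 1]


def mean_imitation_of_model(items, model=2):
    """Average number of items on which later men matched the model.

    items is a list of per-item answer lists. The count is taken for
    each position after the model (No. 3, 4, and 5 when model=2) and
    then averaged across those positions.
    """
    followers = range(model + 1, len(items[0]) + 1)
    counts = [sum(1 for ans in items if ans[p - 1] == ans[model - 1])
              for p in followers]
    return sum(counts) / len(counts)


# Hypothetical answers of five men on three items.
items = [[1, 2, 2, 3, 2], [4, 5, 1, 5, 5], [3, 3, 3, 1, 2]]
print(mean_imitation_of_model(items))
```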

If the different alternatives for each problem had been
equally probable, imitation could be measured against
a chance level in both experimental and control groups.
The Ss did not in fact choose all of the answers equally
often, and we therefore used comparisons between
experimental and control groups only.

The analysis of dot numerosity judgments was
carried out in terms of the absolute distance between
the estimates of the different positions without regard
to the true numbers of dots on the slides. The true
numbers affected these differences, but the effect was
approximately the same in the experimental and con-
trol groups, thus making comparisons possible.
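
The distance measure for the dot judgments can be sketched as follows. This is a minimal illustration, assuming that each later man's estimate is compared with the No. 2 man's estimate and that the distances are averaged across the later positions. The function name and the sample estimates are hypothetical.

```python
def mean_abs_distance_from_model(estimates, model=2):
    """Mean absolute distance of later estimates from the model's estimate.

    estimates lists the dot estimates in order of responding for one
    slide. A smaller value means closer agreement with the model. The
    true number of dots is not used.
    """
    model_estimate = estimates[model - 1]
    later = estimates[model:]
    if not later:
        raise ValueError("no one responded after the model")
    return sum(abs(e - model_estimate) for e in later) / len(later)


# Hypothetical estimates of five men for one slide.
print(mean_abs_distance_from_model([150, 200, 180, 230, 200]))
```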


RESULTS

Test 1. Figure 1 shows the mean amount of
imitation of the No. 2 man by the No. 3, 4, and
5 men (averaged together) over successive
groups of five items on Test 1. Each curve is
based on 30 groups.


[Figure 1: number of imitations per 5 items (ordinate) plotted against
item groups 1-5, 6-10, 11-15, 16-20, 21-25 (abscissa); solid line,
experimental group; broken line, control group.]

FIG. 1. AMOUNT OF IMITATION THROUGH THE ITEMS
OF TEST 1

Each point on a curve shows the amount that the
No. 3, 4, and 5 men imitated the No. 2 man on that
group of five items.


TABLE 2

FREQUENCY DISTRIBUTIONS OF IMITATION
ON ITEMS 16 TO 25 OF TEST 1

NUMBER OF      NUMBER OF        NUMBER OF
IMITATIVE      EXPERIMENTAL     CONTROL
RESPONSES*     SUBJECTS         SUBJECTS
               (N = 90)         (N = 90)
    0              2               11
    1              7               13
    2             17               27
    3             10               14
    4             21               13
    5             14                9
    6             10                2
    7              6                1
    8             [?]              [?]
    9              0                0
   10             [?]               0

* The mean number of imitative responses for the experimental
Ss was [?] (SD = 2.02); for the control Ss the mean was 2.50
(SD = 1.63). Entries marked [?] are illegible in the source.


The initial high level of imitation in both
experimental and control groups is probably
due to the fact that all Ss tended to use the
same simple hypotheses for matching at the
beginning of the test. Since none of the answers
in the experimental groups and very few of
those in the control groups were ac-
knowledged by E as correct, Ss probably began
to use more varied and complex hypotheses,
thus lowering the level of imitation on items
6 to 10. By the time items 11 to 15 were
reached, Ss in the experimental groups prob-
ably noticed that the No. 2 man was consist-
ently correct, that he had “caught on” to the
principles. The results from that point on show
a consistent rise in amount of imitation in the
experimental groups, and a consistent decrease
in amount of imitation in the control groups.
The difference between the experimental and
control means for items 16 to 25 is significant
at the .001 level (one-tailed t test).

The first hypothesis is thus strongly sup-
ported, but several additional findings should
be taken into account in making an unambigu-
ous interpretation of the data. (a) Individual
differences were large (see Table 2). (b) The
Ss were highly inconsistent in their behavior
from one item to the next. Very few men settled
down to a pattern of either imitating or not
imitating. (c) The level of imitation reached
in the experimental groups was not very high
in an absolute sense. If Ss had “learned” to
imitate, one might expect an average level
somewhat higher than 2.1 imitative responses
on the last five trials.⁸

Test 2a. Figure 2 shows the results for the 
generalization test most similar to Test 1. 
Through all the 25 items the No. 3, 4, and 5 
men (averaged together) in the experimental 
groups imitated the No. 2 man more than the 
comparable men in the control groups. The 
difference between the means for all 25 items 
taken together is significant at the .01 level 
(one-tailed t test), thus showing clear-cut positive
generalization of the imitative response.

Individual differences in responding con- 
tinued to be large, and Ss remained inconsist- 
ent in their behavior. The rank-order correla- 
tion between the amount of imitation on items 
16 to 25 of Test 1 and the amount of imitation 
on Test 2a is only +.21, indicating that 
group results cannot be used to infer Ss’ in- 
dividual reactions. Those Ss who imitated on 
Test 1 did not always imitate on Test 2a and 
vice versa. 
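
As a rough illustration of how a rank-order correlation such as the +.21 reported above is obtained, the following is a minimal sketch in Python. The scores are hypothetical and are not the study's data, and the function names are our own. Tied scores receive the mean of their ranks, and the coefficient is taken as the product-moment correlation of the two sets of ranks, which is one common way of handling ties; the original analysis may have used a different formula.

```python
def ranks(values):
    """Return 1-based ranks; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def rank_order_correlation(x, y):
    """Rank-order correlation: the product-moment correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


# Hypothetical imitation scores for six Ss (illustration only).
test_1_items_16_to_25 = [3, 7, 2, 5, 6, 4]
test_2a_all_items = [9, 8, 6, 12, 5, 10]
rho = rank_order_correlation(test_1_items_16_to_25, test_2a_all_items)
print(round(rho, 2))
```

A coefficient near zero, as in the text, means that knowing an S's rank on one test says little about his rank on the other, even when the group means differ reliably.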

Test 2b. Figure 3 shows the results for the 
generalization test of intermediate similarity. 
The large fluctuations in the curves reflect the 

⁵ It was not possible from our results to determine
what might have happened to the curve of imitation if
the test had contained more items.


[Figure 2. Line graph for Test 2a: number of imitations per 5 items (vertical axis) plotted against successive blocks of five items (horizontal axis), with separate curves for the experimental and control groups.]

FIG. 2. AMOUNT OF IMITATION THROUGH THE ITEMS OF TEST 2a

Each point on a curve shows the amount that the
No. 3, 4, and 5 men imitated the No. 2 man on that
group of five items.


[Figure 3. Line graph for Test 2b: mean difference between estimates per 5 items (vertical axis) plotted against successive blocks of five items (horizontal axis), with separate curves for the experimental and control groups.]

FIG. 3. DEGREE OF AGREEMENT BETWEEN ESTIMATES OF NUMEROSITY THROUGH THE ITEMS OF TEST 2b

Each point on a curve shows the degree to which
the No. 3, 4, and 5 men agreed with the No. 2 man
on that group of five items.


[Figure 4. Line graph for Test 2c: amount of imitation (vertical axis) plotted against item blocks 1-5, 6-10, 11-15, 16-20, and 21-25 (horizontal axis), with separate curves for the experimental and control groups.]

FIG. 4. AMOUNT OF IMITATION THROUGH THE ITEMS OF TEST 2c

Each point on a curve shows the amount that the
No. 3, 4, and 5 men imitated the No. 2 man on that
group of five items.


effects of the actual numbers of dots on the
slides. Slides with large numbers of dots tended
to produce uniformly larger differences between
the estimates of positions than slides with small
numbers of dots on them. On this test a rather
surprising reversal occurred. The agreement
between the estimates of the No. 2 man and
the average of the No. 3, 4, and 5 men in the
experimental groups was lower than the agreement
between these positions in the control
groups. There was, in fact, a tendency for all
positions to agree less with each other in the
experimental than in the control groups. The
difference between the means for all 25 items
together is significant at the .05 level (two-tailed
t test, since the results are the opposite of
what was predicted).

Individual differences and inconsistencies
were again large. The rank-order correlation
between imitation on items 16 to 25 of Test 1
and agreement on Test 2b was +.14.

Test 2c. Figure 4 shows the results for the
generalization test of least similarity. The
mean imitation in the experimental groups is
slightly higher than the mean in the control
groups, but the difference is not significant
and the curves cross. The rank-order correlation
between items 16 to 25 of Tests 1 and 2c
was +.18.

Further findings. Data obtained from the
questionnaires indicated that E's judgment of
the relative similarity of the generalization
tests to the first test did not agree with Ss’
perceptions. We asked Ss to underline which of
the following alternatives they considered to be
“most important” in getting a high score on 
each of the tests: (a) ability to concentrate, 
(b) ability to think quickly, (c) mostly luck, 
(d) intelligence, or (e) previous experience 
with tests (or practice in judging dots or photo- 
graphs in the case of Tests 2b and 2c). 

Table 3 shows the percentage of Ss choosing 
the different categories on all the tests. It can 
be seen that Test 2b was thought to be pri- 
marily a test of practice in judging, and less a 
test of intellectual ability than Tests 1, 2a, or 
2c. It was actually the test of least similarity to
Test 1. Those Ss who picked luck as being the
factor most important in getting a high score
on Test 1 tended to imitate the No. 2 man
less, both on Test 1 and on Test 2.

In order to discover what factors might have 
been responsible for some of the individual 
differences and the inconsistencies in Ss’ be- 
havior, some further analyses of the data were 
carried out. We observed that two-thirds or
more of the Ss in both experimental and control
groups tended to choose the same answer
on items 1, 2, 12, and 14. We also observed that
the No. 2 men in the control and experimental
groups differed in the extent to which they
tended to pick these “popular” alternatives.
We assumed that the tendency to give popular
answers on these items reflected a general tendency
on the part of S to use a simple set of
hypotheses for obtaining his answers. The
question arose: Would such an S be imitated
more than one who tended not to pick simple
hypotheses for obtaining his answers?


TABLE 3
FACTORS THOUGHT BY SUBJECTS TO BE “MOST IMPORTANT” IN GETTING A HIGH SCORE ON A TEST


TEST*    INTELLIGENCE    ABILITY TO THINK QUICKLY OR ABILITY TO CONCENTRATE    PRACTICE    LUCK

T 
Teta 39 S% 63% 10% 1795 
Meat a 1005:10% 18% 2% 10% 
Test 2c 1005 Hop 40% o 976 
10 25 61% 14% 2305 

* The generalization tests are arranged in order of presumed similarity.




[Figure 5. Line graph: number of imitations on items 16 to 25 (vertical axis) plotted against the number of plausible responses chosen by the No. 2 man, 0 to 4 (horizontal axis), with separate curves for the experimental and control groups.]

FIG. 5. IMITATION OF THE NUMBER 2 MAN IN RELATION TO THE NUMBER OF PLAUSIBLE (POPULAR) RESPONSES CHOSEN BY HIM

Each point on a curve shows the amount that the
No. 3, 4, and 5 men imitated the No. 2 man on items
16 to 25 of Test 1. The small numbers below each
point show the number of groups which entered into
the average.


The answer to this question can be found in 
Fig. 5. We rated the No. 2 men according to 
whether they gave zero, one, two, three, or 
four popular answers on items 1, 2, 12, and 
14, and calculated the amount that these men 
were imitated by the No. 3, 4, and 5 men 
(averaged) on items 16 to 25. It can be seen 
that in the experimental groups the greater 
the number of popular answers given by the 
model, the more that model was imitated. No
such trend is observable in the control groups.
Since the two curves never cross, a one-tailed 
sign test shows that they differ at the .05 level 
of significance. 
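
The arithmetic behind this sign test can be sketched briefly in Python. The sketch assumes that five pairs of points were compared (one pair for each of zero through four popular answers) and that the experimental curve lay above the control curve at every one of them; the difference values below are hypothetical placeholders, since only their signs matter. Under the null hypothesis each sign is equally likely to be plus or minus, so the one-tailed probability of five pluses in five comparisons is (1/2)^5.

```python
from math import comb


def sign_test_one_tailed(differences):
    """One-tailed sign test: probability of at least as many positive
    differences as observed, if plus and minus were equally likely."""
    nonzero = [d for d in differences if d != 0]  # zero differences are dropped
    n = len(nonzero)
    positives = sum(1 for d in nonzero if d > 0)
    # Upper tail of a Binomial(n, 1/2) distribution.
    return sum(comb(n, k) for k in range(positives, n + 1)) / 2 ** n


# Hypothetical experimental-minus-control differences at the five points.
differences = [0.4, 0.9, 1.1, 1.6, 2.0]
p_value = sign_test_one_tailed(differences)
print(p_value)  # 0.03125
```

With five comparisons this is the smallest probability a one-tailed sign test can yield, which is why curves that never cross just reach the .05 level.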

In order to determine what effect Ss had on
each other, an analysis was carried out for 
each position, relating the reactions of the man 
occupying that position (whether he imitated 
the No. 2 man, or some other man, or failed to 
imitate altogether) to the combination of re- 
actions of the men in preceding positions. This 
analysis revealed two very definite trends: (a) 
An S in an experimental group was more likely 
to imitate the No. 2 man if someone before 
him had already given an imitative response 
on that item. If there was any kind of “precedent”
for imitation, S was more likely to imitate.
(b) If several Ss in the group agreed on a
single answer for a given item, a later S was 




likely to give that same answer also, even if, in 
the experimental groups, the No. 2 man had 
chosen a different answer.⁶


Discussion 


Some of the inconsistencies in the results can 
be partially explained by analyzing the social 
situation in which Ss were making their judg- 
ments. Several motives were probably operating 
in Ss which demanded that they not imitate: 
(a) Imitation might have been viewed as being 
equivalent to cheating. (b) Even if a man had 
no moral compunctions about cheating, he 
might have been afraid of being caught and 
punished by E, or of being branded by his fel- 
low Ss as a cheater or copycat. (c) The Ss might 
have had a genuine desire to solve the problems 
for themselves, viewing them purely as an 
intellectual challenge rather than a test to get 
a high score on. (d) The Ss might have felt 
highly competitive and reasoned that they 
could not beat out anyone else’s score if they 
merely imitated. 

Any or all of these motives pitted against the 
desire to obtain a high score by copying would 
produce a conflict situation which could be 
resolved in one of several ways: (a) An S could 
leave the field; this implies becoming disinter- 
ested in the whole proceedings and a relapse 
into aimless guessing. Only a few Ss seemed to 
react in this way. (b) An S could give in to 
either one of the impulses and suppress the 
other one; this implies imitating a great deal 
or not at all. The number of Ss choosing this 
course also seemed relatively small. (c) An S 
could compromise; this implies that S will 
imitate, but only part of the time. Had the 
tests not been multiple-choice tests, a com- 
promise solution could have been achieved by 
picking answers that lay in between that of 
the model's and S's own, but on all tests with 


⁶ The fact that majority judgments strongly affect
an individual's judgments has been amply documented
in the literature on suggestion (5, 7, 9) and has been
demonstrated in an interpersonal setting by experiments
such as Sherif's (8), Asch's (1), and Berenda's
(3). The effects of “precedent” in condoning an imitative
response have only been brought out by Asch
(1, 2), however, in his study showing the effects of
giving the minority members in a group some social
support by having another S make “correct” judgments
when the majority is making “incorrect” judgments.
Our results support Asch's formulations interpreting
Ss’ behavior as a rational adaptation to the
total situation.




the exception of Test 2b, an S had to make the 
decision to give either his own or the model's 
answer—there was no in-between alternative 
available. A compromise solution therefore 
consisted in an alternation between imitation 
and nonimitation. If Ss tended to adopt such a
resolution the results would reflect this in the
low level of imitation and the kinds of “inconsistencies”
in behavior which were previously
pointed out.

It can also be argued that any of the resolu- 
tions to the conflict situation involve frustra- 
tion, and that this may express itself in the 
form of hostility toward E or to the No. 2 
man, who came through with a high score. If 


such hostility was produced, it might well
have expressed itself in a desire to show up the
No. 2 man, to beat him out on a new test. The
test involving dot numerosity judgments offered
a good opportunity to experimental Ss
to express such feelings since it was the sort of 
task that depended partly on practice. The Ss 
might well have reasoned that they now had as 
good as, or a better, chance to get the correct 
answers than the No. 2 man. If such reactions 
were fairly common among Ss, they would 
account for the reversal found on Test 2b.⁷

The large individual differences among Ss
can be partially accounted for by the fact that
Ss interacted with each other in a complex way.
They were influenced by whether others imitated
before them, by majority opinions, and
by the plausibility of the answers that were
given. We pointed out that No. 2 men who
tended to give popular answers were imitated
more than No. 2 men who did not. This suggests
the possibility that Ss imitated more if
they “understood” the behavior of the model,
a condition which obtained if he used simple
hypotheses to make his matches.

We may conclude by pointing out that the
Miller and Dollard paradigm can be made
operational for adult imitation, and that our
results tended to support the theory, but that
reward, defined in any terms sufficiently narrow
to make experimental control possible, was
not the only important variable which determined
Ss’ behavior.

⁷ Festinger (4) distinguishes tasks based on “physical
reality,” where answers can be checked against
actual events, from those based on “social reality,”
where answers can only be checked against the opinions
of others. If “physical reality” is high, as in the
judgments of dot numerosity, and S feels he has the
necessary skill to make the required judgments, then
conformity may be expected to be at a minimum. Our
results are in line with this expectation.

It is of interest to note that many Ss ac- 
tively resisted experimentally created forces 
toward imitation. The extent to which Ss ex- 
pressed their individuality, even while imitat- 
ing, and the extent to which they attempted to 
avoid imitating altogether, points to an aspect 
of social behavior that has only recently begun 
to be stressed experimentally (1, 2, 3). Re- 
searches in social psychology have tended to 
put primary emphasis on conformity behavior 
and have not focussed on the ways in which 
members of social groups resist being influenced 
and find ways to express their individuality. 
The norms of a group not only define areas 
where conformity is demanded, but also define 
areas where deviance is permitted, in fact en- 
couraged, if the group is to be at all creative. 


SUMMARY 


A study was carried out to test the Miller 
and Dollard theory of imitation on adult human
Ss. A significant number of Ss “learned”
to imitate a model when such imitation was
rewarded, and the imitative response generalized
to a similar but new situation where it was
no longer rewarded. Three separate tests of
generalization were run, varying in the degree
of similarity to the original test situation. Positive
generalization appeared on the most similar
test. Some negative generalization appeared
on the test of intermediate similarity, but no 
generalization appeared on the test of least 




similarity. This reversal was interpreted to be a 
result of the fact that Ss perceived the test of 
intermediate similarity to be the least similar 
test. 

The results tended to support the Miller and 
Dollard theory, but Ss' reactions were highly 
complex. The results were interpreted as re- 
flecting a complicated compromise between 
the various motives that were operating in the 
experimental situation and the degree to which 
the model's behavior was "understood" by 
Ss. 

REFERENCES 


1. ASCH, S. E. Effects of group pressure upon the
modification and distortion of judgments. In H.
Guetzkow (Ed.), Groups, leadership and men.
Pittsburgh: Carnegie Press, 1951. Pp. 177-190.

2. ASCH, S. E. Social psychology. New York: Prentice-Hall,
1952.

3. BERENDA, R. W. The influence of the group on the
judgments of children. New York: King's Crown
Press, 1950.

4. FESTINGER, L., et al. Theory and experiment in
social communication. Ann Arbor: Research Center
for Group Dynamics, Univer. of Michigan,
1950.

5. MARPLE, C. H. The comparative susceptibility of
three age levels to the suggestion of group versus
expert opinion. J. soc. Psychol., 1933, 4, 176-186.

6. MILLER, N. E., & DOLLARD, J. Social learning and
imitation. New Haven: Yale Univer. Press, 1941.

7. MOORE, H. T. The comparative influence of majority
and expert opinion. Amer. J. Psychol.,
1921, 32, 16-20.

8. SHERIF, M. A study of some social factors in perception.
Arch. Psychol., 1935, 27, No. 187.

9. WHEELER, D., & JORDAN, H. Change of individual
opinion to accord with group opinion. J. abnorm.
soc. Psychol., 1929, 24, 203-206.


Received April 13, 1953. 


IS THERE A MECHANISM OF PERCEPTUAL DEFENSE? 
A REPLY TO POSTMAN, BRONSON, AND GROPPER 


RICHARD S. LAZARUS 


The Johns Hopkins University 


In the April 1953 issue of this journal,
Postman, Bronson, and Gropper (10) presented
an argument and some data which
were offered in opposition to the controversial
concept of perceptual defense. Since there are 
some very serious flaws in their argument, and 
since they appear to ignore a number of rele- 
vant studies which have recently been pub- 
lished, it is important to offer a brief defense of 
the concept and a rebuttal of the position of 
these authors. 


THE CONCEPT OF DEFENSE 


The hypothetical construct of psychological 
defense originated, not from experimental 
data, but from observations of people with be- 
havior disorders. It is useful because it offers us 
an explanation for the apparently illogical and 
contradictory behavior of persons whose rea- 
soning, memory, and interpretation of inter- 
personal situations seem to deviate from a 
realistic evaluation of an actual state of affairs. 

Perceptual defense as it is used by the per- 
sonality-oriented experimenter is merely one 
manifestation of the general idea of a defense 
process which should produce observable effects 
in any cognitive situation where the indi- 
vidual's security or self-esteem is threatened, 
and where there is offered some opportunity 
for him to reduce the affective disturbance 
attendant upon such a threat or to gratify the 
motivation of seeing himself in a particular 
light. Therefore, some of the rules by which the 
process of defense operates should be the same 
whether the subject is asked to perceive threat- 
ening material, learn it, or recall it. Certainly 
these rules will be modified in terms of what 
we know about the specific perceptual process
as compared with the learning or recall process.
And the exploration of these rules from a per- 
ception or recall point of view has not even 
been begun systematically. Therefore, the 
question before us is concerned with whether or 
not the concept of defense, in the general sense, 
is reasonable. 

The criticism of the data which McGinnies 
(9) has used to establish perceptual defense is 
perfectly appropriate. Actually, if the concept 




stood or fell on these data, it would have little
support. There are two reasons why the McGinnies-type
methodology is seriously defective.
In the first place, when verbal stimuli
which are judged emotional or threatening are
placed in opposition to material which is
judged nonemotional or nonthreatening, and
when learning, recall, or perception of them is
studied, it is absolutely impossible to establish
whether differential performance in a group of
subjects is associated with the emotional character
of the material or other associative factors
such as familiarity, attention value, etc. Yet
one finds current experimenters still performing
these kinds of experiments in learning,
retention, and perception (2, 3, 6, 10).

Second, it was recognized long ago that the
concept of defense as we now use it does not
predict that all persons will deal with threat
in the same way. It appears that we have not
even caught up with Freud himself, who long
ago decided that repression was not the only
mechanism of defense. The only experiments
which are relevant to the notion of a process of
defense are those which look, not for main
effects (e.g., mean differences in group perception
of emotional vs. nonemotional material),
but for interactions between different
kinds of personalities and different ways of
dealing with threat. The McGinnies-type experiment
is therefore not an appropriate empirical
relationship with which to test the
notion of psychological defense. The Postman,
Bronson, and Gropper attack on the concept
of defense in perception on the basis of a few
experiments which are not really appropriate
tests of it, and without recourse to those experiments
which are appropriate, is perhaps
the height of need in perception. I shall consider
later those experiments which might
have been cited with better justification.


THE CONCEPT OF WORD FREQUENCY


In place of the notion of defense, Postman,
Bronson, and Gropper, as well as others,
offer the notion that the differential frequency
of usage, in written English, of the critical and
noncritical words accounts for the perceptual
selectivity found in experiments in this area.
The same argument may be used in dealing
with selective recall.

If there really were a marked correlation 
between word frequency and recognition 
thresholds, if we were willing to overlook the 
limitations of the written-word counts that are
usually used, and if we overlooked the fact
that these word counts tell us little about the
familiarity with the experimental words for
any one individual, then this would still be a
limited argument with which to attack the
need and perception experiments. We should
note that word familiarity is an intervening
variable which is inferred from verbal behavior,
just as motivation or personality is inferred. 
It is likely that word frequency itself is a 
product of the individual’s personal needs and 
social experience. At best, it is possible to say 
that the correlation between needs or values 
and recognition thresholds may be mediated 
by this differential word familiarity. 

But the fact is that the correlations between
word frequencies and visual recognition thresholds
are minimal, particularly at the higher
word frequencies which are typically found in
most experiments. Most of the covariation
seems to occur when words of zero frequency
are compared with words with slightly greater
frequencies and over a very wide range. This
fact may be established by examining the data
reported by Solomon et al. (5, 13), who are the
primary exponents of the word-frequency
hypothesis. Little covariation seems to occur
when highly frequent words of differing frequency
value are used, and it is these high-frequency
words which are typically employed
in perceptual studies. There is not much doubt
that word frequency is a variable in perceptual
recognition under certain conditions, but it is
not a variable of such great importance and
[...] as has been suggested.

It should also be pointed out that Postman,
Bronson, and Gropper cite data of their own
in which they assigned frequencies to their
[...] words and found that recognition of
[...] words was faster than recognition of
[...] words. Even the above authors
should be aware of the inadequacy of the
[...] assigned word frequencies. To two
words, Kotex and penis, they give the lowest
frequency, matching them with such rather
[...] words as cleat and mixer (even though
males have penises and therefore must be



highly familiar with the concept). Words like 
raped get almost the same rating in frequency 
as words like capon, lathe, and lotus. Whore and 
tiara are equally matched. Now, whatever the 
written language looks like, it seems rather 
unlikely that a normal sample of college 
students will find such a word as tiara equally 
unfamiliar with words like bitch and whore; or 
capon, lathe, and lotus comparable with words 
like raped. Surely a vocabulary test or some 
other indication of familiarity with the words 
would serve a good purpose here. 

There is also no doubt that findings in 
terms of mean differences in thresholds for 
emotional and neutral words could be made to 
go in either direction by the appropriate selec- 
tion of words on the basis of such variables as 
context, contrast, sequential probabilities, and 
a host of other factors not directly related to 
word frequency. We have no way of knowing 
which variables were operating in this type of 
study. But this is irrelevant to the concept of 
defense. 


Some EXPERIMENTAL DATA 


There are some recent data relevant to the 
question of psychological defense in memory 
and perception which might be cited here, since 
they are apparently not widely known. These 
data include differential thresholds in the perception
or recall of emotional and nonemotional
stimuli which have been found to correlate 
with clinical typologies, or with the results of 
additional exposure to different kinds of 
stresses. For example, Lazarus, Eriksen, and 
Fonda (7) were able to show that poor auditory 
perception of emotional material was charac- 
teristic of patients with hysterical personalities, 
while the reverse was true for obsessive-com- 
pulsives. In addition, Rosenzweig and his 
collaborators (11) some years ago related 
selective memory for successes under ego- 
involving conditions to impunitiveness and 
hypnotizability. The work of Alper (1) on 
recall and ego strength should also be noted in 
this context. 

Furthermore, Eriksen (4) has shown that 
subjects who remembered their successes had 
higher recognition thresholds and slower learning
for words on which they blocked in a word
association test than for fast-response-time 
words; while subjects who showed superior 
recall of failures exhibited no retardation of 
learning, and had just about the same recognition
thresholds for words on which they
blocked as they had with words on which no 
blocking occurred. 

Recently, Lazarus and Longo (8) have dem- 
onstrated that selective recall of failures in an 
interrupted task experiment was associated 
with superior recall of nonsense syllables which 
had been associated with a painful electric 
shock, while selective recall of successes was 
associated with better recall of nonshocked 
syllables. There seems to be some consistency 
in the way subjects deal with stress from situ- 
ation to situation; that is to say, selective recall 
and perception depend upon personality char- 
acteristics of the subjects rather than on the 
specific nature of the material. 

No one questions the observations that are 
regularly made about individuals who, in every- 
day life and in the clinic, misperceive and mis- 
interpret the social events in which they 
participate. The extent of this perceptual or 
interpretive failure may range from mild “blind 
spots” to psychotic “lack of contact.” Even if 
the laboratory were unable to duplicate or 
verify this kind of observation with its feeble 
experiments with words and its minimally 
stressful situations, the compelling force of 
these observations still could not be disre- 
garded. 

There is beginning to be a considerable num- 
ber of appropriate experimental studies of 
defense mechanisms as they appear to operate 
in cognitive situations. Despite the age of these 
concepts, only a beginning has been made. 
Experiments have shown that it is reasonable to 
speak about processes of defense, at least in 
terms of consistent styles or habits of respond- 
ing to threat. However, the experiments have 
certainly not gone far in identifying the rules 
by which the process operates. Ultimately, 
perhaps, experiments along these lines could 
tell us more specifically how defenses operate in 
such forms of behavior as perception and 
memory. But in the absence of such data, our 
experiments have begun to confirm the clinical 
belief that the postulation of such a process is 
both reasonable and useful. It can lead us to 
make predictions which would not be made in 
the absence of the concept. Moreover, it should 
be noted that a parsimonious explanation
which could account for a limited series of
experimental data might not be an explanation 
at all (much less parsimonious) of the manifold 




clinical observations which have led to the
development of the concept in the first place.


SUMMARY 


The present article is a rebuttal of the point 
of view expressed in this journal by Postman, 
Bronson, and Gropper (10), who criticized the 
concept of perceptual defense and argued that 
it is not supported by the experimental evidence.
Issue was taken with these authors concerning
their understanding of the concept of
defense, their espousal of the notion of word 
frequency as an explanation of perceptual 
recognition, and the limited variety of studies 
which they criticize and cite as relevant to the 
defense concept. 


REFERENCES 


1. ALPER, THELMA G. Memory for completed and incompleted
tasks as a function of personality:
correlation between experimental and personality
data. J. Pers., 1948, 17, 104-137.

2. COMBS, A. W., & TAYLOR, C. The effect of the
perception of mild degrees of threat on performance.
J. abnorm. soc. Psychol., 1952, 47, 420-424.

3. COWEN, E. L., & BEIER, E. G. The influence of
"threat expectancy" on perception. J. Pers.,
1950, 19, 85-94.

4. ERIKSEN, C. W. Defense against ego-threat in
memory and perception. J. abnorm. soc. Psychol.,
1952, 47, 230-235.

5. HOWES, D. H., & SOLOMON, R. L. Visual duration
threshold as a function of word-probability.
J. exp. Psychol., 1951, 41, 401-410.

6. LAFFAL, J. The learning and retention of words with
association disturbances. J. abnorm. soc. Psychol.,
1952, 47, 454-462.

7. LAZARUS, R. S., ERIKSEN, C. W., & FONDA, C. P.
Personality dynamics and auditory perceptual
recognition. J. Pers., 1951, 19, 471-482.

8. LAZARUS, R. S., & LONGO, N. The consistency of
psychological defenses against threat. J. abnorm.
soc. Psychol., 1953, 48, 495-499.

9. McGINNIES, E. Emotionality and perceptual defense.
Psychol. Rev., 1949, 56, 244-251.

10. POSTMAN, L., BRONSON, WANDA C., & GROPPER,
G. L. Is there a mechanism of perceptual defense?
J. abnorm. soc. Psychol., 1953, 48, 215-

11. ROSENZWEIG, S., & SARASON, S. An experimental
study of the triadic hypothesis: reaction to frustration,
ego-defense, and hypnotizability. I.
Correlational approach. Charact. & Pers., 1942,
11, 1-20.

12. SOLOMON, R. L., & HOWES, D. H. Word frequency,
personal values, and visual duration thresholds.
Psychol. Rev., 1951, 58, 256-270.

13. SOLOMON, R. L., & POSTMAN, L. Frequency of usage
as a determinant of recognition thresholds for
words. J. exp. Psychol., 1952, 43, 195-201.


Received May 18, 1953. 


A THEORETICAL APPROACH TO PSYCHOLOGICAL MOVEMENT¹


ALVIN R. HOWARD² AND GEORGE A. KELLY


The Ohio State University

Scientists almost universally agree that
their manifold efforts are directed toward
prediction and control of their subjects,
animate or inanimate, human or not. In this
paper, we are concerned with the human being, 
specifically the maladjusted individual, and 
the changes he evidences over a given time
interval. We feel that movement is of prime
interest to the student of human behavior
and that a fuller comprehension of this phenomenon
will enhance the effectiveness of
therapeutic (broadly defined) techniques.

It is assumed that the therapeutic situation
is one that is designed to hasten the process of
change. We infer also that all changes in behavior
follow upon the construing of these
changes (2). Behavioral changes do not just
happen; they are man-made. Therapy provides
a framework wherein accelerated change
may occur.

Saying that our ideas, our aspirations, precede
the actual working out of these ideas and
aspirations is another way of stating that behavioral
change follows the construction of
change. For example, today we plan an investigation
which necessarily must precede
our doing the investigation. Our ideas tend to
run ahead of our “actual” performance, or our
“actual” behavior. A further example, simple
but cogent, is furnished in the area of literary
composition. We tend to compose our thoughts
faster than we can put them on paper. In the
sense that we are aware of what we intend to
write, we anticipate extensive movement. But
if we were halted at any given point and our
actual movement measured to that point,
we might find that little change had occurred.
The writer, however, affected by his
over-all construction of the situation, perceives
his own movement in terms of this greater

¹ From a dissertation submitted by Howard,
under the direction and assistance of Kelly, to the Department
of Psychology, The Ohio State University, in
partial fulfillment of the requirements for the Ph.D.
degree. Grateful acknowledgement for their generous
aid is made also to Drs. H. Gordon Hullfish, Francis
P. Robinson, Julian B. Rotter, and Robert J. Wherry.
² Veterans Administration Hospital, Chil-

399 


University 


plan and thereby distorts his perceptions to 
that extent. More will be said of this contrast- 
ing process below. 

In therapy, we offer the patient a situation 
in which the construction of change may be 
facilitated. The therapeutic situation pro- 
vides him with a framework in which he may 
structure changes taking place. Although he 
may accomplish this to some extent in “un- 
guided real life," his therapist aids him in a 
more rapid realization of what is transpiring. 
Under the therapist's guidance, he becomes 
more effective in perceiving the changes, ex- 
tensive and limited, that he has undergone. 
In addition, the patient is helped to become 
more aware of future changes. He broadens his 
perspective to include more distant goals; his 
construction of events runs ahead of the more 
readily observable behavioral changes. 

The reader may have noted the synonymous 
utilization of the terms, “movement” and 
“change.” It is necessary that our present 
use of these terms be understood. In general, 
we regard movement or change as a constantly 
varying behavioral process. The individual 
manifests behavioral variations throughout 
his existence. Measurement is accomplished 
through arbitrary segmentation along the time 
dimension. Movement is here discussed in two 
contexts, measured and perceived. Measured 
movement refers to the variation ascribed to 
an individual's behavior by "outside" ob- 
servers. It is the change noted in behavior de- 
riving from observation of the individual on 
two separate occasions. Perceived movement 
denotes the variation ascribed by the indi- 
vidual to himself at two instants in time. This 
is a more complex process than appears at 
first glance. It is not simply a matter of twice 
reporting on himself. At one instant, his per- 
ceiving of present status will also be somewhat 
affected by his anticipation of future move- 
ment, and, to an even lesser degree, by con- 
structions of prior events. At the second 
instant, his perception of past status is slightly 
affected by anticipation of future movement, 
and, to a much greater degree, influenced by 
a reorganization of the past derived through
the construction of the present. In any event,
the individual's perception of the temporal
changes in himself can be expected to be more
distorted than that of a "neutral" observer.
Examination of the effect of future anticipations
upon behavior remains largely for later
study. More fully treated below is the perception
of past status.

FIG. 1. OPERATION OF CONTRAST-ENHANCEMENT ON
PERCEIVED STRENGTH (+) AND PERCEIVED
THREAT (−)
[Figure: points labeled SE-A, SE-L, and SE-LA]

It is hypothesized that perceived movement 
is greater than measured movement. It is our 
contention that this derives from a “contrast” 
phenomenon not unlike that which may be as- 
cribed to the work of such investigators as 
Bartlett (1), Zeigarnik (4), and Ovsiankina 
(3). As a result of Bartlett's pioneer efforts, 
for example, we have since accepted the notion 
that perception is functionally selective. He 
demonstrated effectively that past experiences 
influence present perceptions and that recall 
of these perceptions changes with time through 
reorganization of the past. In the repeated 
reproduction of stories, for example, certain 
elements are successively distorted so that 
they become increasingly removed from the 
originals. This process of distortion we have 
chosen to term a contrast from the measured 
(or original) perception. The studies of Zei- 
garnik and Ovsiankina may be similarly 
viewed. They demonstrated the tendency of 
individuals to recall interrupted tasks more 
often than tasks which had been completed. 
The aim was completion, and when the aim
was not met, the contrast was sufficiently great
to engender greater remembrance of the
failures.


sharper details out of a time telescoped; that
is, an individual's segmentation of time differs
from an "actual" time interval. The past becomes
a comparatively more undifferentiated
mass in which earlier and later events merge
or telescope. From this mass the individual
recalls only the “figures” or “sharper details.”
Concomitant with “actual” movement, the
individual perceives himself to be “playing
a new role(s).” As a result of these new
constructions, any degree of change tends to be
“overexaggerated” so that the contrast between
past and present events is “overenhanced.”
Perceiving movement in himself, the
individual will, upon contrasting what to him is
the relatively static past with the ever-moving
present, tend to construe a continually widening
gap between his present and past behavior.
An illustration of the manner in which
contrast-enhancement is deemed to operate may
be helpful (Fig. 1). Let us suppose that a given
individual is requested to render three
self-evaluations (SE): (a) at instant A concerning
current status at A (SE-A); (b) at instant L
(for later) concerning current status at this
subsequent point in time (SE-L); (c) also at
instant L but concerning status at instant A
(SE-LA). Let us further suppose that, when
rendering SE-LA, this individual perceives
himself to feel much better at instant L than
he felt at an earlier point in time, A. The
construction system at A, now perceived relative
to the (forward) construction system at L, is
viewed in “sharper contrast,” i.e., the individual
says, in effect: “I see this constellation
at L which depicts my assets and liabilities.
Now (L) I have more assets than liabilities
and I know I wasn't feeling as well before
this (A). In fact, according to the way I feel
now, I must have been pretty low at that
time.” Thus, as a result of contrasting past
and present, a "distorted" evaluation
appears at LA. A formerly perceived threat is
now recalled as a strength, and vice versa, or a
formerly perceived strength is now seen
as a strength of even greater magnitude.
This apparent perceptual distortion leads us
to hypothesize that perceived movement is
greater than measured movement, i.e., the
difference perceived by the individual as
represented by his evaluations at L and LA
exceeds the difference noted by external
observers at L and A. Thus L serves as a kind of
buffer through which perceptions of the past
are filtered and distorted. The “figures” of the
pattern constituting A are heightened by this
contrasting process and the “ground” becomes
even further removed. But the figures are not
carried over to LA unchanged. In the course
of time, and as the result of filtering through
L, they undergo alteration. The “outside”
observer, noting behavior merely on two separate
occasions, does not respond to these
“filtering” conditions.
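
The comparison stated in this hypothesis can be made concrete with a small sketch. It assumes, purely for illustration, that self-evaluations and observer ratings are expressed on a common numeric scale; the scale, the function names, and the example values are hypothetical and are not taken from the study.

```python
def perceived_movement(se_l: float, se_la: float) -> float:
    """Change the individual ascribes to himself: SE-L minus SE-LA."""
    return se_l - se_la


def measured_movement(obs_l: float, obs_a: float) -> float:
    """Change noted by outside observers between instants A and L."""
    return obs_l - obs_a


def contrast_enhanced(se_l: float, se_la: float,
                      obs_l: float, obs_a: float) -> bool:
    """True when perceived movement exceeds measured movement in size."""
    perceived = abs(perceived_movement(se_l, se_la))
    measured = abs(measured_movement(obs_l, obs_a))
    return perceived > measured


# Hypothetical ratings on an arbitrary scale (higher = better adjusted).
se_l, se_la = 6.0, 2.0     # self-evaluations rendered at instant L
obs_a, obs_l = 4.0, 5.5    # observer ratings at instants A and L

print(perceived_movement(se_l, se_la))               # 4.0
print(measured_movement(obs_l, obs_a))               # 1.5
print(contrast_enhanced(se_l, se_la, obs_l, obs_a))  # True
```

In this invented case the recalled status at A (SE-LA) is placed lower than the observers placed it, so the self-ascribed change is larger than the observed change, which is the pattern the hypothesis anticipates.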

A further basis for understanding the process
of movement presents itself. An individual
becomes aware of the variance in his
construction system as events are successively
construed by him. At L he construes only
limited variation in that he tends to confine
his perceptions to a relatively shorter period
of time than he does at LA. At “point” L in
the evaluation of his psychological processes,
he is aware of changes which have transpired,
but to a greater degree on a level of “less
cognitive awareness” (2). At LA he is requested
to bring this awareness to a higher
(more cognitively aware) level whereupon
three processes occur: (a) removal from
restrictions of less cognitive awareness at L;
(b) “conscious” perception of a greater number
of successive constructions occurring over a
longer time span, tending to remove the person
from the limited perspective at L;
(c) evaluation of the concept of change itself.

Movement may be regarded as a manifestation
of the rate of organization of experience
along a time line. Differences in the degree of
movement between individuals may be viewed
as the result of differences arising in such rates.
If two persons exposed to the same (therapeutic)
situation reveal significantly dissimilar
degrees of movement, one is obviously
organizing at a slower rate than the other.
Holding as a therapeutic objective the maximum
absorption in the briefest time, we must
devise means for gauging a particular
individual's organization rate. As the point of
diminishing returns is approached, the process
of termination may be profitably initiated.
If, then, we examine historical factors in order
to group various “kinds” of subjects, we should
be able to predict the extent of their movement
in a given situation and within a specified
period of time.

In this regard it is felt that case histories 
may indeed afford a fruitful source of hy- 
potheses. Our predictions would cluster about 
the “hunch” that certain construction systems 
held earlier, or certain cultural characteristics 
to which an individual has been exposed, tend 
to hinder movement; others tend to facilitate 
movement. For example, the person who con- 
strues life apart from his mother as unbearable 
would be expected to anticipate backward 
movement in an institutional framework. On 
the other hand, the individual who has been 
actively striving for years to sever the um- 
bilical cord may welcome a hospital environ- 
ment relatively free of patronizing authority 
figures and construe forward movement within 
such a situation. We would hypothesize “re- 
gression” also on the part of adherents to an 
atmosphere of cultural propriety when they 
are exposed to the customary inelegancies of 
institutional existence. 

We have tended to view the fundamental 
aim of therapy, broadly conceived, as the 
stimulation of more rapid change in the patient 
than would result from his own unguided ef- 
forts. The individual who perceives himself 
as remaining relatively stationary—‘I don’t 
seem to be getting anywhere"—lacks the ex- 
perience of felt movement. Left to his own 
devices, it would appear that his future chances 
for rapid movement are rather slight. If 
therapy is to prove successful, the patient 
must feel that he is changing, or at least per- 
ceive himself to be capable of changing. 

Extending our discussion to the classroom, 
we note that the teacher teaches and the 
learner learns. Some pupils assimilate knowl- 
edge at a faster rate than others. If a student 
is to realize the maximum from his attendance, 
he must somehow develop the notion that he 
is in the process of changing; for if he per- 
ceives no change, he may inquire, “What good 
is an education?” As he perceives that he is 
learning, as he perceives that school experiences 
are making him different from what he was 
earlier, his assimilation rate may be expected 
to accelerate (at least for a time). Perhaps test 
results and grades (the measured change) will
fail to reveal movement as extensive as that
perceived by the student; nevertheless, if he 
perceives a greater change in terms of some 
personal norm, it should, for the serious student 
at least, provide sufficient incentive for further 
change. As long as he contrasts a prior status 
with his “better educated” status of the mo- 
ment, he will continue to view his school ex- 
periences as desirable events. 

Perhaps then, our use of objective tests (in 
school and elsewhere), group norms, and grades 
has too long overemphasized status and dis- 
regarded the essential aim, movement. More 
stress must be placed upon the individual’s 
perceptions of the learning process. If he views 
a situation to be beneficial to him, his con- 
tinuance in that situation should be facilitated. 
If he feels that he is learning something in the
classroom, he will seek further education. Why 
risk stifling incentive, particularly in the 
formative years, with our noneradicable “final 
grades” which so frequently result in the in- 
ference of “final” attainment rather than in 
the inference of having achieved merely a 
given termination? Perhaps a more useful 
approach would be to “ask him" what he be- 
lieves he has learned, and then clarify the 
"facts" relating to those issues on which he 
appears to be misinformed. In this manner, 
we first pay heed to the person's perceptions, 
and we do not hamper initiative to the degree 
that may occur when an arbitrary standard 
(a desired measured status) is imposed from 
above. 

Viewing the phenomenon of change in terms 
of "contrast" permits us to link the not-too- 
dissimilar areas of (formal) education and 
therapy. In the former, we attempt to convey 
diverse information, often relying, in our text- 
books and lectures, upon examples which 
tend to clarify the issues under discussion. 
As a result, frequently we resort to over- 
enhancement of differences and similarities; 
we draw sharp contrasts, emphasizing dis- 
crepancies; and we offer oversimplified analo- 
gies, highlighting likenesses, in order to “make 
the point stick." Similarly, in therapy, we draw 
upon a relatively minor change that the patient 
has perceived in himself at an earlier date and 
utilize this to stimulate further change. We 
recall to him the movement he reported earlier, 
contrasting it with a movement he now reports,
in our endeavor to encourage his anticipation
of further change. As in the educational
situation, we resort to overenhancement. We
are relatively uninterested in the actual, or
true, change that occurs. We value more
highly the perception of change taking place
than a more objective appraisal of what is
“actually” occurring.

In the school this means that we foster the 
perception that one is learning, that one is 
different from what he was at a period when 
he knew less. Our sharp, clear-cut examples 
readily register with the student, with the re- 
sult that he perceives movement in his rapid 
grasp of a concept, a solution, or a technique. 
In the therapeutic situation, this means that
we guide the patient in the perception of
contrast between earlier behaviors and contemporary
ones. We strive to get him to see that he
is changing or has changed. In both situations,
so long as the individual perceives that he has
benefited, he is less likely to terminate
inopportunely. It matters little if, by all objective
criteria, he has evidenced less movement
than he has perceived. Indicating that he is
capable of perceiving movement, he has indicated
that he is capable of continuing such
behavior in the future. In both situations, we
help the individual adopt a point of view by
means of which he considers himself to be a
continuing learner; we guide him toward
establishing a framework of thinking within
which he may structure future changes as a
learner, or as an "improved" patient. One useful
principle leading toward the attainment
of this goal is that of “contrast,” whereby the
teacher or the therapist may help the person
feel that his change may well be considered
more extensive than it might be in terms of
some arbitrary measured standard.

We have, at several points, referred briefly
to anticipations of future movement. Similarly,
the sole mention of the direction movement
may take has been confined to illustrative
material. Simultaneous elaboration of these
areas is possible.

The fundamental postulate in Kelly's “psychology
of personal constructs” reads: “A
person's psychological processes evolve toward
what he construes to be an optimal anticipation
of events” (2, p. 29). It is the “evolutionary”
aspect of the postulate upon which
primary stress must be placed at this point.
Perhaps the person who revamps his construction
system to the extent that he perceives
more change in himself than does another
observer (even if this is not now improvement
in the customary sense) is actually in the
process of anticipating greater prediction and
control of his behavior in the future. As an
individual initiates movement, a wider range
of subsequent movement becomes potentially
available to him. The more movement occurring,
the more numerous the chances of eventual
satisfactory "anticipation of events."

We have said that therapy is designed to 
hasten movement. If one “kind” of therapy 
is more accelerating than another, we would 
expect persons in the former situation to have 
available to them even more opportunities 
for anticipating events. For example, if we 
select two groups of hospital patients on the 
basis of whether this is the first confinement 
or a subsequent one, we would hypothesize a
greater degree of movement on the part of the 
new admission. This would apply equally to 
both perceived and measured movement and 
derives from the theoretical deduction that 
first admissions are in a more accelerated 
therapy situation than are readmissions for 
two reasons. First, the new patient has not 
been in the same milieu before. He thus per- 
ceives more freedom to move, in whatever 
direction; he may either be frightened by the 
novelty and therefore move backward, or 
take advantage of the new environment and 
the removal from one in which he was poorly 
adjusted and thus change “for the better." 
Second, institution personnel tend to minister 
to the first admission patient to a greater degree 
than to the readmission patient. The psychiatrist
conducts a more detailed initial interview
with him. In rapid succession, he undergoes
a series of routine examinations: physical,
psychological, social service. Again he may
move in either direction, backward or forward.
All this attention may serve to make him
feel that people are helping to cure him, or it
may result in his feeling that he must indeed
be ill to have so many persons consulting
him.

We have, then, what is essentially an accelerated
therapy situation for the first admission.
The readmission, on the other hand,
has undergone the hospitalization process a
number of times. For him, there is less novelty,
less contrast from one confinement to the next.
The routine is the same, the personnel are the
same, the wards are the same. Returned to
the same situation, he remains less likely to 
perceive changes in himself and less likely to 
be observed by others to have changed. 

Thus we note that before an individual can 
change, he must construe change as being 
potentially available to him. Therapy may aid 
by structuring the need for change. Once such 
construction has occurred, certain behaviors 
will be observable to others. The person who 
perceives that he is moving forward will be 
noted to be moving forward; the person who 
perceives that he is moving backward will be 
noted to move in a backward direction. Analo- 
gous observation will be made of the person 
who perceives that he is at a standstill, that he 
is “getting nowhere.” There are occasions, 
however, wherein an individual is observed to 
be changing to a greater degree than he per- 
ceives. It is felt that this trend will become 
reversed, in view of our earlier statements 
concerning behavior following one’s construc- 
tion system. This person will soon be observed 
to be behaving more in accord with his own 
perceptions. If he perceives backward movement,
or no movement at all, his behavior will
likewise reflect this. Should his perception of 
movement alter so that now he sees more move- 
ment in himself than others do, we have then 
the situation originally posited. 

Thus the personal construct system is re- 
flected through successive modifications of 
behavior. By periodically observing behavior, 
we may infer the nature of the perceptual 
system. Rather than content ourselves with 
a cessation of operations at that point, we 
would undoubtedly desire to gauge the extent 
to which a person’s behavior is a consequence 
of his construction system. One method would 
be to request periodic self-evaluations of an 
individual. During the intervals between the 
rendering of these, observational records of 
the individual’s behavior in various activities 
should be maintained. The primary objective 
would be to predict from any given self-evalu- 
ation, i.e., the construction system at a given 
point in time, the behavior that the individual 
is likely to exhibit during a specified future 
period. The observer's records would serve as 
a check on the predictions made and, of greater 
import, may be expected to yield principles of 
prediction applicable to other situations. 


SUMMARY 

An approach to psychological movement, or 
change, has been presented. Stress has been 
placed on the role of therapy in accelerating 
movement. Earlier investigations have afforded
the basis for currently offering a “contrast”
phenomenon to explain certain aspects
of movement and to link the fields of (formal) 
education and therapy. Implications of these 
and other fields are noted. 




REFERENCES 


1. BARTLETT, F. C. Remembering. Cambridge, Eng.:
Cambridge Univer. Press, 1932.

2. KELLY, G. A. The psychological construction of
life. Unpublished manuscript, Psychol. Clinic
Library, Ohio State Univer., 1951.

3. OVSIANKINA, M. Die Wiederaufnahme von
unterbrochener Handlungen. Psychol. Forsch., 1928,
11, 302-389. (In Psychol. Abstr., 1929, 3, 2599.)

4. ZEIGARNIK, B. Das Behalten erledigter und
unerledigter Handlungen. Psychol. Forsch., 1927, 9,
1-85. (In Psychol. Abstr., 1927, 1, 2365.)

Received April 14, 1953.




THE EFFECT OF EXPERIMENTAL AROUSAL OF THE AFFILIATION
MOTIVE ON THEMATIC APPERCEPTION¹

JOHN W. ATKINSON, ROGER W. HEYNS, AND JOSEPH VEROFF

University of Michigan

Shipley and Veroff (9) have reported scoring
criteria for thematic apperception
stories to provide an index of strength
of the affiliation motive (n Affiliation). Following
the logic of earlier motivation studies
(1, 3, 6, 7), they found that experimental
arousal of n Affiliation increased the frequency
of imaginative stories dealing with affective
concern over separation from another person.
In the present experiment, conducted for the
same purpose with several differences in
experimental procedure and a different set of
pictures, separation imagery, as a criterion for
designating affiliation-related stories, did not
by itself discriminate between experimentally
aroused and control conditions. This paper
presents the experimental findings which support
a broader definition of affiliation imagery
and scoring procedures for obtaining a measure
of strength of n Affiliation, and a reconciliation
of the differences between the two studies.

The theoretical basis for the procedures
employed is elaborated elsewhere (3; 5, ch.
12; 6, chs. 2, 7; 10). A series of pictures was
selected to which male college students might
be expected to reveal motivation for social
acceptance (n Affiliation). Then, since the aim
of this research is to define clearly the kinds of
imaginative responses that are symptomatic of
n Affiliation, two test situations were designed:
one in which affiliation-motivating cues prior
to the writing of stories would be maximized,
and one in which such cues would be minimized.
A 20-minute period of sociometric ratings
conducted in the dining room of a fraternity
house (also used in the Shipley-Veroff
study) met the requirement for a situation
containing many cues which would arouse
anticipations of affective changes contingent
upon being liked and accepted by others or
disliked and rejected. A college classroom in
which the subjects (Ss) performed a 10-minute
anagrams task following a task-orientation-type
instruction fulfilled the theoretical requirement
for a situation having fewer affiliation
cues and, if anything, incompatible cues.
It is assumed that this control condition
contained fewer affiliation-motivating cues than
the fraternity dining room setting without the
sociometric procedure which constituted the
control condition of the Shipley-Veroff experiment.

1 We wish to acknowledge the initial support of
this research by a grant from the Horace H.
Rackham School of Graduate Studies, support of its
continuation by a Social Science Research Council
Fellowship to JWA, and the portion of RWH's time made
available during 1952-53 as a result of a Ford
Foundation grant for related research on conformity.

METHOD 
Procedure 


Aroused condition. The Ss were 31 members of a 
fraternity at the University of Michigan (spring, 1951), 
which received $25.00 for participation in this experi- 
ment. They congregated in the dining room following 
an evening meal. The rationale offered for the socio- 
metric procedures included reference to research find- 
ings on conferences which showed how important inter- 
personal relations were in relation to satisfaction with 
group performance. The sociometric procedure is
described in detail elsewhere (9). It consisted of ranking
traits according to the degree to which possession of
the trait would make a person likeable, description of
self and members of one's own class delegation in terms
of these traits, and finally choice of at least three persons
as most desirable close personal friends.

After about 20 minutes of activity of this sort 
(assumed to arouse n Affiliation), the imaginative 
measure of motivation was administered according to 
the procedures described in earlier work on the achieve- 
ment motive (6, 7). Each S wrote four-minute stories 
in response to six pictures. The pictures used were the
five originally developed by Henry and Guetzkow (4) 
for use in research with small groups and a sixth from 
the n Achievement series. The pictures in order of 
presentation with their code designations for the pres- 
ent study were: (a) the heads of two young men facing 
each other in conversation; (b) a man standing in an 
open doorway looking away from the viewer at the 
landscape outside; (c) six men seated about a table as 
if in conference; (d) an older woman with a troubled 
expression looking away from a younger man; (e) four
men sitting and lounging informally in a furnished 
room; and (f) a boy seated at a desk holding his head 
in one hand (H of the n Achievement series). 

Control condition. The 36 Ss were male students in
an introductory psychology course (spring, 1951). The
E was introduced during a class period as a person
interested in collecting preliminary data for some other
work.² Prior to administration of the six pictures, the
classroom group was given an anagrams task (6). They
were instructed to make as many words out of the
word "generation" as they could during the limited
time period. No particular emphasis was put on the
importance of this task. Its purpose was to reduce the
influence of any affiliation-motivating cues in the
classroom situation by introducing incompatible
achievement-related cues. After 10 minutes of this,
the measure of motivation was introduced.

2 We wish to acknowledge the assistance of David
Klaus in running this condition.


n Affiliation Scoring System 


The breakdown of imaginative behavioral sequences 
into the categories developed earlier for hunger (1) and 
achievement motivation (6, 7) was accomplished for 
these stories in terms of affiliation. The affiliation 
imagery criterion, which defines the concept of n Affilia- 
tion, is given in detail. A complete scoring manual with 
illustrative examples is in preparation.³

Affiliation Imagery (Aff Im) is scored when a story 
contains evidence of concern, in one or more of the 
characters, over establishing, maintaining, or restoring 
a positive affective relationship with another person(s).
Concern of this sort is immediately evident if the 
relationship is described as friendship. 

In the absence of a clearly stated relationship of 
friendliness, affiliation concern can be inferred from 
statements about how one person feels about another 
or their relationship. Statements of liking, or wanting 
to be liked, accepted, or forgiven, or wanting under- 
standing or sympathy indicate the desire for an affilia- 
tive relationship. 

When no statement of feeling or wanting is present, 
the nature of the relationship can be inferred from 
companionate activities such as parties, reunions, 
visits, or relaxed small talk as in a bull session. These 
are regarded as affiliative activities from which affilia- 
tive feelings can be inferred. 

Often the reaction of some character in a story to 
separation or disruption of an interpersonal relationship 
is indicative of affiliative concern (9). Negative feelings 
such as sorrow in parting or grief over some action 
which has led to separation imply the desire to restore 
the affiliative relationship. 

Unrelated Imagery (UI) is scored when there is no 
evidence of affiliative concern as defined above, and 
the remaining categories are then not scored. 

The following categories are scored when the behavioral
sequence has been identified as affiliation related.
Each is scored only once per story. 

Need (N) is scored when someone in the story ex- 
presses a desire for the establishment, maintenance, or 
restoration of an affiliative relationship. 


Instrumental Activity is scored when there is evidence
in the story that a person is doing something
(including thinking) to bring about an affiliative goal
state. The instrumental act is scored successful (I+),
unsuccessful (I−), or of doubtful success (I?), depending
on the over-all outcome of instrumental activity in
the story.

Anticipatory Goal State is scored when someone in
the story is anticipating goal attainment or frustration.
When a person in the story thinks about the pleasantness
accompanying an affiliative relationship or its
restoration, the anticipatory state is scored positive
(Ga+). Negative anticipations (Ga−) are indicated by
thoughts about pain of rejection or future separation.
Doubtful anticipations are scored Ga−.

3 Prior to publication, early drafts may be obtained
from the authors.

Obstacle or Block is scored when goal-directed activ- 
ity is blocked or interfered with. The obstacles may be 
in the environment (Bw) or within the person (Bp). 

Affective State is scored when some statement indi- 
cates an affective (emotional) state associated with an 
affiliative activity. When someone in the story experiences
joy and satisfaction in affiliation, the affective
state is scored positive (G+). When pain over separation
or rejection is present, the affective state is scored
negative (G—). 

Affiliation Thema (Aff Th) is scored when the main 
plot of the story has to do with establishing, maintain- 
ing, or restoring an interpersonal relationship charac- 
terized by friendship, mutual interest, and sympathetic 
understanding. If there is a competing behavioral 
sequence, this category is not scored. The thema
category is an indication that affiliation-related
thoughts dominated S's thinking to the exclusion of 
other thoughts during the four-minute period in which 
he was writing the story. 

Scoring reliability. The total scores assigned to the 
five story records of 20 Ss (not previously scored) by 
two coders working independently correlated .96. The 
Percentage agreement in scoring affiliation imagery 
(twice the number of agreements divided by the total 
number of times the category was scored by both 
coders) was 95 per cent. The percentage agreement in 
scoring subcategories appearing in stories already
scored for affiliation imagery was 90 per cent. 
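
The agreement index defined in the parentheses above can be sketched in a few lines. This is only an illustration: the function name and the example judgments are hypothetical, and each coder's record is assumed to be a list of yes/no decisions (one per story) on whether the category was scored.

```python
def percentage_agreement(coder_a, coder_b):
    """Twice the number of agreements divided by the total number of times
    the category was scored by both coders, expressed as a percentage.

    coder_a, coder_b: equal-length sequences of booleans, one entry per
    story, True when that coder scored the category (assumed data layout).
    """
    if len(coder_a) != len(coder_b):
        raise ValueError("both coders must judge the same stories")
    # An agreement is a story in which both coders scored the category.
    agreements = sum(1 for a, b in zip(coder_a, coder_b) if a and b)
    # Total number of times the category was scored, summed over coders.
    total_scored = sum(bool(a) for a in coder_a) + sum(bool(b) for b in coder_b)
    if total_scored == 0:
        raise ValueError("the category was never scored by either coder")
    return 100.0 * 2 * agreements / total_scored


# Hypothetical judgments on five stories.
first_coder = [True, True, False, True, False]
second_coder = [True, True, False, False, False]
print(percentage_agreement(first_coder, second_coder))  # 80.0
```

Stories that neither coder scored do not enter the index, so it is not inflated by the many stories in which the category is simply absent.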


RESULTS 


Comparisons between the control and 
aroused conditions were accomplished by 
breaking the combined control-aroused distri- 
bution of frequencies per S for a particular 
category as near to the median as possible. 
Table 1 shows that the expected increases in 
frequency of Imagery, Successful Instrumental 
Activity, Positive Anticipatory Goal State, 
Environmental Obstacle, Positive Affective 
State, and Thema are significant. Four cate- 
gories—Unsuccessful Instrumental Activity, 
Negative Anticipatory Goal State, Personal 
Obstacle, and Negative Affective State— 
failed to increase.
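
The comparison described above reduces each category to a 2 x 2 table (control vs. aroused, above vs. below the breaking point). A minimal sketch follows, assuming the ordinary Pearson chi-square with 1 df and no continuity correction. The text does not name the formula, but this assumption reproduces the values printed in Table 1 for the rows checked here. The function name is hypothetical.

```python
def chi_square_2x2(control_above, control_below, aroused_above, aroused_below):
    """Pearson chi-square (1 df, no continuity correction) for a 2 x 2 table."""
    a, b, c, d = control_above, control_below, aroused_above, aroused_below
    n = a + b + c + d
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    if denominator == 0:
        raise ValueError("every row and column must contain at least one S")
    return n * (a * d - b * c) ** 2 / denominator


# Counts taken from Table 1 (control N = 36, aroused N = 31).
imagery = chi_square_2x2(11, 25, 23, 8)
n_affiliation = chi_square_2x2(10, 26, 21, 10)
print(round(imagery, 2))        # 12.69
print(round(n_affiliation, 2))  # 10.7
```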

For the Need category, an alternative
breaking point which took cognizance of the
sizable number of Ss in the control condition
who showed this category only once yielded
clearer evidence of a difference than did the
approximate median break (Table 1). While
recognizing that a comparison suggested by
examination of the data favors acceptance of
a chance difference, we nevertheless accept


AFFILIATION AND THEMATIC APPERCEPTION 




TABLE 1

NUMBER OF Ss IN CONTROL AND AROUSED CONDITIONS ABOVE AND BELOW BREAKING POINT OF COMBINED
CONTROL-AROUSED DISTRIBUTION OF FREQUENCIES FOR VARIOUS AFFILIATION-RELATED IMAGINATIVE
CATEGORIES

                                                  CONTROL (N = 36)    AROUSED (N = 31)
IMAGINATIVE CATEGORY              BREAKING PT.    ABOVE    BELOW      ABOVE    BELOW       χ²      p*
                                  (Range 0-6)

Affiliation Imagery (Im)          2-3               11       25         23        8      12.69    .001
Need (N)                          d                 21       15         22        9       1.16    .15
                                  (1-2)**           (5)     (31)       (11)     (20)       s i
Instrumental Activity
  Successful (I+)                 1-2                7       29         12       19       3.04
  Doubtful or unsuccessful (I-)   0-4             d he NAA E sin 4A
Anticipatory Goal State
  Positive (Ga+)                  0-1                5       31         12       19       5.42    .01
  Negative (Ga-)                  0-1               11       25         12       19        .49
Obstacle
  Personal (Bp)                   0-1               11       25          8       23        .18
  Environmental (Bw)              1-2                6       30         13       18       5.23    .025
Affective State
  Positive (G+)                   0-1               15       21         20       11       3.49    .05
  Negative (G-)                   0-1               22       14         18       13        .06
Thema (Th)                        1-2                9    A TAS 1016: 090 3206 151025
n Affiliation score (based on     7-8               10       26         21       10      10.70    .001
  Im, N, I+, Ga+, Bw, G+, Th)

* In the predicted direction.

** Figures in parentheses indicate alternative breaking point, not the approximate median.


this category as symptomatic of n Affiliation 
pending further experimentation. 

An over-all index of strength of the affiliation
motive in each S, the n Affiliation score, was
computed by assigning a value of +1 for each
occurrence in an S's stories of the seven categories
of response which discriminated between
presumed low and high motivation conditions.
The median n Affiliation scores of the control
and aroused conditions were 6.14 (range 1-15)
and 9.75 (range 2-18), respectively. The expected
difference between the two conditions
is significant at the .001 level (see Table 1).
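
As a rough sketch of the scoring rule just described, the n Affiliation score can be computed by counting, over all of an S's stories, each occurrence of the seven discriminating categories listed in Table 1 (Im, N, I+, Ga+, Bw, G+, Th). The data layout (one list of category codes per story), the names, and the example record are hypothetical.

```python
# The seven categories that discriminated between conditions (Table 1).
SCORED_CATEGORIES = frozenset({"Im", "N", "I+", "Ga+", "Bw", "G+", "Th"})


def n_affiliation_score(stories):
    """Add +1 for each occurrence of a scored category in an S's stories.

    stories: an iterable of stories, each given as a list of the category
    codes the coder assigned to it (assumed layout). Codes outside the
    seven scored categories, such as I-, Ga-, Bp, or G-, contribute nothing.
    """
    return sum(
        1
        for story in stories
        for category in story
        if category in SCORED_CATEGORIES
    )


# Hypothetical five-story record for one S.
example_record = [
    ["Im", "N", "I+", "Ga+"],
    ["Im", "Bp", "G-"],
    [],
    ["Im", "Bw", "G+", "Th"],
    ["Im", "I-"],
]
print(n_affiliation_score(example_record))  # 10
```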


DISCUSSION 


Shipley and Veroff (9) defined affiliation
imagery exclusively in terms of separation
anxiety, i.e., affective concern over separation
from another person or persons. The present
results argue for a broader definition of affiliation
imagery and hence a broader conception
of the affiliation motive. Any evidence of concern
over establishing or maintaining a positive
affective relationship with another person(s)
(in addition to concern over restoring broken
relationships) is viewed as symptomatic of
motivation to affiliate according to the present
criteria.


The difference between the two scoring 
systems can be attributed to (a) differences 
in the pictures used to elicit stories in the two 
studies, and (b) differences in the presumed low 
motivation control conditions. 

Effect of picture differences. Four pictures
used by Shipley and Veroff were selected from
the TAT, noted for its ability to arouse fantasies
concerning basic interpersonal problems.
A fifth picture was especially designed by them
to elicit thoughts concerning separation from
others. The present pictures, on the other hand,
portray a greater variety of common everyday
situations. From this we expect and find important
differences in the kinds of stories
elicited. When the Shipley-Veroff stories were
rescored according to the present criteria,
comparison of the frequencies of various imaginative
categories in the almost identical
aroused conditions of the two experiments
showed that the earlier pictures elicited disproportionately
more Needs, Obstacles, and
Negative Affective States than the present
pictures.4 The pictures of the earlier investigation,
in other words, were structured to favor
perception of them as affiliation-deprivation
situations. This accounts in part for the relatively
restricted definition of affiliation imagery
growing out of the earlier experiment.

FIG. 1. ASSUMED MOTIVATING EFFECT OF CUES IN THE AROUSED AND CONTROL CONDITIONS OF THE TWO EXPERIMENTS
(Figure not reproduced. Two gradients are plotted against AFFILIATION CUES: Hope of Affiliation (Approach) and Fear of Rejection (Avoidance). The settings marked are the fraternity dining room and the college classroom, for the Shipley-Veroff and present experiments.)

4 Tables showing the frequency of imaginative responses
to different pictures in the two studies have
been deposited with the American Documentation Institute.
Order document 4095 from the ADI Auxiliary
Publications Project, Photoduplication Service, Library

As might be expected, the more restricted
scoring procedures did discriminate between
the control and aroused conditions of the
present experiment, but less adequately than
the more comprehensive criteria proposed
here. The median score obtained for the control
condition of the present experiment was
3.86 and for the aroused condition 5.00 (χ² =
3.23, p = .05 in the predicted direction).5

To expect, in addition, that the broader
scoring system proposed here would discrimi- 
nate between the two conditions of the earlier 
experiment as adequately as the earlier scor- 
ing system would ignore the effect of an im- 
portant difference between the control condi- 


of Congress, Washington 25, D. C. remitting in advance 
$1.25 for 35 mm. microfilm or $1.25 for photostats. 
Make checks payable to Chief, Photoduplication Serv- 
ice, Library of Congress. 

5 The earlier report (9) recommends scoring +1 for 
Affiliation Imagery, Instrumental Activity (+, —, or 
?), Affective States (positive or negative), Obstacles 
(personal or environmental), and Thema; and —1 for 
Unrelated Imagery. In this comparison, Unrelated 
Imagery was scored 0 instead of —1 since the frequency 
of Unrelated Imagery is completely dependent upon 
the frequency of Affiliation Imagery, and scoring —1 is 
gratuitous, equivalent to weighting the imagery cate- 
gory twice as heavily as any other category. 




tions of the two experiments on the scoring 
procedures that were developed by comparing
the particular control and aroused groups in
each experiment.


Effect of differences between control conditions.
Figure 1 contains our conceptualization of the
motivating effects of the cues in the socio- 
metric arousal and control conditions of the 
two experiments. The general logic of the usual 
diagrammatic presentation of approach-avoid- 
ance conflict is implied in this conceptualiza- 
tion (3, 8). 

The approach gradient (positive motivation
to affiliate, or hope of affiliation) is presumed
to be at its maximum as a result of the combined
effect of cues in the fraternity dining
room settings and the sociometric rating procedures
in the nearly identical aroused conditions
of the two experiments. The Shipley-Veroff
control condition, a fraternity dining
room after supper and including a food preference
inventory before the imaginative measure,
is assumed to have contained more cues which
would arouse motivation to affiliate than the
more neutral college classroom condition of
this experiment which included some incompatible
achievement-motivating cues.
A fear of rejection or avoidance gradient is
also plotted. It represents the degree to which
anticipations of painful rejection are aroused
by the situational cues. It is strongest in the
aroused conditions of both experiments and
falls off sharply since it is assumed to be primarily
determined by the sociometric rating
procedure which virtually guaranteed a certain
amount of rejection for all Ss.

In the present study, the arousal of both
hope of affiliation (approach) and fear of rejection
(avoidance) is low in the control condition
and high in the aroused condition. In
the earlier study, the difference between control
and aroused conditions was relatively
greater for fear of rejection than for hope of
affiliation.* As a result, the Shipley-Veroff scoring
system based on pictures already favoring
affiliation-deprivation imagery should be sensitive
to the expression of affiliation motivation


* In light of the positive relationship between approval
seeking and interest in eating reported by Blum
and Miller (2) in work on the "oral character," one
might argue that the food preference inventory included
as part of the control procedure may have
provided additional cues associated with affiliative
tendencies.




when rejection is threatened, but rather insensitive
to the effect of an increase in positive
motivation alone because their control condition
was already, to a certain extent, positively
motivated.

Since the present control group was not influenced
by either the positive-motivating cues
of the fraternity dining room or the fear-producing
cues of the sociometric arousal procedure,
the present scoring criteria should be
sensitive to positive motivation aroused by
the fraternity dining room cues in the Shipley-Veroff
control condition. This expectation is
confirmed. Our scoring of the Shipley-Veroff
aroused condition for affiliation imagery was
virtually identical with the original scoring of
it. However, where they report a mean (per S)
imagery count of only 2.53 in their control
condition, the more comprehensive imagery
criteria suggested here produce a higher mean
of 2.93. As a result, the total number of subcategories
scored in their control stories is
also increased, and the over-all difference between
the control and aroused conditions is
found to be much less than when separation
imagery alone was the basis for comparison.
The scoring procedures developed in the
present experiment are recommended as a
more general definition of affiliation motivation
than those developed in the earlier study
in which the particular properties of the pictures
and the nature of the difference between
control and aroused conditions may be assumed
to have had the effect of restricting the
definition of affiliation imagery to separation
anxiety as outlined in the arguments above.
Affiliation and popularity. There is one
bit of evidence to indicate that both scoring
procedures were measuring the same variable
in the aroused conditions of the two studies.
During the sociometric rating procedure of
both studies, Ss were asked to rank a number
of traits in order of the degree to which possession
of that trait would make a person likeable,
and later they were to assign two of the
traits on the list to each member of their
delegation. The mean rank of traits assigned
to a person is an index of the degree to which
he is considered likeable by his associates. In
the Shipley-Veroff study, the rank correlation
(tau) between individual n Affiliation scores
and this popularity index was -.33 (p = .05).
In the present study, the rank correlation (tau)
between these same measures was -.27 (p =
.05) and rose to -.36 when the one case having
the lowest n Affiliation score and the least
likeable ratings was eliminated.
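
For readers unfamiliar with the rank correlation tau, the following is a minimal sketch of Kendall's tau in its simplest form (tau-a, in which tied pairs count as neither concordant nor discordant). Whether the authors applied a correction for ties is not stated, so this variant is an assumption, and the function name is hypothetical.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """Kendall's rank correlation (tau-a) between two paired sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length, at least 2 long")
    concordant = discordant = 0
    # Compare every pair of Ss: do the two measures order them the same way?
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            concordant += 1
        elif product < 0:
            discordant += 1
    n_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / n_pairs


# Perfectly reversed orderings give the minimum value.
print(kendall_tau_a([1, 2, 3, 4], [4, 3, 2, 1]))  # -1.0
```

A negative tau, as reported above, means that pairs of Ss ordered one way on the n Affiliation score tend to be ordered the opposite way on the popularity index.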

n Affiliation is an approach tendency. Analysis
of the traits used to describe Ss during the
sociometric rating procedure supports a conception
of the affiliation motive as a disposition
to move toward others to elicit positive
affective responses from them. The Ss above
the median n Affiliation score in the aroused
condition were described as approval seeking by
at least one other person significantly more
frequently than Ss below the median n Affiliation
score (χ² = 7.43, p = .01). They were,
in addition, described as self-assertive and confident
(indicative of positive anticipations) by
at least one other person more frequently than
Ss low in n Affiliation (χ² = 2.88, p = .10).
Unanticipated by us was the additional finding
that they were also more often rated by at
least one other person as egotistical than were
Ss low in n Affiliation (χ² = 7.43, p = .01).

The correspondence of these results with the 
Blum and Miller (2) findings on children that 
indices of orality are positively related to ap- 
proval seeking and negatively related to socio- 
metric status provides a promising lead con- 
cerning the origins of the affiliation motive. 


SUMMARY 


The purpose of this study was to develop a 
method of scoring thematic apperception 
stories to measure strength of motivation for 
social acceptance, or n Affiliation. Imaginative 
stories were written in response to pictures by 
two groups of male Ss under experimental 
conditions designed to differ in the degree to 
which motivation to be accepted and liked by 
others would be aroused. The presumed high 
motivation condition consisted of a socio- 
metric procedure conducted in a fraternity 
dining room before stories were written. The 
low motivation control condition was a college
classroom.

Behavioral sequences dealing with attempts 
to establish, maintain, or restore positive affec- 
tive relationships with other persons occurred 
more frequently in the imaginative stories of 
the group in which n Affiliation had been 
aroused. The differences between the scoring 
definitions proposed here and a more restricted 
set growing out of a similar experiment by
Shipley and Veroff are reconciled in terms of
an analysis of differences between the pictures 
and low motivation control conditions in the 
two experiments. 


REFERENCES 


1. ATKINSON, J. W., & MCCLELLAND, D. C. The
projective expression of needs. II. The effect of
different intensities of the hunger drive on
thematic apperception. J. exp. Psychol., 1948,
38, 643-658.

2. BLUM, G. S., & MILLER, D. R. Exploring the
psychoanalytic theory of the oral character.
J. Pers., 1952, 20, 287-304.

3. CLARK, R. A. The projective measurement of
experimentally induced levels of sexual motivation.
J. exp. Psychol., 1952, 44, 391-399.

4. HENRY, W. E., & GUETZKOW, H. Group projection
sketches for the study of small groups. J. soc.
Psychol., 1951, 33, 77-102.

5. MCCLELLAND, D. C. Personality. New York:
William Sloane Associates, 1951.

6. MCCLELLAND, D. C., ATKINSON, J. W., CLARK,
R. A., & LOWELL, E. L. The achievement motive.
New York: Appleton-Century-Crofts, 1953.

7. MCCLELLAND, D. C., CLARK, R. A., ROBY, T. B.,
& ATKINSON, J. W. The projective expression of
needs. IV. The effect of need for achievement
on thematic apperception. J. exp. Psychol.,
1949, 39, 242-255.

8. MILLER, N. Comments on theoretical models
illustrated by the development of a theory of
conflict behavior. J. Pers., 1951, 20, 82-100.

9. SHIPLEY, T. E., & VEROFF, J. A projective measure
of need for affiliation. J. exp. Psychol., 1952, 43,
349-356.

10. VEROFF, J., WILCOX, S., & ATKINSON, J. W. The
achievement motive in high school and college
age women. J. abnorm. soc. Psychol., 1953, 48,
108-119.


Received May 2, 1953. 


INTERPERSONAL KNOWLEDGE AND INDIVIDUAL AND GROUP 
EFFECTIVENESS 


F. LOYAL GREER
Institute for Research in Human Relations

EUGENE H. GALANTER
University of Pennsylvania

THE major hypothesis tested in this research
was that the knowledge of interpersonal
relationships by individuals in
a group is related to effective group behavior.
The more one knows about his physical world, 
other things equal, the better able he is to 
move in that world to achieve his purposes. 
It seems reasonable to extend this concept to 
the social environment. 

A study by Chowdhry and Newcomb (2)
provides evidence which indicates that sociometrically
nominated leaders of face-to-face
groups have more accurate perceptions of their
groups' attitudes toward group-relevant issues
than nonleaders and isolates. Such leadership
can be conceived as indicative of personal
effectiveness in the group situation. The hypothesis
follows that groups composed of individuals
with more accurate social perceptions
about their group will be more productive and
effective than groups with less perceptually
accurate individuals.

The problem was to locate a criterion which
would provide a means of differentiating between
groups in terms of their effectiveness or
productivity, and in situations where the effectiveness
of the group would depend on the
behavior of its members. In order to examine
further the finding concerning the accuracy of
social perceptions of the leader as compared
with followers, the groups needed a leader-follower
structure. In addition, these groups
had to have past histories as groups. Otherwise
differences would have been unlikely between
leaders and followers or between groups
in the accuracy of the perceptions of the interpersonal
relationships and attitudes of the

METHOD 


The above requirements for the research
were met in a study conducted by the Institute for
Research in Human Relations under contract with
the Personnel Research Section of The Adjutant General's
Office. Havron, McGrath, and Fay (4) developed
a criterion performance problem designed to measure



AND 


PETER G. NORDLIE 
University of Maryland. 


the effectiveness of infantry rifle squads in carrying 
out common combat missions. This criterion was then 
used to validate various predictors. Moreover, the 
scores on the criterion field problem for infantry 
rifle squads made it possible to test the hypotheses in 
this study. 

The criterion field problem was based on Army field 
manuals and consultations with military tacticians at 
The Infantry School, Fort Benning, Georgia. Regard- 
less of the ultimate validity of the criterion, the scores 
obtained on the problem depended on the teamwork 
of the nine men in a squad. The criterion was a six-hour 
squad field problem. There were four different missions: 
(a) the squad as a point of advance guard, (b) the squad 
as a reconnaissance patrol, (c) the squad in a defense 
position, and (d) the squad as an attacking unit. These 
performances were objectively scored by trained ob- 
servers in terms of very specific actions. Interrater 
reliability was +.88 (4).

The Army infantry rifle squad includes a squad 
leader, an assistant squad leader, an automatic rifleman, 
an assistant automatic rifleman, and five riflemen.
These men must work together as a team in the field 
in order to achieve group goals. The men used in this 
study had lived and worked together anywhere from 
a month to two years. 

Hypotheses. Three hypotheses were formulated: 

1. Squad leaders will be more accurate in their per- 
ceptions of the preference hierarchy structure within 
their squads than will other squad members. 

2. The more popular individuals in a squad will 
have more accurate perceptions of the preference 
hierarchy structure in their squads than will the less 
popular men.

3. Effective groups as measured by the criterion 
field problem will be characterized by members with 
greater accuracy in the perception of the preference 
hierarchy structure in their squads than by members 
in less effective groups. 

Procedure. The measures of the group-relevant atti- 
tudes and perceptions were obtained during individual 
interviews. A squad member was asked first to list the 
men in his squad according to how well he liked them. 
He then was asked to list them in the order that he 
thought they were liked by the group as a whole. In 
this second listing the interviewee included himself. The 
first listing was used to determine the ranking of a man 
in the preference hierarchy structure of his group. Each 
individual in a squad was ranked in this preference 
hierarchy structure according to the average listing 
position given to him by all of the other men in his 
squad. This derived preference hierarchy ranking was 
compared with each squad member’s perception of the 
preference hierarchy structure in his squad. From this 
comparison a discrepancy score was obtained for each 




man. The larger the discrepancy score, the less accurate 
the interviewee's perception of the preference hierarchy 
structure within his squad was assumed to be.
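
The procedure above can be sketched as follows. The text does not give the formula for the discrepancy score. The sketch assumes it is the sum of absolute differences between a man's perceived rank order and the derived rank order, an assumption that is consistent with the maximum of 40 reported for a nine-man squad. The function names, the data layout, and the alphabetical tie-breaking rule are all hypothetical.

```python
def derived_ranking(first_listings):
    """Rank each man by the average position the other men gave him.

    first_listings: dict mapping each rater to his list of the other squad
    members, best liked first (assumed layout). Ties in average position are
    broken alphabetically, which is an arbitrary choice made for this sketch.
    """
    positions = {}
    for listing in first_listings.values():
        for position, member in enumerate(listing, start=1):
            positions.setdefault(member, []).append(position)
    mean_position = {m: sum(p) / len(p) for m, p in positions.items()}
    ordered = sorted(mean_position, key=lambda m: (mean_position[m], m))
    return {member: rank for rank, member in enumerate(ordered, start=1)}


def discrepancy_score(derived, perceived_listing):
    """Sum of absolute rank differences (assumed form of the score).

    perceived_listing: the interviewee's second listing, which includes
    himself, ordered by how well he thinks each man is liked by the group.
    """
    return sum(
        abs(derived[member] - rank)
        for rank, member in enumerate(perceived_listing, start=1)
    )


# Nine hypothetical squad members whose derived ranks follow the alphabet.
squad = list("ABCDEFGHI")
derived = {member: rank for rank, member in enumerate(squad, start=1)}
print(discrepancy_score(derived, squad))        # 0
print(discrepancy_score(derived, squad[::-1]))  # 40
```

A perfectly accurate perception yields 0, and a completely reversed ordering of nine men yields 40, the ceiling mentioned in the results.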

Subjects and interview. Sixty-three infantry rifle 
squads were tested on the criterion field problem at
Camp Atterbury, Indiana. The members of the 13 
highest-scoring or effective squads and the members of 
the 13 lowest-scoring or ineffective squads were inter- 
viewed by eight different civilian interviewers. Neither 
interviewers nor interviewees knew whether a squad 
member came from an effective or an ineffective squad. 
Assurances were given to squad members that what 
they said would not be revealed to anybody in the 
Army. Toward the end of the interview the questions 
relevant to this research were asked. 


RESULTS 


Leaders vs. nonleaders. Discrepancy scores
for the accuracy of the perception of the preference
hierarchy structure within a group
could be as high as 40. The actual range for
this sample was between 4 and 35. The 23
squad leaders interviewed had a mean discrepancy
score of 15.0. The inability to interview
18 squad members did not bias any of
the results. It meant only that the attitudes of
eight men instead of nine determined the preference
hierarchy ranking for some squads.
The mean discrepancy score for the 193 nonleader
men interviewed was 16.5. The difference
between distributions was not statistically
significant at the .05 level using chi square as
the test. The obtained chi square was 2.70 with
1 df. The required value for chi square is 2.706
for significance at the .05 level using the one-tailed
test. However, another analysis was
made to check this finding. Mean discrepancy
scores were determined for men in each of the
nine positions in a squad. The expectation that
the mean discrepancy score for squad leaders
would be lower than the mean discrepancy
scores for men in the other eight positions was
confirmed. With the use of the standard error
of the frequency this result produced a t of
2.48 (p < .05). On the basis of these two
analyses the data appear to corroborate the
findings of Chowdhry and Newcomb (2). The
appointed leaders were significantly more accurate
than nonleaders in the group-relevant
social perception.
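
The required value of 2.706 quoted above can be checked with a short calculation. A chi square with 1 df is the square of a standard normal deviate, so the one-tailed .05 criterion corresponds to the square of the normal deviate that cuts off the upper 5 per cent. The sketch below uses only the Python standard library; the variable names are illustrative.

```python
from statistics import NormalDist

# Normal deviate exceeded by chance 5 per cent of the time (one tail).
z_one_tailed_05 = NormalDist().inv_cdf(0.95)

# With 1 df, chi square is the square of a standard normal deviate.
critical_chi_square = z_one_tailed_05 ** 2
obtained_chi_square = 2.70

print(round(critical_chi_square, 3))               # 2.706
print(obtained_chi_square >= critical_chi_square)  # False
```

The obtained value of 2.70 falls just short of the criterion, which is why the first analysis is reported as not significant.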
Popular vs. less popular. The derived prefer- 
ence hierarchy ranking for each man was used 
as the first index of individual popularity. 
Men in ranks one through four were combined 
and called “popular.” Men in ranks five 
through nine were termed “less popular.” 


F. LOYAL GREER, EUGENE H. GALANTER, AND PETER G. NORDLIE


The mean discrepancy score for the 97 popular
men was 15.6; for the 119 less popular men it
was 17.3. The difference between distributions 
produced a chi square of 3.15 with 1 df, which 
is significant beyond the .05 level for the one- 
tailed test. 

An independent and somewhat different 
measure of popularity was available. Included
in a battery of instruments administered to the 
men before they went on the field problem was 
a 10-item sociometric test (4). Each man re- 
sponded to the items in this test according to 
whether he would like to do a particular ac- 
tivity with each of the other eight men in his 
squad. These data yielded a figure represent- 
ing the summation of positive and negative 
choices received by each man. The distribu- 
tion of sociometric scores was split in half; 
those with a high sociometric score were called 
the “sociometrically popular" individuals, and 
the low scorers were called the “sociometrically 
less popular.” The sociometrically popular individuals
were found to have the more accurate
social perceptions. A significant chi
square was obtained for the difference between
distributions with a value of 4.05 with 1 df
(p < .05).

Since 17 of the 23 squad leaders were in the
sociometrically popular group, it seemed that
the results might have been confounded. Even
when the squad leaders were eliminated in
another analysis, the sociometrically popular
group retained its lower mean discrepancy
score. The chi square for the difference between
the distributions was 4.56 with 1 df (p < .05).
Popularity, as measured in two different ways,
had a positive relationship with the accuracy
of perceptions of interpersonal attitudes; the
more popular men perceived the social relationships
more accurately than did the less popular
men.

Effective vs. ineffective groups. Leaders and
popular squad members, in a sense the effective
members in a squad, had more accurate perceptions
of the preference hierarchy structure
in their squads. The hypothesis follows that
groups composed of men with more accurate
perceptions of the social relationships in their
groups will be more effective or productive
groups. The 107 members of effective squads
had a mean discrepancy score of 15.8. The
people from the ineffective squads had a mean
discrepancy score of 17.1. The chi square for
the difference between the distributions was
significant beyond the .05 level with a value
of 4.24 with 1 df.

The squad leaders of effective squads had a
mean discrepancy score of 13.5; for nonleader
men of effective squads it was 16.1. The squad
leaders of ineffective squads had a mean discrepancy
score of 16.4; for nonleaders of ineffective
squads it was 17.2. These data might seem
to indicate that the difference between effective
and ineffective squads in terms of their perceptual
accuracy was essentially due to the
especially good ability of the squad leaders of
effective squads. But the comparison of the
distributions of scores between the members
of effective and ineffective squads after squad
leaders were excluded produced a chi square of
3.06 with 1 df. This is significant beyond the .05
level using the one-tailed test. Therefore, the
difference originally obtained between all men
in effective and ineffective squads was still
significant, even though the scores of squad
leaders were excluded from the analysis.

Other analyses. The possibility existed that
the measure of perceptual accuracy used in
this study might have been associated with
some other variable which was causally related
to the effectiveness of squads on the criterion
field problem. The variables of age, education,
Army grade, or whether a person came from a
small or large community did not differentiate
between men in effective and ineffective
squads. The length of time the men had worked
together might have affected both the accuracy
of social perceptions and the effectiveness of
the squads. The data indicated that members
of effective squads had an average of 168.1
days together before the field problem, while
members of ineffective squads had an average
of 170.4. The difference is negligible and inconsistent
with what might have been expected.

An Army measure of intelligence showed no
significant difference between the mean scores
of all of the members of effective and ineffective
squads. There was a significant difference,
however, between squad leaders; the squad
leaders of effective squads had a significantly
higher mean intelligence score than squad
leaders of ineffective squads (5). The relationship
between the intelligence of a squad leader
and the effectiveness of his squad is not surprising.
Since intelligence and the accuracy of
social perceptions may be related, one might
wonder whether the intelligence of squad
leaders alone, and not the accuracy of social
perceptions, was the causal variable related
to group effectiveness. But the finding that
nonleader men in effective squads had significantly
lower discrepancy scores than the nonleader
men in ineffective squads, but did not
differ in intelligence, militates against this
explanation. Both the intelligence of squad
leaders and the accuracy of the social perceptions
of the squad members appear to have
contributed to squad effectiveness.


DISCUSSION 


Hites and Campbell (6) found that leaders, 
both those appointed and those nominated, 
were no better than nonleaders in the accuracy 
of their perceptions of attitudes held by the 
members of their groups. Newcomb1 has proposed
that for certain attitudes, e.g., those
attitudes within the group which everybody 
has had an ample opportunity to learn, there 
will be no difference between leaders and non- 
leaders in the perceptual accuracy of their 
estimates. There are certain other types of 
attitudes regarding group morale and leader 
competency for which the leader's perception 
may be distorted through his ego-involvement. 
In the Chowdhry and Newcomb study (2) it 
was found that leaders were more accurate 
than nonleaders in perceiving "consensus data” 
when relevant group issues were involved; no 
significant difference existed between leaders 
and nonleaders when nonrelevant group issues 
were concerned. Naturally, when all the mem- 
bers of a group hold the same attitude, a differ- 
ence in the estimate of the group attitude 
between leaders and nonleaders can hardly be 
expected. Indeed, accurate perceptions of 
group attitudes may sometimes occur simply 
because one holds the same attitude as most 
of the members of his group. The negative 
findings of Hites and Campbell (6) can probably
be explained as a function of the attitudinal
issues used in their study.


Bell (1) found that in small, task-oriented, 
face-to-face groups with no previous associa- 
tion, the sociometrically determined leaders 
did not have greater accuracy than other 
group members in the perception of the atti- 
tudinal structures in the group. He explained 
his negative findings in terms of the lack of 
previous associations among members in his 


1 Newcomb, T. M. Personal communication. Decem- 
ber 16, 1952. 




groups and the fact that leaders were chosen 
merely on the basis of their ability to meet 
the group goals and not the individual needs 
of the group members. The positive findings 
in the study reported in this paper can prob- 
ably be attributed to the fact that none of the 
conditions which appear to have led to the 
negative or indecisive results in the studies 
of Hites and Campbell (6) and Bell (1) was 
present. 
CONCLUSIONS 


Leaders and popular individuals as well as 
members of effective groups had greater knowl- 
edge of the group preference hierarchy struc- 
ture of members within their groups. It seems 
reasonable to assume that accuracy in this 
particular perceptual category reflects ac- 
curacy in other perceptual matters relating to 
group attitudes toward group-relevant issues. 
The position of an appointed leader within a 
group appears to facilitate the learning of 
group attitudes. A popular individual also 
achieves, or because of his popularity comes to 
have, this more accurate social perception. 
The results support the work of Chowdhry and 
Newcomb (2) and indicate, furthermore, that 
the accuracy of the group members’ percep- 
tions of group attitudes toward group-relevant 
issues is related to the effectiveness of group 
performance. 

It is suggested that this ability to perceive 
accurately the attitudes of another has the 
following consequences. If one can anticipate 
the needs and behaviors of other people, he 
can act to satisfy them. By satisfying the needs 
of others, an individual becomes in effect a 
“problem solver” for the other person. Re- 
search (3) indicates that such problem solvers 
are often chosen as leaders; the more a leader 
is perceived as a problem solver, the more the 
followers appear to be motivated to help the 
leader (5). A person possessing greater ac- 
curacy in social perceptions can act with more 
certainty and confidence in the consequences 
of his interpersonal behavior. He is in a position 
not only to achieve with more certainty the 

goals of others, but also the social goals that 
he has for himself. 

When one group has individuals in it who 
are better than individuals in another group 
in anticipating the needs and behaviors of 
their group members, this group may be a more 
effective group in its performance. The indi- 
viduals may have to be less motivated toward 


F. LOYAL GREER, EUGENE H. GALANTER, AND PETER G. NORDLIE


behaving in relation to each other than be-
having in relation to the group goal. Further
study in this area seems relevant for the de- 
velopment of a theoretical structure for the 
analysis of interpersonal relationships and 
group behavior.

SUMMARY 


1. Measures of discrepancy between an ob- 
jective group preference hierarchy structure of 
members and an individual estimate of this 
preference hierarchy structure were obtained 
for members of infantry rifle squads who had 
scored high and low on a criterion field prob-
lem.

2. The appointed leaders of these ongoing 
groups were better able to estimate the pref- 
erence hierarchy structure than were non- 
leaders. 

3. Popular individuals were more accurate 
in this social perception than were less popular 
individuals.

4. Members of effective groups were more 
accurate in their estimates of the group pref- 
erence hierarchy structure than were members 
of ineffective groups.

5. The findings are interpreted and explana- 
tions are offered for reconciling the conflicting 
data found in other studies. 


REFERENCES 


1. BELL, G. The relationship between leadership and
empathy. Unpublished doctor's dissertation,
Northwestern Univer., 1952.

2. CHOWDHRY, K., & NEWCOMB, T. M. The relative
abilities of leaders and non-leaders to estimate
opinions of their own groups. J. abnorm. soc.
Psychol., 1952, 47, 51-57.

3. GREER, F. L. Neighborhood leaders. In D. Courtney
et al., Naval, neighborhood, and national leadership.
Series 1953. Institute Report #1. Philadelphia:
Institute for Research in Human Relations, 1953.

4. U. S. DEPT. ARMY. ADJUTANT GENERAL'S OFFICE.
PERSONNEL RESEARCH SECTION. [illegible]
of small military units. Parts I-III. Personnel
Res. Sect. Rep., 1952, 980, 150 p. Washington:
American Documentation Institute, Library of
Congress, Document No. 3969, microfilm, $5.75;
photocopy, $18.75.

5. U. S. DEPT. ARMY. ADJUTANT GENERAL'S OFFICE.
PERSONNEL RESEARCH SECTION. An [illegible]
study of human relationships in effective [illegible]
rifle squads. Personnel Res. Sect. Rep., 195[illegible],
183 p. Washington: American Documentation In-
stitute, Library of Congress, Document No. [illegible],
microfilm, $6.75; photocopy, $23.75.

6. HITES, R., & CAMPBELL, D. T. A test of the ability
of fraternity leaders to estimate group opinion.
J. soc. Psychol., 1950, 32, 95-100.


Received May 4, 1953. 


A NUMBER of investigators have noted
various types of irregularities in
sortings of attitude items made by
the method of equal-appearing intervals.
Types of irregularities most frequently noted 
have been reversal of ends of the continuum 
(1, 3, 5), failure to rate items (11), bunching 
of items in extreme intervals (2, 3, 4, 5, 10, 11), 
failure to use some intervals (8, 11), and mis-
understanding of, or failure to follow, directions
(1, 2, 6, 9, 10, 11). Though these types of
irregularities have been noted, no systematic
study of the extent to which they appear in
judgmental data, or of their effects on scale and 
ambiguity values, has been made. 

The purpose of this note is to report an 
analysis of the various types of irregularities 
found in the sortings made by 712 college 
students of the 130 Thurstone-Chave (10) 
statements of attitude toward the church.
Hypotheses relative to the causes of some of 
these irregularities are suggested. 


METHOD 


Procedure. The method of collecting the judgments
was a modification of the Seashore-Hevner technique
(7). Ratings using 11, 7, and 5 intervals were recorded
on specially prepared IBM answer sheets which pro-
vided appropriate numbers of identified intervals.
In preparing the booklet of items, the first three
items were selected so that the first and third items
would express extremely unfavorable attitudes, while
the second would express a very favorable attitude.
The remaining 127 items were arranged in scrambled
order. To minimize reversals, the verbal designations
associated with the extreme and middle intervals of
the scale were written at the top of each page of the
booklet. The original Thurstone and Chave directions (10),
modified to provide for recording ratings on the answer
sheets, were used. Ratings were made by groups of 20
[illegible] students during a 50-minute class period. The
directions were read aloud to the students, and all
questions concerning them were answered before the
rating of the items started.


¹ This study has been supported by a grant from
the Emory University Research Committee. The
writer wishes to express appreciation to Drs. Joseph
[illegible], Georgia Institute of Technology, Florene
[illegible], University of Georgia, and to the staffs of the
psychology and sociology departments of Emory Uni-
versity for assistance in securing subjects for this study.


IRREGULARITIES IN JUDGMENT DATA COLLECTED BY THE 
METHOD OF EQUAL-APPEARING INTERVALS¹


SAM C. WEBB 
Emory University 


After the judges started rating the items, the ad- 
ministrator circulated among them to check their rating 
procedures. If the pattern of a judge’s ratings was not 
such that the first and third statements fell on the 
“depreciation” side of “neutral” and the rating of the 
second statement fell on the “appreciation” side of 
“neutral,” the administrator asked the student to ex- 
plain his judgment for the item or items which did not 
conform to this pattern. While some students indicated 
that they did not fully understand what to do, a sur- 
prising number replied, “but I don’t agree with it!” 
From the emotional tone of these replies, the writer 
inferred a personal involvement with the item on the 
part of the student and not just a confusion in directions.

Subjects. The 11-interval data were collected from 
243 students enrolled in psychology classes at Georgia 
Institute of Technology. The seven-interval data were 
collected from 244 students enrolled in psychology 
classes at the University of Georgia; and the five-in- 
terval data were collected from 225 students enrolled 
in psychology and sociology classes at Emory Uni- 
versity. 

RESULTS 


After a preliminary examination of the 
sortings for types of irregularities, nine 
classifications or categories were established. 
These were arranged in order of the serious- 
ness of the defect as viewed by the writer. 
The most serious defect was placed at the top 
of the list. Each set of ratings was then con- 
sidered for inclusion in each category, working 
from top to bottom. If a set was classified in 
one of the top categories, it was not examined 
for other irregularities which would permit it 
to be classified in a lower category. The classi- 
fications and the percentage of sets of ratings 
for the 11-, 7-, and 5-interval scales and for 
all scales combined found in each classification 
are shown in Table 1. “Double marking” 
refers to two ratings for an item. Wicker’s (11) 
extrapolation of the Thurstone and Chave
criterion was used to determine if too many 
items had been judged to fall in one interval. 
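The top-to-bottom assignment rule described above can be sketched in code. This is only a minimal illustration, assuming each set of ratings has already been summarized into a few counts and flags. The field names (`incomplete`, `double_marks`, `inversions`, `omissions`, `bunched`, `unused_intervals`) and the function `classify` are hypothetical and do not come from the study; the category labels and thresholds follow Table 1.

```python
# Categories in order of seriousness, most serious first (labels from Table 1).
# Each predicate reads hypothetical summary fields for one set of ratings.
CATEGORIES = [
    (1, "Incomplete", lambda s: s["incomplete"]),
    (2, "Double marking", lambda s: s["double_marks"] > 0),
    (3, "Inversions (10 or more)", lambda s: s["inversions"] >= 10),
    (4, "Omissions (2 or more)", lambda s: s["omissions"] >= 2),
    (5, "Too many in one interval", lambda s: s["bunched"]),
    (6, "Unused intervals", lambda s: s["unused_intervals"] > 0),
    (7, "One omission", lambda s: s["omissions"] == 1),
    (8, "Inversions (less than 10)", lambda s: s["inversions"] >= 1),
    (9, "Perfect sorting", lambda s: True),
]


def classify(summary):
    """Return (number, label) of the first category that applies.

    Working from the top of the list means a set placed in a serious
    category is never examined for the less serious ones below it.
    """
    for number, label, applies in CATEGORIES:
        if applies(summary):
            return number, label


# A set with 12 inversions, 2 omissions, and bunching lands in category 3,
# because category 3 is reached before categories 4 and 5.
example = {"incomplete": False, "double_marks": 0, "inversions": 12,
           "omissions": 2, "bunched": True, "unused_intervals": 1}
print(classify(example))
```

Because every set receives exactly one category, the percentages in each column of Table 1 sum to 100 apart from rounding.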

An inversion is defined as the rating of an 
item on one side of “neutral” when it would 
be expected to fall on the opposite side of 
“neutral.” The number of inversions per set of
ratings was determined in the following way. 
The writer by inspection selected 49 extreme 
items (23 favorable and 26 unfavorable) which 
were fairly evenly distributed throughout the 




TABLE 1
SUMMARY OF IRREGULARITIES FOR JUDGE GROUPS

                               11 INTERVAL    7 INTERVAL    5 INTERVAL      TOTAL
CATEGORY                         N      %      N      %      N      %      N      %
1. Incomplete                    8    3.3      9    3.7      2     .9     19    2.7
2. Double marking               16    6.6     15    6.1      2     .9     33    4.6
3. Inversions (10 or more)       8    3.3     11    4.5     10    4.4     29    4.1
4. Omissions (2 or more)         7    2.9      7    2.9      0      0     14    2.0
5. Too many in one interval     30   12.3     18    7.4      0      0     48    6.7
6. Unused intervals              9    3.7      3    1.2      3    1.3     15    2.1
7. One omission                 24    9.9     16    6.6     11    4.9     51    7.2
8. Inversions (less than 10)    70   28.8    111   45.5     91   40.4    272   38.2
9. Perfect sorting              71   29.2     54   22.1    106   47.1    231   32.4
Total                          243  100.0    244  100.0    225   99.9    712  100.0


test booklet. All sets of ratings were “scored” 
on these items to determine the number of 
items placed on the unexpected side of the 
“neutral” interval. Sortings with 10 or more 
inversions were placed in category 3; those 
with less than 10 inversions were placed in 
category 8. 
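The scoring of inversions described above can be sketched as follows. This is an illustrative reading rather than the writer's actual procedure: it assumes intervals are numbered 1 through the number of intervals, that the middle interval is "neutral," and that each key item has an expected side coded +1 (above neutral) or -1 (below neutral). The item numbers, the coding, and the names `count_inversions` and `inversion_category` are hypothetical.

```python
def count_inversions(ratings, expected_side, n_intervals):
    """Count key items rated on the side of neutral opposite to the expected one.

    ratings: dict mapping item number -> interval chosen (1..n_intervals)
    expected_side: dict mapping key item number -> +1 or -1 (assumed coding)
    """
    neutral = (n_intervals + 1) / 2          # e.g. interval 6 of 11
    inversions = 0
    for item, side in expected_side.items():
        rating = ratings.get(item)
        if rating is None:                   # omitted items are not inversions
            continue
        observed = (rating > neutral) - (rating < neutral)  # +1, 0, or -1
        if observed == -side:                # a neutral rating (0) never counts
            inversions += 1
    return inversions


def inversion_category(n_inversions):
    """Category 3 for 10 or more inversions, category 8 for 1 to 9, else None."""
    if n_inversions >= 10:
        return 3
    if n_inversions >= 1:
        return 8
    return None


# Three hypothetical key items on an 11-interval scale.
key = {1: +1, 2: +1, 3: -1}
print(count_inversions({1: 9, 2: 3, 3: 6}, key, 11))
```

In the study the key consisted of the 49 extreme items selected by inspection; the three-item key here only keeps the example short.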

It seems plausible to classify the eight cate- 
gories which describe the irregularities into 
four groups. These groups and their percentage 
of irregularities are as follows: (a) Irregulari- 
ties resulting from a time limitation (category 
1), 2.7 per cent. (b) Irregularities associated 
with the answer sheet method of collecting 
the data—double markings and omissions 
(categories 2, 4, and 7), 13.8 per cent. (c) 
Irregularities usually attributed to “careless- 
ness" in sorting—too many in one interval 
and unused intervals (categories 5 and 6), 
8.8 per cent. (d) Irregularities defined as in-
versions (categories 3 and 8), 42.3 per cent.
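The four group percentages can be recomputed from the totals of Table 1. The sketch below assumes the total counts as read from the table, with category 4 taken as 14, the value implied by the column total of 712; the variable names are illustrative only.

```python
# Total number of sets of ratings in each category (Total column of Table 1).
TOTAL_SETS = 712
counts = {1: 19, 2: 33, 3: 29, 4: 14, 5: 48, 6: 15, 7: 51, 8: 272, 9: 231}

# Grouping of categories as given in the text.
groups = {
    "a": [1],        # time limitation
    "b": [2, 4, 7],  # answer sheet method: double markings and omissions
    "c": [5, 6],     # "carelessness" in sorting
    "d": [3, 8],     # inversions
}

percentages = {
    name: round(100 * sum(counts[c] for c in cats) / TOTAL_SETS, 1)
    for name, cats in groups.items()
}
print(percentages)
```

On these counts the inversion group comprises 301 of the 712 sets, or 42.3 per cent, and the four groups together account for the 67.6 per cent of sets not classified as perfect sortings.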

There were fewer irregularities in the 5- 
interval data than in the 11- and 7-interval 
data. For the five-interval data 47.1 per cent 
of the ratings were classified as perfect, while 
only 29.2 per cent and 22.1 per cent of the 11- 
and 7-interval ratings, respectively, were so
classified.

Additional analyses of inversions were made. 
First, a count of the number of inversions 
found in each of the 301 sets of ratings which 
were classified in category 3 (10 or more in-
versions) and category 8 (less than 10 in-
versions) showed that 125 (41.5 per cent) of
the sets of ratings had only one inversion and 




that 259 (86 per cent) did not contain more
than five inversions. Second, using all sortings
in categories 3 through 9, the percentage of
inversions of each of the 49 items was obtained.
The distribution of these percentages showed
a significant shift toward increased percentages
of inversions when compared with the distri- 
butions of the percentages of inversions for 
the same items reported by Thurstone and 
Chave (10) and Wicker (11), both of whom 
also excluded judges before tallying judgments.
However, when the percentages of inversions
were computed on the basis of all sets of
ratings in categories 4 through 9, the distribu-
tion of percentages so obtained was similar to
the distributions of percentages for the Thur-
stone and Chave and for the Wicker data.
For the 49 items the percentage of inversions
computed on the basis of sortings in categories
3 through 9 and the percentages computed on
the basis of ratings in categories 4 through 9
correlated .63 and .67, respectively, with the
percentage of inversions found in the Thur-
stone and Chave data for these items.

For the sortings of the 29 judges with 10 or
more inversions (category 3), a detailed analy-
sis was made by tabulating on a sheet, with
statements as rows and with judges as columns,
each item inverted by each judge. Since the
inversions for each judge were fairly well
scattered among the entire set of 49 items,
they do not appear to be a function of system-
atic reversal of the ends of the continuum.
For each person the number of inversions of
favorable and unfavorable items was de-
termined; and for each judge the obtained
frequencies were tested against a 23:26 distri-
bution (the number of favorable and un-
favorable statements) by the use of chi square
corrected for continuity. In ten cases the chi
square was significant beyond the .10
level of confidence. In eight of these ten cases
the significance of chi square came from an
inversion of unfavorable items, while in two
cases the significance came from an inversion
of favorable items.
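The test applied to each judge can be sketched as a two-cell chi square with the usual continuity correction, in which 0.5 is subtracted from each absolute deviation before squaring. The function name and the example frequencies below are hypothetical; the 23:26 split and the .10 level come from the text, and 2.706 is the standard tabled chi-square value for 1 degree of freedom at the .10 level.

```python
def chi_square_23_26(favorable_inverted, unfavorable_inverted):
    """Continuity-corrected chi square for one judge's inversions.

    Observed counts of inverted favorable and unfavorable items are compared
    with the counts expected if inversions fell in the 23:26 proportion of
    favorable to unfavorable key statements.
    """
    observed = (favorable_inverted, unfavorable_inverted)
    total = sum(observed)
    if total == 0:
        raise ValueError("judge has no inversions to test")
    expected = (total * 23 / 49, total * 26 / 49)
    chi2 = 0.0
    for o, e in zip(observed, expected):
        # Correction for continuity: shrink each deviation by 0.5, not below 0.
        chi2 += max(abs(o - e) - 0.5, 0.0) ** 2 / e
    return chi2


CRITICAL_10 = 2.706  # chi square, 1 df, .10 level (standard table value)

# Hypothetical judge: 2 favorable and 12 unfavorable items inverted.
value = chi_square_23_26(2, 12)
print(round(value, 2), value > CRITICAL_10)
```

For this hypothetical judge the excess of inverted unfavorable items would be the source of the significant value, the pattern reported for eight of the ten significant cases.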

Finally, a distribution of these 29 cases in
a two-by-two table on the basis of bunching or
no bunching (as defined by Wicker's extrapola-
tion of the Thurstone and Chave criterion),
on the one hand, and on the basis of a signifi-
cant number of inversions or lack of a signifi-
cant number of inversions (using the p values
described above and with a p of less than .10
being considered as significant), on the other,
yielded a chi square corrected for continuity
of .9 and a phi coefficient of .18. Neither of
these values is significant.

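The reported phi coefficient is consistent with the usual relation between phi and chi square for a two-by-two table, phi = sqrt(chi square / N). The sketch below assumes that relation; whether the writer computed phi from the corrected or the uncorrected chi square is not stated, and the function name is illustrative.

```python
import math


def phi_from_chi_square(chi2, n):
    """Phi coefficient for a two-by-two table from its chi square and N."""
    return math.sqrt(chi2 / n)


# Values reported in the text: chi square of .9 on the 29 category-3 judges.
print(round(phi_from_chi_square(0.9, 29), 2))
```

With a chi square of .9 and N = 29 the relation gives approximately .18, matching the reported coefficient.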

DISCUSSION


From the data at hand it is not possible to 
determine whether the smaller number of 
irregularities in the five-interval data is a 
function of the number of intervals or of other 
factors. In this connection, however, it is 
interesting to note that a few of the judges 
complained that 11 intervals were too many. 

Since the data reported here are only 
descriptive in nature, they do not permit a 
conclusive determination of the causes of 
these different irregularities. While bunching 
of items (too many in one interval) and failure 
to use some intervals have been observed in 
our data, there is no evidence to explain the 
causes of these irregularities. 

To find an explanation for double markings 
and omissions, the writer examined all sets of 
ratings containing these irregularities. This
examination showed that omissions were
randomly scattered among the items and that
in practically every case where double marking
occurred, one rating fell on the opposite side
of the neutral interval from the other, an
observation which suggests that the judges
recorded a judgment incorrectly and then
failed to erase it before making a second
rating. These findings suggest that these ir-
regularities may well be a result of carelessness.

The decision to use inversions as a type of
irregularity resulted from the writer's experi-
ence with judges who indicated that they were
expressing agreement or disagreement with an
item rather than making a judgment of it.
This type of irregularity seems not to have been
discussed by other investigators. The data
show tendencies for the same items to be in-
verted with the same relative frequency by the
judges of Thurstone and Chave, of Wicker,
and of the writer. These inverted items tend
to be predominantly unfavorable statements.
Since 41.5 per cent of all sortings falling in
categories 3 and 8 had only one inversion, it
is possible that many of these inversions may
be attributed to carelessness in recording
judgments. But since the judgments of certain
items are inverted more frequently and with



some degree of consistency by different judging 
groups than are the ratings of some other 
items, it does not appear that carelessness 
provides an adequate explanation for all of 
them. 

Nor does the occurrence of inversions appear 
to be a function solely of the items. A detailed 
analysis of the sortings of 10 judges having a 
significantly large number of inversions showed 
that in no case were all favorable (or unfavor- 
able) items inverted by any one judge, nor 
were the judgments of any one item inverted 
by all judges. In view of these observations it 
would appear that the inversion process is not 
a systematic process but may be attributed to 
an interaction of a particular judge and a 
particular item. 

In view of his experience with judges whose 
pattern of judgments for the first three items 
did not follow the expected pattern, the writer 
proposes as an explanation of inversions the 
hypothesis that the judge’s ego involvement 
with certain items becomes so strong that it
disrupts his set for expressing an unbiased 
judgment and intensifies a set for expressing 
an agreement or disagreement. Under the 
influence of this disruption the judge places 
his rating at some position other than the one 
he would place it in if he were judging with 
an unbiased attitude. If it is assumed that he is 
judging extreme statements, this displacement 
would not be noticed as irregular if he displaces 
his judgment one or two intervals; only when 
he displaces the statement to such an extent 
that it falls on the unexpected side of the neu- 
tral interval will it be noticed as an inversion 
as defined here. 

While the operation of an ego involvement 
in this manner may be hard to demonstrate, 
the possibility of its existence seems consistent 
with the fact that it is relatively easy to get 
students to adopt a set for expressing agree- 
ment or disagreement with items; but it is 
difficult to get them to adopt a set for express- 
ing a judgment, since they prefer to express 
an agreement or disagreement with the items 
instead. 

REFERENCES 
1. EDWARDS, A. L., & KILPATRICK, F. P. A technique
for the construction of attitude scales. J. appl.
Psychol., 1948, 32, 374-384.

2. HINCKLEY, E. D. The influence of individual
opinion on construction of an attitude scale.
J. soc. Psychol., 1932, 3, 283-296.

3. HOVLAND, C. I., & SHERIF, M. Judgmental phe-
nomena and scales of attitude measurement:
item displacement in Thurstone scales. J. ab-
norm. soc. Psychol., 1952, 47, 822-832.

4. KELLEY, I. B. The construction and evaluation
of a scale to measure attitude toward any institu-
tion. Bull. Purdue Univer., 1934, 35, No. 4, 18-36.

5. KILPATRICK, F. P. A technique for the measure-
ment of attitudes. Unpublished master's thesis,
Univer. of Washington, 1947.

6. MILLER, H. E. The construction and evaluation
of a scale of attitudes towards occupations.
Bull. Purdue Univer., 1934, 35, No. 4, 68-76.

7. SEASHORE, R. H., & HEVNER, K. A time saving
device for the construction of attitude scales.
J. soc. Psychol., 1933, 4, 366-372.

8. SHERIF, M., & HOVLAND, C. I. Judgmental phe-
nomena and scales of attitude measurement:
placement of items with individual choice of
number of categories. J. abnorm. soc. Psychol.,
1953, 48, 135-141.

9. SILANCE, E. B., & REMMERS, H. H. An experi-
mental generalized master scale: a scale to
measure attitude toward any subject. Bull.
Purdue Univer., 1934, 35, No. 4, 84-87.

10. THURSTONE, L. L., & CHAVE, E. J. The measure-
ment of attitude. Chicago: Univer. of Chicago
Press, 1929.

11. WICKER, M. P. A comparison of attitude scale
values yielded by scales of differing lengths.
Unpublished master's thesis, Univer. of North
Carolina, 1950.


Received May 7, 1953. 


PERCEPTUAL RIGIDITY AS MEASURED BY ANISEIKONIC LENSES¹


WESLEY C. BECKER 
Stanford University 


THE study of personality rigidity has
taken a prominent place in the current
psychological literature.² Cattell and
Tiner (6) and Scheier and Ferguson (11) have
approached the problem by use of factor
analysis. Fisher (8) and others have used a
more clinically oriented approach. Though
numerous measures of rigidity have been
offered, several writers have emphasized the
fact that better measures are needed before
research in this area can advance. This paper
explores the utility of a novel perceptual
device, aniseikonic lenses, which seems to
avoid pitfalls of other measures of rigidity.

Aniseikonic lenses were initially designed to
study distortion in stereoscopic perception.
They magnify the image of one eye in the 45-
degree meridian and of the other in the 135-
degree meridian. The induced effect of the
lenses is to make a table appear to tip up like
a drawing board or to make a wall lean toward
the observer. Preliminary observations by the
writer indicated that people varied widely in
the time it took to see the effect and in the
degree to which distortion³ was perceived.
Some older people could not see any distortion
at all. Several six-year-olds reported seeing the
effect almost immediately. Individuals in the
intermediate age group varied greatly, more
or less independently of age. These and other
observations suggested that the aniseikonic
phenomenon might be related to personality
rigidity when the concept is defined, as in
Cattell (6), as the ease with which old estab-
lished habits may be changed in the presence
of new demands.


¹ This paper is condensed from a thesis submitted to
the Department of Psychology of Stanford University
in partial fulfillment of the requirements for the Master's
degree. The author wishes to thank Professors Joseph
[illegible] and Clarence L. Winder for advice and assistance
throughout the study. He is indebted to Jean Humphrey
and Wirt Wolff for their assistance in scoring
parts of the data.
² For general reviews of the literature on rigidity,
the reader is referred to Cattell (6) and Fisher (7).
³ Distortion as used here actually refers to percep-
tion more in line with the reality of the situation. But
the term is used because phenomenologically the in-
duced effect seems to be a distortion of reality as it is
commonly perceived.


General Hypothesis 

On the basis of the preliminary observations 
and as a step towards clarification of the 
meaning of the observed individual differences, 
the following hypothesis was made: Individuals 
who take a longer time and/or report less dis- 
tortion while looking through the aniseikonic 
lenses will reveal more rigidity on personality 
measures than those who take a shorter time 
to see the effect and/or report a greater amount 
of distortion. 


Measures of Personality Rigidity 

Three somewhat different methods of assess- 
ing personality rigidity were used. These meth- 
ods, with a brief description where appropriate, 
are given below. In addition, the specific pre- 
dictions that follow from the hypothesis have 
been indicated. 

Ego-control rating. Jack Block (4) and Jean 
Block (5) have defined a genotypical per- 
sonality construct called ego control which is 
rated from a standardized interview. Ego 
control refers to “the individual's characteristic 
means of handling or mediating both his in- 
ternal need-tensions and the demands imposed 
upon him by the external world" (4, p. 6). In 
their operational descriptions of three reference 
points, a number of statements suggest that 
the overcontroller has difficulty adapting to 
change and restructuring a situation. The 
undercontroller easily shifts from one frame of 
reference to another, while the appropriate- 
controller's ability to shift with environmental 
change is intermediate. Some people showing 
both "over" and “under” tendencies do not fit 
into the continuum and have been called 
“bimodal” (5, pp. 29-35). 

It was predicted that a shorter time delay 
and/or greater perceived distortion with the 
aniseikonic lenses would relate to undercon- 
trolling tendencies, whereas a longer time 
delay and/or lesser perceived distortion would 
relate to overcontrolling tendencies. 

Rorschach. A number of predictions were 
made about Rorschach variables assumed to 
be related to rigidity-flexibility.⁴ The predic-
tions have been stated in one direction only to
simplify exposition. 




It was predicted that a shorter time delay 
would relate (a) to a shorter average initial 
reaction time, (b) to the production of a greater
R, (c) to a greater Dd%, (d) to the production
of more S responses, (e) to a lower +%, (f) to
the production of a greater variety of content,
(g) to the use of a greater variety of determi-
nants, and (h) to a higher Sum C. It was con-
currently predicted that each of the above
Rorschach variables would relate to the degree 
of perceived distortion in a direction opposite 
to that stated for the time delay. 

The Rorschach protocols were scored by 
Klopfer's method (10) with the exception of 
form level for which Beck's tables (1, 2) were 
used. All responses with a form element were 
included in the +%. Additional white space 
and color responses were given half weight.
The determinant variety score is the number of
different determinants used in any way. A 
different determinant was defined in terms of 
the groupings under Total on the Rorschach 
Individual Record Blank (9). 

Fisher's rigidity scale. Fisher (8) devised a 
system of weights which are given to various 
extreme Rorschach scores, the sum of which is 
assumed to reflect a “general personality 
rigidity.” It was predicted that a shorter time 
delay and/or a greater estimate of distortion 
would relate to a lower score on the scale. Since
Fisher's weightings were based on Beck's 
scoring, minor adaptations of the scale were 
necessary in using it with Klopfer's system. 
This scale does not add much novelty, because 
of its overlap with previous predictions. 


METHOD


Procedure. Each subject (S) was individually given
the standardized interview designed by the Blocks (4,
5) from which the rating of ego control was made.
After the rating was completed S was instructed as
follows:

Stand here and look at this desk. I'm going
to have you put these glasses on in a second,
and as soon as you see the desk start to tip, you
tell me. I want you to look generally in this di-
rection (center of the table) and try to take in
the whole top in one view. Don't move your
eyes around freely, but you may have to do
some shifting of your [illegible] from front to back
[illegible] movement to a mini-
mum, however, and try to keep a general fixed
focus which takes in the whole top of the desk.
[illegible] to me as soon as you can tell which way
it has tipped. Say “now” or something like that.
I'll keep one eye covered till I start my watch.
After the first trial you'll know pretty well
what you're looking for.

⁴ The reader concerned with the rationale for these
predictions is referred to the original work (3).

After S had reported the distortion, he was given 
time for the effect to stabilize and then asked, "how 
much higher would you say the back is than the front 
in inches?" The report was recorded. This procedure 
with the aniseikonic lenses was repeated for seven 
trials. One minute was allowed to elapse with the 
lenses off between trials. 

Since a preliminary study indicated that learning 
effects were minimal after the first two trials, an aver-
age of the last four out of seven trials was used as the
measure of the time delay. The report given on the 
first trial was used for the estimate of distortion score, 
since this seemed to represent S’s maximum. 

In this part of the experiment, S stood three feet 
from the front of a 30-inch-high desk. A two-by-four 
foot plywood top was used to add texture cues for bet- 
ter depth judgment. On the desk were a flexible neck 
lamp, a tennis ball, and an oil can about six inches high. 

An individual Rorschach was then administered 
using the procedure set forth by Klopfer and Kelley (10). 

Subjects. The Ss were 10 women and 22 men from
an elementary psychology class at Stanford Univer-
sity. Their ages ranged from 18 to 48 with a mean of
24.5 and an SD of 7.37. Eleven Ss who wore glasses
underneath the experimental lenses during the ex-
periment had a slightly longer average reaction time,
but not significantly so. The total N was 32 for all
relationships except ego control, where the elimination
of 5 bimodals left an N of 27.
RESULTS 


Reliability and consistency. For reliability
purposes, a second ego-control rating was made
from the interview protocols by an experienced
clinician familiar with the ego-control con-
struct. On a six-point scale, there was agree-
ment within one point in 69 per cent of the
cases and a deviation of more than one in 31
per cent of the cases. Discrepancies in ratings
were eliminated by a discussion session.

With an analysis of variance technique to
estimate the error variance, the reliability
coefficient of the averaged time delay score was
found to be .95. McNemar⁶ has shown that the
procedure used will lead to values equal
to those obtained by the Spearman-Brown
formula.
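The two computations mentioned above can be sketched as follows. This is one common form of an analysis-of-variance reliability estimate for a mean of several trials (one minus the ratio of the residual mean square to the between-subjects mean square), shown alongside the Spearman-Brown formula; it is an assumption that this corresponds to the procedure actually used, which the text does not spell out. The function names and the small score matrix are hypothetical, and the sketch does not by itself demonstrate the equivalence attributed to McNemar.

```python
def spearman_brown(r_single, k):
    """Reliability of a score lengthened k times, given single-unit reliability."""
    return k * r_single / (1 + (k - 1) * r_single)


def anova_reliability(scores):
    """Reliability of each subject's mean over trials from a two-way layout.

    scores: list of rows, one per subject, each holding k trial scores.
    Returns 1 - MS(residual) / MS(between subjects).
    """
    n, k = len(scores), len(scores[0])
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_resid = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)
    ms_resid = ss_resid / ((n - 1) * (k - 1))
    return 1 - ms_resid / ms_rows


# Hypothetical time delay scores: three subjects, two trials each.
demo = [[1, 2], [2, 3], [3, 5]]
print(round(anova_reliability(demo), 3))
print(round(spearman_brown(0.5, 2), 3))
```

In the study the averaged score was based on four trials per subject, so the corresponding matrix would have four columns.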

⁵ The effect of the lenses does not occur with monoc-
ular vision. This allows E to control precisely the time
of starting by covering one eye until both S and E are
ready.
⁶ McNemar, Q. Personal communication.


An estimate of the degree of individual con-
sistency of the time delay score in different
situations was made in the preliminary study.
Four different situations varying in dimension
and familiarity of structure were used. By
determining the reliability of an individual's
mean score based on the four situations, the
degree of individual consistency can be esti-
mated. Such an estimate of reliability gives a
coefficient of .83. This is based on an N of 15
and is significant beyond the .001 level.

Evidence that the measure is fairly stable
over time was obtained on nine Ss who were
retested from three to four weeks later. The
rank-difference correlation for the time delay
scores was .70. This is a conservative estimate
of consistency since the conditions of measure-
ment differed considerably from the first to
the second testing.

Relations to ego control. The predictions for
the time delay and the degree of distor-
tion as related to ego control are confirmed.
The product-moment correlation between the
time delay and ego-control ratings is .58, sig-
nificant at the .001 level for an N of 27. The
product-moment correlation between the
estimated distortion and ego control is -.58,
significant at the .001 level. The fact that the
time delay and distortion scores intercorrelate
at -.59 should be kept in mind in interpreting
these results.
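The significance of a product-moment correlation is commonly checked by converting r to a t statistic with N - 2 degrees of freedom. The sketch below assumes that approach; the text does not say which test was used or whether it was one-tailed. The critical values are standard tabled t values for 25 degrees of freedom, and the names are illustrative.

```python
import math


def t_for_r(r, n):
    """t statistic, with n - 2 degrees of freedom, for a correlation r."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r * r)


# Standard tabled t values for 25 degrees of freedom at the .001 level.
T_ONE_TAILED_001_DF25 = 3.450
T_TWO_TAILED_001_DF25 = 3.725

t = t_for_r(0.58, 27)
print(round(t, 2), t > T_ONE_TAILED_001_DF25, t > T_TWO_TAILED_001_DF25)
```

On this reading a correlation of .58 with an N of 27 gives a t of about 3.56, which exceeds the one-tailed .001 value but not the two-tailed one, so the reported level appears to fit a one-tailed test of the directional prediction.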

Rorschach predictions. Tetrachoric correla- 
on coefficients’ and the chi squares for the 18 
ictions to Rorschach variables are pre- 
ed in Table 1. The tetrachoric coefficients 
uld be interpreted with caution for this 
mall an V. They are presented merely to give 
1 idea of the relative magnitude of the rela- 

nships. A shorter time delay with the anisei- 
lenses is significantly related to (a) a 
ter Dd%, (b) a greater variety of content, 
M à greater use of S, (d) a greater variety of 

terminants, (e) a higher Sum C, and (f) a 
er score on Fisher's rigidity scale. Initial 

on time, R, and + % failed to reach the 
level of significance, although the results 

in the expected direction. The degree of 
'ortion was significantly related to R and to 
"All chi squares are based on two-by-two tables. 
bks were cut as close to the median as pos- 


tetrachoric correlations were computed by 
of Thurstone’s diagrams (12). 


PERCEPTUAL RIGIDITY AS MEASURED BY ANISEIKONIC LENSES 


421 


TABLE 1

THE RELATIONS OF TIME DELAY AND DISTORTION
TO THE RORSCHACH VARIABLES

                         |      TIME DELAY      |      DISTORTION
RORSCHACH VARIABLE       |  χ²  |  p*  |  r_t†  |  χ²  |  p*  |  r_t†
Reaction time            | 2.00 | .08  |  .37   | 1.17 | .15  | -.29
R                        |  .50 | .25  | -.18   | 3.24 | .04  |  .48
Dd%                      | 6.15 | .01  | -.63   | 4.40 | .02  |  .53
No. S                    | 3.14 | .04  | -.46   | 1.89 | .10  |  .36
+%                       |  .80 | .25  |  .18   | 1.17 | .15  | -.29
Sum C                    | 8.00 | .001 | -.71   | 6.35 | .01  |  .66
Variety of content       | 6.15 | .01  | -.63   | 7.94 | .01  |  .70
Variety of determinants  | 4.20 | .02  | -.54   | 5.87 | .01  |  .60
Fisher's scale           | 3.14 | .04  |  .46   | 4.40 | .02  | -.53

* One-tailed.
† Tetrachoric coefficient.


those same variables which were significantly
related to the time delay with the exception 
of S. 


SUMMARY AND CONCLUSIONS 


On the basis of current theory as to the 


nature of personality rigidity, it was postulated 
rigidity would 




are meaningfully related to other measures of 
perceptual and personality rigidity is supported 
by the data. 


REFERENCES 


1. Beck, S. J. Introduction to the Rorschach method: a manual of
personality study. Amer. Orthopsychiat. Ass. Monogr., 1937, No. 1.

2. Beck, S. J. Rorschach's test. Vol. I. Basic processes. New York:
Grune & Stratton, 1944.

3. Becker, W. C. The use of the aniseikonic glasses as a measure of
perceptual rigidity. Unpublished master's thesis, Stanford Univer.,
1953.

4. Block, J. An experimental investigation of the construct of
ego-control. Unpublished doctor's dissertation, Stanford Univer.,
1950.

5. Block, Jeanne H. An experimental study of a topological
representation of ego-structure. Unpublished doctor's dissertation,
Stanford Univer., 1951.


WESLEY C. BECKER 


6. Cattell, R. B., & Tiner, L. Ghose. The varieties of structural
rigidity. J. Pers., 1949, 17, 321-341.

7. Fisher, S. An overview of trends in research dealing with
personality rigidity. J. Pers., 1949, 17, 342-351.

8. Fisher, S. Patterns of personality rigidity and some of their
determinants. Psychol. Monogr., 1950, 64, No. 1 (Whole No. 307).

9. Klopfer, B., & Davidson, Helen. The Rorschach method of
personality diagnosis, individual record blank. Yonkers, N. Y.:
World Book Co., 1942.

10. Klopfer, B., & Kelley, D. M. The Rorschach technique. New York:
World Book Co., 1946.

11. Scheier, I. H., & Ferguson, G. A. Further factorial studies of
tests of rigidity. Canad. J. Psychol., 1952, 6, 18-30.

12. Thurstone, L. L., Chesire, Leona, & Saffir, M. Computing
diagrams for the tetrachoric correlation coefficient. Chicago:
Univer. of Chicago Bookstore, 1933.


Received May 7, 1953. 




VALUES AS DETERMINANTS OF WORD-RECOGNITION THRESHOLDS¹


J. C. GILCHRIST, J. F. LUDEMAN, AND WILLIAM LYSAK


University of Wisconsin


THE issues of motivated perception such as value (8), perceptual
defense, and subception (2, 5, 6) are still unclear and the subject
of controversy (1, 7). In an attempt to clarify some of the factors
involved, the present study was undertaken with the expectation
(a) that value, both positive and negative, lowers word-recognition
thresholds; (b) that the effects of value upon such thresholds
cannot be explained on the basis of frequency of word usage;
(c) that negative value should not be assigned equivalent status
with emotionality; and (d) that the latter raises word-recognition
thresholds.


APPARATUS AND PROCEDURE 


The major piece of apparatus was an overhead tachistoscopic slide
projector. The lamp of this projector was connected with a variable
transformer capable of delivering from 0 to 135 volts. A voltmeter
giving voltage readings accurate to within ± 1.0% was included in
the lamp circuit. The projector was equipped with a shutter which
made flash presentations of projected words possible. Flashes were
constant in duration (.01 sec.) throughout the experiment. The word
image appeared upon a beaded projection screen which had a black
¾" x ¼" fixation mark affixed directly below the site at which the
center of the projected word appeared. The screen was located 12 ft.
in front of the projector. The words used in the experiment were
typed in capital letters on slide mats. Each of the words appeared
on two slides; once with the word ink typed one space above and
below it, and once with Jew above and below it. These will
hereinafter be referred to as “surrounds.” The positively valued
words were: able, brave, honest, and loyal. The neutral words were:
chair, [illegible], orange, and salt. The negative words were:
cheap, dirty, loud, and vulgar. The positive words were selected as
representative of the values deemed positive by Lynd and Lynd (4).
The negative words were chosen in the same fashion except that they
were also screened to reflect the negative stereotype of the Jew in
American culture. According to the Lorge-Thorndike word counts (9),
the positive words appear in the first, second, [illegible], and
fourth thousand most frequently used words, respectively; the
neutral words in the first, [illegible], second, and first,
respectively; the negative words in the second, third, second, and
sixth, respectively. Each set of value words contains 1 four-,
2 five-, and 1 six-letter word.


¹ This study was supported, in part, by funds supplied by The
Research Committee, The Graduate School, University of Wisconsin.






The Ss were 30 University of Wisconsin students en- 
rolled in a first course in psychology. They composed 
two groups of 15 each which were obtained from the up- 
per and lower quarters, respectively, of a distribution of 
235 scores on the Levinson-Sanford (3) scale of anti- 
Semitism. Possible scores on this scale range from 52 to 
364. The group of low-prejudice Ss had a range from 52 
to 98; the high-prejudice group, a range from 199 to 
250. None of the Ss were Jews. The Ss were not told the 
reason for their selection, nor their scores on the scale. 
Despite this, the ratio of high-prejudice to low-prejudice 
volunteers was much less than one, indicating the op- 
eration of some other selective factor in that group. 
The low-prejudice group was made equivalent to the 
high-prejudice group with respect to number and date 
of appearance for the experimental session. 

Each S was brought into the laboratory and seated 
in a chair facing the screen which was situated directly 
in front of the projector. He was given the following in- 
structions: 

This is an experiment in perception. I am 

going to flash on the screen there (E points) 
groups of three words for just a fraction of a 
second. The words will be arranged like this. 
(E shows S card on which is typed the word 
please with cat above and below it.) Your job 
will be to identify the middle word which will 
appear just above that little black mark. Before
each trial, I'll say “ready” and then one second
later the flash will appear. You probably
won't be able to see very much for the first 
few trials but soon the words will become 
clearer. As soon as you can make them out, 
tell me what they are. It's all right to guess. 
Tell me what the word is or what any part of 
it might be as soon as you can. Are there any 
questions? O.K. Let's try a few practice words 
before we begin the experimental words. 

Three practice words, hand, solve, and people—with 
bit, and, and nod as their respective surrounds—were 
used to obtain an initial mean threshold for each S. 
This was done in the following manner. Voltage across 
the lamp filament was set at 50 volts and the word was 
exposed once. It was then exposed once at each 2-volt 
step above 50 volts until the S gave two successive 
correct reports of the surrounded word. Threshold was 
considered to be the voltage setting at which the first 
of the two successive correct reports was given. The 
mean threshold was determined for the three practice 
words, and 20 volts were subtracted from it. This 
value was the starting point for each subsequent thresh- 
old determination on the experimental words, each of 
which was given each S once in each surround. Each
S, therefore, appeared in each value and surround cell
of the experimental design, with prejudice groups being 
independent. The threshold determinations with ex- 
perimental words, except for the starting point, were 


TABLE 1
SUMMARY OF ANALYSIS OF VARIANCE

SOURCE OF VARIATION           | df | MEAN SQUARE |   F   |   p
Value                         |  2 |   131.15    |  9.44 | <.001
Prejudice                     |  1 |    12.83    |  -*   |
Surround                      |  1 |    28.40    |  2.04 |
Ss within prejudice groups    | 28 |   356.49    | 25.65 | <.001
Prejudice X value             |  2 |    12.15    |  -    |
Prejudice X surround          |  1 |    18.38    |  1.32 |
Value X surround              |  2 |    53.57    |  3.85 | <.05
Value X Ss within             | 56 |    25.02    |  1.80 |
Surround X Ss within          | 28 |    11.84    |  -    |
Prejudice X value X surround  |  2 |     7.64    |  -    |
Residual                      | 56 |    13.90    |       |

* Tested by Ss within prejudice groups.


the same as that described above. The experimental 
words were presented in a different random order for 
each S. All responses made by the S to each exposure 
were recorded. At the conclusion of the experiment, 
each S was asked to recall the words seen, and these 
responses were also recorded. This completed the ex- 
perimental session. 
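
The threshold procedure described above can be sketched in code. This is only an illustration under stated assumptions: the function names, the simulated observer, and the upper limit of 135 volts (taken from the transformer range) are not part of the original procedure description.

```python
from statistics import mean

def ascending_threshold(is_correct, start_volts, step=2, max_volts=135):
    """Expose the word once at each voltage step and return the voltage at
    which the first of two successive correct reports was given.
    Returns None if no two successive correct reports occur."""
    previous_correct = False
    volts = start_volts
    while volts <= max_volts:
        correct = is_correct(volts)
        if correct and previous_correct:
            return volts - step  # the first report of the successive pair
        previous_correct = correct
        volts += step
    return None

def experimental_start(practice_thresholds, offset=20):
    """Starting voltage for the experimental words: the mean practice
    threshold minus 20 volts."""
    return mean(practice_thresholds) - offset

# Hypothetical observer who reports correctly at 84 volts and above.
practice = ascending_threshold(lambda v: v >= 84, start_volts=50)
start = experimental_start([84, 88, 86])
print(practice, start)
```

Practice words start at 50 volts; experimental words would call the same routine with the starting voltage returned by `experimental_start`.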


RESULTS²


Since, in the range of values between the 
lowest starting point of any S and the highest 
single word threshold of any S, the relationship 
between lamp voltage and illuminance was 
essentially rectilinear, all data are presented in 
terms of lamp voltage. Inasmuch as no signifi- 
cant difference was found between the pre- 
liminary thresholds of the high-prejudice and 
low-prejudice groups, an analysis of variance 
was performed on the data, the results of which 
are presented in Table 1. Individual word 
thresholds were not used in the analysis. The 
data used were the Ss’ mean thresholds for all 
four words in a value set in each surround. 
Plots of the data for the significant variables 
as indicated by the analysis of variance are 
presented in Fig. 1 and 2. The t values for the
various points in the two surrounds are all 
significant at better than the 5% level of confi- 


² The data reported herein comprise a duplication of
a previously unreported experiment, the data from 
which were discarded when a possible change in pro- 
jector characteristics was noted at the conclusion of 
the experiment. The main effects and interactions for 
which significant differences were obtained are pre- 
cisely the same in this experiment as in the previous one, 
although the absolute values differ as a result of the 
changed apparatus. 




dence: that for the positive mean thresholds 
yields a p < .02; that for the neutral mean
thresholds gives a p < .05; and that for the 
negative mean thresholds, a p < .01. 
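
The F ratios in the analysis-of-variance summary can be checked by simple division. The sketch below is a hedged reading of Table 1, not a statement of the authors' method: it assumes that each F listed was formed against the residual mean square (13.90), an inference from the arithmetic rather than something the paper states, and it uses only rows whose figures are legible in the source.

```python
# Mean squares as printed in Table 1 (legible rows only).
mean_squares = {
    "Value": 131.15,
    "Surround": 28.40,
    "Ss within prejudice groups": 356.49,
    "Prejudice X surround": 18.38,
    "Value X surround": 53.57,
    "Value X Ss within": 25.02,
}
residual_ms = 13.90  # assumed error term for every ratio below

# F = mean square for the source / error mean square
f_ratios = {source: round(ms / residual_ms, 2)
            for source, ms in mean_squares.items()}

for source, f in f_ratios.items():
    print(f"{source:30s} F = {f:.2f}")
```

Under that assumption the computed ratios agree with the printed F column to two decimals.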

Tabulation of prerecognition responses 
showed no significant differences between 
groups or between value categories of words 
when classified in terms of the categories 
co-valuant, contra-valuant, structural, non- 
sense, and unrelated used by Postman, Bruner, 
and McGinnies (8). Nor were there any sig- 
nificant differences found between groups and 
value categories for recall. 


FIG. 1. MEAN THRESHOLDS IN LAMP VOLTAGE OF NEGATIVE, NEUTRAL, AND
POSITIVE VALUE WORDS COMBINED FOR BOTH SURROUNDS AND PREJUDICE
GROUPS
(Axes: Word Value, -, 0, +, against Lamp Voltage.)


FIG. 2. MEAN THRESHOLDS IN LAMP VOLTAGE OF NEGATIVE, NEUTRAL, AND
POSITIVE VALUE WORDS COMBINED FOR PREJUDICE GROUPS
(Axes: Word Value, -, 0, +, against Lamp Voltage.)




Discussion 


It is apparent from Fig. 1 that both positively and negatively
valued words have lower recognition thresholds than neutrally
valued words. That word frequency is an inadequate explanation of
this result is shown by the fact that the neutral words have a
higher mean frequency than do either the positive or negative
words. Further, such a finding indicates that previous
investigators have perhaps dealt with only one portion of the
curve, or with another aspect of the problem, when they have
implied that only positive values lower word-recognition
thresholds. McGinnies, in fact, has explicitly stated this latter
position: “When ... individuals are faced with stimulus objects
which are not actually threatening, but which represent for them
areas of little interest or some antipathy, they also generally
display raised thresholds of recognition” (6, p. 244). Our evidence
would seem to negate such a statement if by “some antipathy” is
meant negative value.

Briefly, we believe that the difference is due to the kinds of
negatively valued words used in the several experiments. Words such
as whore and bitch are not just carriers of negative value: saying
them in an uncertain social setting (e.g., S with an E who, at
least, is not too well known by S) may result in tacit disapproval
of the speaker or even overt expressions of condemnation. This
position is, of course, that of Howes and Solomon (1) and implies
that the phenomenon observed by McGinnies is one of response
suppression rather than one of perceptual defense.

It is, however, difficult to maintain this position in the face of
the data shown in Fig. 2. Assuming, not unreasonably (10), that our
Ss all saw the surrounds in each condition, it is possible to argue
that the effect of the emotionally loaded Jew surround actually
produced sensitization, as evidenced by the lowered neutral
threshold. However, in conjunction with either value category, it
produced response suppression as indicated by a reluctance to make
the connection of either brave Jew or dirty Jew until certainty was
reached. But why suppression of brave, at least by the
low-prejudice Ss?

³ The finding of low thresholds for negatively valued words may
also explain Lazarus and McCleary's (2) data wherein they get high
GSR's to shock syllables in [illegible] trials but no differences
in thresholds. [Illegible] that their comparisons were between
syllables [illegible] chosen by S would make no difference to this
argument when their techniques of forced choice and threat of shock
are taken into account. At this point, however, we hesitate to make
this claim categorically.

Nevertheless, it is true that only four of the 
30 Ss ever reported seeing, during the experi- 
ment or in postexperimental interrogations, 
either surround. And in this situation there 
seems less reason to suspect withholding by Ss 
than in McGinnies' case. Even so it is equally, 
if not more, difficult to maintain the perceptual 
defense position, for it would appear that from 
this position one would have to predict raised 
thresholds for the neutral category in the Jew 
surround. Further, it seems that one would 
have to predict, as well, an interaction between 
prejudice, value, and surround, with higher 
thresholds in the positive category and lower 
thresholds in the negative category in the Jew 
surround for high-prejudice Ss, and with the 
reverse occurring for the low-prejudice Ss. No 
suggestion of this trend can be found in the 
data. In its absence, the results of the experi- 
ment remain ambiguous on the question of the 
effects of emotionality upon word-recognition 
thresholds. Nevertheless, it is believed that the 
finding of raised thresholds for the positive 
category poses problems for proponents of 
both "response suppression" and "perceptual 
defense." With the present evidence, we see no 
immediately clear solution. But in the first 
flush of reaction against perceptual defense we 
feel that caution should prevail lest “the baby 
go the way of the bathwater.” There is sub- 
stantial evidence to the effect that attitudes 
and value systems selectively change a wide 
variety of behaviors. This evidence shows that 
when stimulus or mnemonic material contains 
values counter to those of S, the effect is 
usually to remove them from S’s field. Such 
evidence indirectly makes some form of a 
perceptual defense hypothesis plausible. On 
the other hand, the evidence of this study does 
seem to indicate that if we want to find out 
what these effects are in perception, we will 
have to become more subtle and stop intimi- 
dating naive college student Ss with words 
like whore and bitch. 


SUMMARY 


Some effects of word value and emotional 
context upon word-recognition thresholds have 




been investigated using high- and low-prejudice 
Ss to define the meaning of the context values. 

In general, it was found that (a) both posi- 
tive and negative values lowered word-recogni- 
tion thresholds in comparison with neutral 
value, and (b) emotionally loaded contexts have 
the effect of raising the thresholds of both 
positively and negatively valued words while
lowering that of neutral words.

It was pointed out that these data pose 
problems for both the concepts of “response 
suppression" and "perceptual defense." 


REFERENCES 


1. Howes, D. H., & Solomon, R. L. A note on McGinnies'
“Emotionality and perceptual defense.” Psychol. Rev., 1950, 57,
229-234.

2. Lazarus, R. S., & McCleary, R. A. Autonomic discrimination
without awareness: a study of subception. Psychol. Rev., 1951, 58,
113-122.

3. Levinson, D. J., & Sanford, R. N. A scale for the measurement of
anti-Semitism. J. Psychol., 1944, 17, 339-370.

4. Lynd, R. S., & Lynd, Helen M. Middletown in transition: a study
in cultural conflicts. New York: Harcourt, Brace, 1937.

5. McCleary, R. A., & Lazarus, R. S. Autonomic discrimination
without awareness: an interim report. J. Pers., 1949, 18, 171-179.

6. McGinnies, E. Emotionality and perceptual defense. Psychol.
Rev., 1949, 56, 244-251.

7. McGinnies, E. Discussion of Howes and Solomon's note on
“Emotionality and perceptual defense.” Psychol. Rev., 1950, 57,
235-240.

8. Postman, L., Bruner, J., & McGinnies, E. Personal values as
selective factors in perception. J. abnorm. soc. Psychol., 1948,
43, 142-154.

9. Thorndike, E. L., & Lorge, I. The teacher's word book of 30,000
words. New York: Teachers Coll., Columbia Univer., 1944.

10. Whitaker, E. M., Gilchrist, J. C., & Fisher, J. W. Perceptual
defense or response suppression? J. abnorm. soc. Psychol., 1952,
47, 732-733.


Received May 14, 1953. 


WORD FREQUENCY AND THE MEASUREMENT OF VALUE AREAS¹


DONALD R. BROWN AND JOE ADAMS
Bryn Mawr College 


THE measurement of value areas by such
tests as the Allport-Vernon Scale of
Values (2) and the recent revision by 
Lindzey (3) has been characterized by Solomon 
and Howes (7) as a function of word frequency. 
They reach this conclusion as a result of their 
critical analysis of the earlier work of Postman, 
Bruner, and McGinnies (6) on the relationship 
between value and perceptual threshold. We 
have already treated the Solomon and Howes 
frequency hypothesis in a theoretical manner 
(1). The present report is the result of an 
attempt to test our contention that such con- 
structs as “value” or “meaning” are useful 
theoretical constructs, admissible to opera- 
tional demonstration, and necessary to explain 
results obtained on such tests as the Allport- 
Vernon. We decided to test the Solomon- 
Howes word frequency hypothesis by maxi- 
mizing conditions in favor of this hypothesis, 
so that a clear-cut prediction on the basis of
the word frequency hypothesis could be made. 
Our prediction, on the other hand, was that 
central cognitive mechanisms or structures 
would tend to overcome wide differences in 
the frequencies of the stimuli. 


METHOD 


Six forms of a value category test were constructed 
after the general pattern of the Allport-Vernon (2) test, 
using the six Spranger value categories. The test con- 
sisted of 40 multiple-choice items. Each item con- 
tained three one-word choices, and each choice repre- 
sented one of the six value categories. Since there are 
20 combinations of 6 categories taken 3 at a time, the 40
items included each possible triple combination twice. 
Each of the 6 forms favored one of the value categories 
in terms of frequency; that is, each time that category 
appeared in the form as a choice, it was represented by 
a very high frequency word from the Thorndike-Lorge 
count (9), while all other value categories on that form 
were represented by very low frequency words. The 
high frequency words (of which there were 120 in all, 
20 for each value category) were chosen from the G or 
general count of Thorndike and Lorge and were words 
which in 57 cases appeared 100 times or more in a mil- 
lion words; in 47 cases appeared from 50 to 99 times 


¹ Part of this paper was delivered at the Eastern
Psychological Association meetings in April 1953, at 
Boston, Massachusetts. 


in a million words; and in 16 cases appeared from 20 
to 43 times. The infrequent words appeared 10 times 
or less in a million words, with the great majority of 
them having fewer than 5 appearances. Occasionally, 
it was necessary to go into lists other than the G count 
to get the right word, but always the most infrequent 
word that could be found that connoted the proper 
value category was chosen. 
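
The item counts in this design follow from simple combinatorics, which can be verified with a short sketch. The category labels come from the tables in this paper; the variable names are illustrative only.

```python
from collections import Counter
from itertools import combinations

CATEGORIES = ["Aesthetic", "Economic", "Political",
              "Religious", "Social", "Theoretical"]

# Every unordered triple of the six value categories.
triples = list(combinations(CATEGORIES, 3))

# Each triple is used twice, giving the 40 items of one form.
items = triples * 2

# Number of items in which each category appears as a choice.
appearances = Counter(cat for item in items for cat in item)

# High frequency words by frequency band, as reported in the text.
high_frequency_counts = [57, 47, 16]

print(len(triples), len(items), dict(appearances), sum(high_frequency_counts))
```

Each category appears in 20 of the 40 items, which matches the 20 high frequency words required per category and the total of 120.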

The following item is typical of the test, only one of 
the two words (high and low frequency) for each of the 
three value areas actually appearing on a given form: 

1. Scientific research should seek
(Theoretical)  Truth. (AA)²          Verity. (2)
(Social)       Social gain. (A-AA)   Humanitarianism. (2)
(Economic)     Profit. (A)           Remuneration. (1)


The subjects (Ss) were 121 women students in the 
required history of philosophy course of a liberal arts 
college. They were divided into six groups of approxi-
mately 20 each and were given the Allport-Vernon-
Lindzey Scale of Values (3),³ followed immediately by
one of the six forms of our test. 


RESULTS 


Table 1 shows the mean score and the standard deviation (in
parentheses) for each of the six groups of Ss for the six value
categories. The number of Ss in each group is indicated in
parentheses beside the letter corresponding to the form which that
group took. Each form is named by the letter of the category
favored (in word frequency) by that form.

According to the word frequency hypothesis, the diagonal means
should be larger than other means in the same column. Table 2 shows
the mean of the group taking the favored form, the mean of the
other five groups, and a t test of the difference between the two
means for each value category. The p value is that for a
one-tailed instead of a two-tailed test, as the

direction of the difference is predicted by the 


² The following is a key to the symbols appearing in parentheses
after each word. These symbols did not, of course, appear on the
test.

AA = 100 or over occurrences per million words.

A = 50 or over occurrences per million words.

The numerals refer to the actual number of occurrences per million
words.

³ Since the publication of the Solomon-Howes article, the Lindzey
revision of the Allport-Vernon was published. The changes between
the old and new forms are not relevant for our purposes and,
therefore, we used the Allport-Vernon-Lindzey revision in our
experiment because it was the only form of this test available.




TABLE 1

MEANS AND STANDARD DEVIATIONS OF SCORES OF SIX DIFFERENT GROUPS
ON THE SIX VALUE CATEGORIES

                            VALUE CATEGORY
FORM  | AESTHETIC | ECONOMIC | POLITICAL | RELIGIOUS | SOCIAL | THEORETICAL
A(21) |   10.05   |   4.00   |   6.10    |   5.67    |  8.24  |    5.43
      |    (?)    |   (?)    |   (3.1)   |   (4.1)   | (3.9)  |   (2.5)
E(22) |    8.73   |   3.18   |   7.18    |   5.00    |  8.18  |    6.91
      |   (2.8)   |  (2.2)   |   (3.2)   |   (4.0)   | (2.8)  |   (3.2)
P(20) |    9.05   |   3.15   |   5.65    |   6.65    |  7.90  |    7.20
      |   (3.0)   |  (2.2)   |   (2.2)   |   (5.3)   | (3.2)  |   (2.1)
R(19) |    7.42   |   3.37   |   7.00    |   8.05    |  7.95  |    4.95
      |   (3.8)   |  (2.2)   |   (3.7)   |   (4.2)   | (2.7)  |   (3.3)
S(20) |    8.80   |   3.25   |   6.15    |   5.55    |  9.25  |    6.45
      |   (6.0)   |  (2.5)   |   (2.1)   |   (4.1)   | (3.2)  |   (2.7)
T(21) |    7.95   |   3.05   |   6.24    |   3.62    |  9.24  |    9.67
      |   (3.7)   |  (2.4)   |   (2.0)   |   (3.5)   | (3.9)  |    (?)
TABLE 2

DIFFERENCES BETWEEN MEANS ON FAVORED FORM AND OTHER FIVE
(NONFAVORED) FORMS FOR EACH VALUE CATEGORY

CATEGORY    | MEAN, FAVORED | MEAN, OTHER FIVE | DIFFERENCE |   t   | p (one-tailed)
Aesthetic   |     10.05     |       8.40       |    1.65    |  1.83 |     <.07
Economic    |      3.18     |       3.37       |   -0.19    | -0.35 |
Political   |      5.65     |       6.53       |   -0.88    | -1.49 |
Religious   |      8.05     |       5.28       |    2.77    |  2.59 |     <.01
Social      |      9.25     |       8.31       |     .94    |  1.18 |
Theoretical |      9.67     |       6.21       |    3.46    |  4.68 |     <.01


word frequency hypothesis. The value of t is in each case computed
so as to eliminate differences between the means of nonfavored
forms as a contribution to the estimate of sampling error; this
means that we are not considering the five nonfavored forms as a
random sample from a universe of nonfavored forms. If this
correction had not been made, each t would be smaller than it is as
actually computed.
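
The nonfavored-form means in Table 2 can be recovered from Table 1 as size-weighted averages of the five other groups. The sketch below illustrates this for two categories. It assumes group sizes of 21 for forms A and T, which are garbled in the source and are inferred here from the printed means and from the 117 denominator degrees of freedom given with Table 3; the function name is illustrative.

```python
# Group sizes by form; 21 for A and T is an assumption (see lead-in).
sizes = {"A": 21, "E": 22, "P": 20, "R": 19, "S": 20, "T": 21}

# Column means from Table 1 for two of the value categories.
means = {
    "Religious":   {"A": 5.67, "E": 5.00, "P": 6.65,
                    "R": 8.05, "S": 5.55, "T": 3.62},
    "Theoretical": {"A": 5.43, "E": 6.91, "P": 7.20,
                    "R": 4.95, "S": 6.45, "T": 9.67},
}
favored_form = {"Religious": "R", "Theoretical": "T"}

def nonfavored_mean(category: str) -> float:
    """Size-weighted mean of the five groups that did not take the
    form favoring this category."""
    fav = favored_form[category]
    total = sum(sizes[f] * m for f, m in means[category].items() if f != fav)
    n = sum(size for f, size in sizes.items() if f != fav)
    return total / n

for category in means:
    fav_mean = means[category][favored_form[category]]
    other = nonfavored_mean(category)
    print(category, fav_mean, round(other, 2), round(fav_mean - other, 2))
```

With these assumed sizes the weighted means round to the values printed in Table 2 for both categories.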

By computing t in the way described and by making a one-tailed test
we have given the word frequency hypothesis every chance of being
supported. As can be seen from the table, however, scores on only
two categories, religious and theoretical, are significantly
higher. On two of the categories, economic and political, the
obtained differences are not even in the predicted direction. Our
results, therefore, certainly do not support the hypothesis that
scores on the Allport-Vernon test or even a
modified version of it are a function solely of




word frequency. We would argue further that 
our results do not support even the less extreme 
contention that word frequency per se is a 
variable that enters into the determination of 
the value score. We do not, of course, deny 
that the frequency with which an individual 
uses or perceives a word is correlated with his 
degree of interest in the area related to the 
word; in fact, interest may well be the de- 
terminer of frequency. 

On the other hand, since two of the six 
differences are clearly statistically significant 
and a third one approaches significance, we 
feel obligated to account for these differences 
in some way that will not imply that all six 
differences should have been significant. Since 
our division of the 121 Ss into six groups was 
not entirely random, some sections systemati- 
cally getting more of a given form than others, 
and since we had Allport-Vernon-Lindzey test 
scores for all our Ss, we made an analysis of 
covariance for each of the six value areas, 
eliminating the differences between forms 
attributable to regression of scores on our test 
on the Allport-Vernon-Lindzey scores. Follow- 
ing the procedure given by McNemar (5) and 
Edwards (4), we obtained the results presented 
in Table 3. As can be seen from this table, dif- 
ferences among forms on aesthetic, religious, 
and theoretical are statistically significant at 
the .05 level or better. 

We believe that the reason for these
differences is the fact that a change in word 
frequency often produces a change in meaning. 
The word “poesy,” for example, in addition to 
being less frequent than the word “literature” 
has an obviously different meaning. In some 


TABLE 3

RESULTS OF THE ANALYSES OF VARIANCE AND
COVARIANCE FOR THE SIX CATEGORIES

            | F (ANALYSIS OF |       | F (ANALYSIS OF |
CATEGORY    |  COVARIANCE)   |   p   |   VARIANCE)    |   p
Aesthetic   |      3.44      | <.01  |      1.40      |
Economic    |       .80      |       |       .45      |
Political   |       .94      |       |       .85      |
Religious   |      2.73      | <.05  |      2.39      | <.05
Social      |      1.51      |       |      1.11      |
Theoretical |      6.59      | <.001 |      6.60      | <.001

Note. Each F is the ratio of the variance estimate based on
variation of form means to the variance estimate based on variation
within groups. The degrees of freedom are 5 (numerator) and
117 (denominator).




TABLE 4
CORRELATIONS BETWEEN SCORES ON THE ALLPORT-VERNON-LINDZEY TEST
AND SCORES ON THE SIX DIFFERENT FORMS OF THE MODIFIED TEST

        CATEGORY (ALLPORT-VERNON-LINDZEY)
FORM |  A  |  E  |  P  |  R  |  S  |  T
A    | .72 | .66 | .53 | .88 | .63 | .47
E    | .46 | .76 | .62 | .71 | .72 | .42
P    | .70 | .58 | .48 | .72 | .61 | .49
R    | .53 | .71 | .38 | .88 | .43 | .75
S    | .68 | .69 | .43 | .68 | .56 | .55
T    | .70 | .56 | .29 | .79 | .54 | .74

cases, like the above, less frequent words have 
an archaic or unattractive connotation or are 
unknown. Although we made an effort to keep 
meaning relatively constant, we found it im- 
possible to do so. 

Further, it seems to us that with a more 
careful choice of low and high frequency words, 
with special attention to their connotations, 
than we made in the construction of the various 
forms of our test, scores on the category 
favored in terms of frequency could be made 
consistently lower than scores on that same 
category on other, nonfrequency favored forms.

Our central cognitive interpretation of value and interest scores
is further favored by a correlational analysis of our data. Table 4
presents correlations between scores on the six forms of our test
and the Allport-Vernon-Lindzey for all six categories. Within a
given column the correlations are completely independent, being
based on six different groups. Within a given row the correlations
are based on the same group and therefore are not completely
independent. The fact that the coefficients are consistently
positive and, with a few exceptions, not significantly different in
size, is in accord with a central cognitive interpretation of value
scores and not at all easy to explain or predict in terms of a word
frequency hypothesis.


Discussion 


It appears from the above results that while frequency is
undoubtedly important in determining perceptual thresholds for
verbal material, it alone does not determine performance on value
tests. One might question the crucial nature of our results by
raising the criticism that individuals, on the basis of their
experience, do differ in the amount of commerce that they have with
certain words. Our answer to this criticism is twofold: (a) A
relationship was shown by Solomon and Howes (7) between duration
threshold and Thorndike-Lorge count when a group of Ss was used; if
the Thorndike-Lorge count is invalid when as many Ss are used as in
our experiment, their results are unexplained. (b) The differences
between the high frequency words and low frequency words used in
our experiment were so great that it seems implausible that, with
the number of Ss used, differences in frequency of individual usage
would not be appreciable.

However, we do not deny the importance of 
individual differences in word frequencies. 
Word frequency is not the causative agent in 
the results of perceptual threshold experiments 
dealing with verbal material, but rather a 
reflection of the past experience of the S, and 
therefore will be correlated with measures such 
as the Allport-Vernon-Lindzey test or our own 
forms of that test. Individual word frequency
then becomes merely another indicator of some
central mechanism, such as values, attitudes,
interest, or “style of life,” which is the causa-
tive factor for both probability of word usage
by that individual and his performance in
situations designed to reflect his basic belief
system.⁴

SUMMARY 


Six forms of a value-area test were constructed after the general
pattern of the Allport-Vernon to test the Solomon-Howes hypothesis
that Allport-Vernon scores could be explained as a function of the
log frequency of the answer choices. Each form favored
frequency-wise one of the six Spranger value areas. One hundred
twenty-one women enrolled in a history of philosophy course were
tested in six groups; each group received one form of our test and
the Allport-Vernon-Lindzey test.

1. The test of the Solomon-Howes frequency hypothesis failed to
show consistent changes in scores on the given value area favored
for each group, compared to the five other groups in that value
area in our test, as would be predicted by the frequency
hypothesis.

2. Correlations of our test with the Allport-Vernon-Lindzey scale
remain significantly positive even with frequency changes, as would
be predicted by the postulation of a central cognitive affective
construct which may be called “value area.”

⁴ We have reason to believe from private communication that Solomon
and Howes may be in basic agreement with us on this point.


REFERENCES 


1. Adams, J., & Brown, D. Values, word frequencies, and perception.
Psychol. Rev., 1953, 60, 50-54.

2. Allport, G. W., & Vernon, P. E. A study of values. Boston:
Houghton Mifflin, 1931.

3. Allport, G. W., Vernon, P. E., & Lindzey, G. A study of values.
(Rev. Ed.) Boston: Houghton Mifflin, 1951.

4. Edwards, A. L. Experimental design in psychological research.
New York: Rinehart, 1950.

5. McNemar, Q. Psychological statistics. New York: Wiley, 1949.

6. Postman, L., Bruner, J. S., & McGinnies, E. Personal values as
selective factors in perception. J. abnorm. soc. Psychol., 1948,
43, 142-154.

7. Solomon, R. L., & Howes, D. H. Word frequency, personal values,
and visual duration thresholds. Psychol. Rev., 1951, 58, 256-270.

8. Solomon, R. L., & Postman, L. Frequency of usage as a
determinant of recognition thresholds for words. J. exp. Psychol.,
1952, 43, 195-201.

9. Thorndike, E. L., & Lorge, I. The teacher's word book of 30,000
words. (1st Ed.) New York: Teachers Coll., Columbia Univer., 1944.


Received May 15, 1953. 




THE ASSESSMENT OF GROUP OPINION BY LEADERS, AND THEIR 
INFLUENCE ON ITS FORMATION 


GEORGE A. TALLAND 
Institute of Psychiatry, Maudsley Hospital, London 


IN A recent study Chowdhry and Newcomb
(1) tested the relative abilities of leaders
in estimating group opinion and found 
strong evidence to confirm their hypothesis 
that "leaders should be more accurate than 
non-leaders in their estimates of the attitudes 
of other members toward issues relevant to the 
group's interest." The authors advanced two 
complementary explanations to account for 
their findings. Leaders may be accorded their 
status because of their superior ability in as- 
sessing group opinion, and their more accurate 
evaluation may be due to their position which 
furnishes them with better opportunities for 
assessing the attitudes of their followers. The 
latter was not considered as sufficient explana- 
tion in itself. 

While Chowdhry and Newcomb did not in- 
dicate that only these two circumstances could 
account for their results, they certainly omitted 
considering a third which, in their experimental 
setting, appears equally probable. Leaders 
may know best the opinions of their groups 
because they, more than any other member, 
were influential in formulating these opinions. 
This line of reasoning would also apply to 
leaders more generally. Everyday experience 
in industry, in the services, in political and 
social organizations, as well as history, demonstrates
the fact that leaders are not invariably
in close touch with the opinions of their fol- 
lowers. Action leaders in a battle or mob 
situation, for instance, are likely to assume 
that role exactly through their insensitivity 
to the attitudes of those they impel with their 
commands. The purpose of the present report 
is to support the argument that leaders know 
best the opinions of their groups because by 
their prestige, authority, or other influence 
they bring their followers in line with their 
own views. 


METHOD 


The experiment formed part of a series of psychometric
studies of therapeutic groups at the Maudsley
Hospital, London.¹ Data collected for the purposes of
other researches were utilized, and only a minor modification
of the testing procedure was introduced in
order to create experimental conditions for the present
research.

1 Carried out with a grant from the Bethlem Royal
Hospital Research Fund and a University of London
Postgraduate Studentship.

The applicability of Chowdhry and Newcomb's find- 
ings to this type of group was first tested on ten psycho- 
therapeutic groups, totaling 64 members. These had 
been in treatment for varying periods of from 2 to 18 
months, and each consisted of five to eight members, 
some being all male, others all female, and others of 
both sexes. 

As in the sociometric procedure, leadership status 
was based on one's fellow group members' assessments.
The information, however, was not that of choices and
rejections on specified criteria but complete rankings of 
all other group members. Group scales were derived 
from the summation of individual ranking scales. This 
procedure was designed by F. K. Taylor (7), two of 
whose status scales were employed in the present study, 
public popularity and dominance. The individual rank- 
ings from which the popularity scale is derived are 
given in answer to a question: ". . . estimate the general
popularity of each member in the group . . . you should 
not allow your own feelings towards a particular mem- 
ber to influence your judgment." The criterion of 
dominance rankings was the degree of a member's 
domination of, and influence upon, group discussion, ir- 
respective of the helpful or harmful effect of this in- 
fluence. To these was added a new scale, leadership 
role, derived from rankings of the members according 
to the leadership they had displayed in the group. 

So as to provide for a common metric irrespective of
the size of the group, all group summated scales were
converted according to Taylor's (7) formula into t_p
scores, these being

$$t_p = 100\,\frac{t - (m - 1)}{(m - 1)(m - 2)},$$

in which t is the sum of all rank scores assigned to a
group member by his colleagues, and m, the number of
members in the group. The range of these scores is from
0 to 100, with a mean of 50. Any two or more t_p scales
can be summated to provide composite scales.
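
The conversion described above can be sketched in a few lines. This is a minimal illustration, not Taylor's published procedure: it assumes that each member is ranked by all m - 1 colleagues on ranks running from 1 to m - 1, so that the summed score t lies between m - 1 and (m - 1)^2, and it rescales t linearly to the stated range of 0 to 100 with a mean of 50. The function names and the six-member group are hypothetical.

```python
def tp_score(t, m):
    """Rescale a summed rank score t to a 0-100 score for a group of m members.

    Assumption (not stated in this form in the source): each of the m - 1
    colleagues assigns the member a rank from 1 to m - 1.
    """
    if m < 3:
        raise ValueError("need at least three members to rank the others")
    lowest = m - 1           # ranked first by every colleague
    highest = (m - 1) ** 2   # ranked last by every colleague
    if not lowest <= t <= highest:
        raise ValueError("t is outside the possible range for this group size")
    return 100 * (t - lowest) / (highest - lowest)


def composite(scores):
    """Sum several single-scale scores into a composite score."""
    return sum(scores)


# Hypothetical six-member group: five colleagues rank each member 1..5.
print(tp_score(5, 6))    # ranked first by all colleagues
print(tp_score(15, 6))   # average standing
print(tp_score(25, 6))   # ranked last by all colleagues

# Three single-scale scores of 30 give the composite cut-off of 90 used in the text.
print(composite([30, 30, 30]))
```

Because low ranks indicate high status, a low score on any of these scales marks a high-status member.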

In order to allow for the highest possible degree of 
comparability with Chowdhry and Newcomb's find- 
ings, leadership was measured by a composite scale 
which combined direct rankings for leadership role, 
dominance, and popularity. These are very similar to 
Chowdhry and Newcomb's first, second, and fourth 
sociometric criteria. No scale equivalent to their third 
criterion (worthiness of representing the group in a 
convention) could be obtained from this type of group. 
Apart from theoretical considerations which suggest 
that in this type of group, too, functional criteria of 
leadership are preferable to sociometric criteria (6), 
the composite scale also gave rise to some practical 
disadvantages. In one group no leader emerged, as the
lowest score on the composite t_p scale was above 90,
and thus too close to the mean to justify the assignment
of leadership status to its holder. When judged by the
composite scale, this was a leaderless group, although
if dominance is one's criterion, the group had a leader
scoring as low as 12.5 who, however, had high scores on
the popularity and leadership role scales. (The data
being rank scores, low scores indicate high status.) In
another group two members tied for leadership on the
composite scale, and two others scored so high (above
255) that both ought to be considered as bottom
scorers, or "isolates."

Throughout the report, leadership will be defined as
top score on the composite scale if this is below t_p =
90, which would be equal to t_p = 30 on a single scale,
and isolation as bottom score if higher than t_p = 210.
As an alternative measure, top and bottom scores on
the dominance scale will also be considered as indices
of functional leadership and its opposite.

Group opinion was probed in one of the few areas 
which could be safely assumed to be of common interest 
to all members within a group: the therapeutic useful- 
ness of various topics of discussion to the group. A list 
of 15 topics was submitted to the subjects (Ss), each 
of whom was asked to rank these individually with a 
view to their usefulness to the group as a whole.
Concordances of rankings within each group were
significantly higher than chance at the 1 per cent level of
confidence, as measured by Taylor's (7) modification of
Kendall's (4) W coefficient. In view of this, a measure
of group opinion could be obtained by summating the
individual rank scales. This summated scale was
ranked, and accuracy of evaluating group opinion was
measured by the rank correlation coefficient between
each member's individual rank scale and the ranked
summated scale. The coefficient used was rho. The
applicability of Chowdhry and Newcomb's conclusions to
the therapeutic groups was tested by examining the
positions of the most and least accurate assessors of 
group opinion on the composite scale of leadership and 
the dominance scale of each group. 
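
The scoring steps in this paragraph (summate the individual rank scales, rank the sums, then correlate each member's own ranking with the ranked summated scale) can be sketched as follows. The sketch is illustrative only. It uses the ordinary Kendall W for untied rankings, not Taylor's modification, whose details are not given here, and it computes rho as the product-moment correlation between two sets of ranks. The three-member, four-topic group is hypothetical.

```python
def ranks(values):
    """Rank values from 1 (smallest) upward; tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for pos in range(start, end + 1):
            result[order[pos]] = mean_rank
        start = end + 1
    return result


def rho(x, y):
    """Spearman's rho, computed as the Pearson correlation of two rank lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def kendall_w(rankings):
    """Ordinary Kendall's W for untied rankings (not Taylor's modification)."""
    k, n = len(rankings), len(rankings[0])
    sums = [sum(col) for col in zip(*rankings)]
    mean_sum = sum(sums) / n
    s = sum((total - mean_sum) ** 2 for total in sums)
    return 12 * s / (k ** 2 * (n ** 3 - n))


def assessment_coefficients(rankings):
    """Correlate each member's ranking with the ranked summated scale."""
    summed = [sum(col) for col in zip(*rankings)]
    group_scale = ranks(summed)
    return [rho(ranks(member), group_scale) for member in rankings]


# Hypothetical group: three members rank four topics (1 = most useful).
group = [
    [1, 2, 3, 4],
    [1, 2, 4, 3],
    [2, 1, 3, 4],
]
print(kendall_w(group))
print(assessment_coefficients(group))
```

Each member's own ranking enters the summated scale against which it is then correlated, so the coefficients are not independent of the criterion; this is worth keeping in mind when comparing members within a group.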

Up to this point ten groups provided data for the
research. The final experiment, however, could be applied
to seven groups only, with a total membership of
44. The topics were ranked by each member on the basis
of (a) personal helpfulness, and (b) helpfulness to the
group. These were submitted, and the groups were
then asked to furnish a collective rank scale which would
express the opinion of the group as a whole. In order
to provide this information, the groups were told to
discuss the matter, and the experimenter (E) left
them to proceed with the task. Three of the ten groups
were not available for this experiment, and in one of
the remaining seven groups two members could not attend
the discussion. Neither of these occupied extreme
positions on either the composite leadership or the
dominance scale.


RESULTS 


Nine leaders and nine “isolates” emerged 
from the ten composite scales. In one group 
the lowest score was above 90, in another the 
highest was below 210. None of these nine 
leaders had the highest assessment coefficient 
in his group, nor did these leaders tend to score 
in the clusters of best assessors, which are an 
artifact of the method by which group opinion 
was obtained. Two leaders made the least ac- 
curate evaluation of group opinion in their 
groups, and one of these leaders tied for top 
score on the composite scale. Two isolates gave 
the least accurate, but two others gave the 
most accurate, evaluation in their group. None 
of the most dominant members surpassed all 
his group colleagues in accuracy of assessment. 
Two gave the least accurate evaluation in 
their group, as did two of the least dominant 
members. Two bottom scorers on dominance 
made the best assessment of group opinion. 
It should be noted that each group had at
least one member who scored below t_p = 30
and one who scored above t_p = 70 on the
dominance scale. Assessment coefficients in
the ten groups are set out in Table 1, giving 
details of evaluating the summated group 
opinions. Leaders and isolates on the com- 
posite scale are marked L and I, the most and 
least dominant members D and S. The bar 
over L means a tie. 

Replacing the summated scales of group
opinion by the collective rankings we find that
the association between leadership and accuracy
of assessment becomes clearly marked.
Five of the six leaders surpass their fellow group
members in evaluating the collective opinion.
Only two of the five isolates gave the least
accurate assessment. Six of the seven top
scorers on dominance gave the best assessment
of collective opinion, only one bottom scorer
the worst. The new assessment coefficients
are set out in Table 2.


TABLE 1
COEFFICIENTS OF ASSESSMENT OF GROUP OPINION BEFORE PUBLIC DISCUSSION
I II III IV V VI VII VIII IX X
.89 IS .88 .82 7 .89 .84 .93 IS .83 .80 90
.88 .80 hue S71 L IS..98 .83 i .82 .T6 63
.82 .68 IS .73 48 LD.85 LD .76 .85 LD .72 a 5588
.81 .58 58 D .43 .80 72 .80 .44 ID.9 L.4
LD.71 LD.57 LD.57 1.42 .78 -68 .78 58 .65 IS .33
.68 .29 .54 .40 IS .40 .69. 35  S.63 LD .29
.64 .19 IS .63
IS .62 LD .48


TABLE 2
COEFFICIENTS OF ASSESSMENT OF GROUP OPINION AFTER PUBLIC DISCUSSION
I II III IV V VI VII
LD .75 LD .75 LD .69 D .55 LD .93 LD .86 .85 T
.69 .62 IS .64 41 IS .80 .84 18
.59 57 -63 S .32 .69 .78 M
.43 IS .45 .32 31 .69 .58 14
IS .23 .30 Bu .27 .61 .50 .64
-.40 .05 .12 I.16 IS .24 IS .55
-.02 LD .44
.36


DISCUSSION 


The results confirm the hypothesis that 
leaders influence the formation of group 
opinion when this emerges in the course of dis- 
cussion, and that this influence is likely to 
account to some extent for the finding that 
leaders make the most accurate evaluations 
of group opinion. Before the matter at issue 
was publicly discussed, not one of the nine 
leaders in the ten groups had assessed the 
group opinion more accurately than had other 
members. One group was leaderless according 
to the criterion adopted. 

After the question had been publicly dis- 
cussed, i.e., the psychotherapeutic groups had 
become discussion groups in the commonly 
accepted sense of the term, all but one of the 
six leaders or seven top scorers on dominance 
were found to have assessed the group's col- 
lective opinion more accurately than any of 
their colleagues. The one exception to this 
rule may be accounted for as a chance oc- 
currence, though it can be explained by the 
unusually high agreement among the indi- 
vidual opinions of the four members who fol- 
lowed the leader on the composite scale. With 
correlation coefficients ranging from .80 to 
.93 they established a group norm which hardly 
changed in the course of discussion, and from 
which the leader had markedly deviated. The 
results throw little light on the isolate, except 
that in this situation he does not tend to assess 
group opinion less accurately than other members.
The change in the accuracy of the leader's
assessment of group opinion is highly significant.
It would be difficult to account for
it otherwise than by his direct influence on the 
shaping of group opinion, whether this in- 
fluence is exerted by deliberate pressure or by 
prestige, because of more forceful advocacy 
or more clearly formulated views on the ques- 
tion at issue. 

The departing point of the present experi- 
ment was Chowdhry and Newcomb's research. 
The groups examined satisfied three of their 
four criteria. They were organized around a 
definite interest pattern, they provided a basis 
for face-to-face interaction and for phenomena 
of leadership to emerge, and their members 
were sufficiently familiar with each other to 
be able to evaluate group opinion. Leadership, 
too, was defined on criteria similar to those 
adopted by Chowdhry and Newcomb. Admittedly,
the presence of the therapist did not
allow for a formal recognition of leadership, 
but this role did emerge in a functional sense. 
The principles of the therapeutic technique 
employed (2, 8) prescribed the minimum of 
interference by the therapist, and, in fact, it 
was found (5) that the latter’s personal judg- 
ment had little influence on the shaping of his 
patients’ opinions as far as the usefulness of 
the topics of discussion is concerned. While 
psychotherapeutic groups are not meant to be 
representative of groups in general, nor rank- 
ings of discussion topics for their usefulness 
a typical example of collective opinion, there 
is no reason to doubt that what has been found 
in these groups would also occur in others. If
the influence of leaders in shaping group 
opinion is so marked when their status, far 
from being officially recognized, is established 
under the shadow of an appointed authorita- 
tive conductor, it is likely that official leaders 
who discharge their functions as a matter of 
duty would exert at least as great an influence.
Also, if this influence is so effective, after each
member had first to form his opinion in- 
dependently on the same issue and has there- 
fore some motivation to resist prestige sug- 
gestion, it may be expected that the leader's 
influence would be at least as effective when 
opinions in a group are initially formed by 
means of public discussion. 

The conclusions drawn from this study are 
intended to complement those of Chowdhry 
and Newcomb and the discordant findings of 
Hites and Campbell (3). The results suggest 
that the leader's more accurate evaluation of 
group opinion need not be attributed pri- 
marily to his more sensitive perception of atti- 
tudes and atmosphere in the group, nor to this 
factor combined with his greater opportunities 
of communication. They also suggest that 
publicity rather than familiarity is the opera- 
tive attribute of opinions in this context. In 
order to discover the circumstances by which 
leaders arrive at more accurate evaluations of 
group opinion than do followers, investigations 
carried out after such opinions had been 
formulated by public discussion fail to assess 
the leader's influence on the outcome of these 
discussions. Concerning Chowdhry and New- 
comb's second hypothesis, i.e., that the leaders’ 
more accurate assessment of group opinion 
may be due to their better opportunities for 
communication, though this factor would not 
in itself account for the finding, the present 
experiment gives support to the qualifying 
clause. Theoretically all members of a thera- 
peutic group have the same opportunities of 
communication. In fact, leaders communicate 
more frequently and more explicitly to others 
and receive more communications from their 
fellow group members. It would therefore 
have been expected that their views should 
be most influential on judging group opinion 
before its explicit formulation. The results did 
not confirm this expectation. 


SUMMARY 


Leadership status was defined and established
for each member of ten psychotherapeutic
groups on a combined criterion of
dominance, popularity, and leadership role.

By asking the members of these groups to 
rank 15 topics of discussion in the order of 
their usefulness in therapy, group opinions 
and individual evaluations of these group 
opinions were obtained on a matter of common 
interest. 

It was found that leaders were not more ac- 
curate in assessing group opinion than were 
other members, before a public discussion in the 
course of which group opinion was formulated. 

When groups were asked to discuss the rela- 
tive usefulness of these topics publicly and to 
present a collective ranking, it was found that 
the collective opinion was, in six cases out of 
seven, closer to the leader’s individual opinion
than to those of other members. 

The results confirm the hypothesis that 
leaders influence the formation of group 
opinion, bringing this in line with their personal 
views. It is suggested that this influence should
be considered in accounting for the finding 
that leaders evaluate group opinion more ac- 
curately than other members. 


REFERENCES 


1. CHOWDHRY, K., & NEWCOMB, T. M. The relative
abilities of leaders and non-leaders to estimate
opinions of their own groups. J. abnorm. soc.
Psychol., 1952, 47, 51-57.

2. FOULKES, S. H. Introduction to group-analytic
psychotherapy. London: Wm. Heinemann Medical Books,
1948.

3. HITES, R. W., & CAMPBELL, D. T. A test of the
ability of fraternity leaders to estimate group
opinion. J. soc. Psychol., 1950, 32, 95-100.

4. KENDALL, M. G. Rank correlation methods. London:
Charles Griffin, 1948.

5. TALLAND, G. A. Leadership in psychotherapy groups.
Paper read at the Conference of the Brit. Psychol.
Soc., 1953.

6. TALLAND, G. A. Étude du leadership dans les groupes
de psychothérapie. Bull. de Psychol., 7, 224-228
(Paris: Sorbonne), 1954.

7. TAYLOR, F. K. Quantitative evaluation of psycho-
social phenomena in small groups. J. ment. Sci.,
1951, 97, 690-717.

8. TAYLOR, F. K. On some principles of group therapy.
Brit. J. med. Psychol., 1952, 26, 128-134.


Received May 18, 1953. 




PREDICTION IN THE CLINICAL METHOD AND INTERRELATIONS 
OF BIOCHEMISTRY, PSYCHIATRY, AND PSYCHOLOGY¹


ELI ROBINS AND IVAN N. MENSH 
Washington University School of Medicine 


AT THE 1950 annual meeting of the
Canadian Psychological Association,
C. R. Myers (29) took clinical psychology
to task for too little use of the "prediction-test"
method in clinical work. The scientific
method of hypothesis-prediction-verification 
falls by the wayside when only the first step 
is taken, and Myers was concerned that “in 
the clinical setting, we are forever taking the 
first step—guessing—without being able to 
(or perhaps without daring to) follow it up 
with the next two steps—predicting and 
verifying.” His search of the Psychological 
Abstracts for the years 1947-1949 revealed 
over 16,000 titles, but only 24 were listed 
under the entry “Prediction.” Of the latter, “16 
appeared in 1947, 5 in 1948, and only 3 in 
1949. It would not appear that they are in- 
creasing in frequency either.” Most of the 24 
studies referred to the prediction of success at 
school, college, in marriage, or on parole; two 
used the clinical interview method for pre- 
dicting success in pilot training; and only two 
were “more strictly ‘clinical’ studies.” 
Among the reasons Myers sees for the infre- 
quent use of prediction are (a) “the lurking 
suspicion that it is really not possible"; (b) 
group- rather than individual-centered statis- 
tics (26, 45); and (c) “the ordinary practical 
difficulties of using prediction effectively as a 
routine procedure.” Further, if prediction is 
indeed the criterion of validity in science, 
“and if validation of current theory and prac- 
tice is the most urgent need in clinical psychol- 
ogy, it is high time we made more deliberate 
use of prediction as a means of increasing the 
precision of our clinical understanding.” Suggestions
for clinical predictions were made by
Myers and included the routine use, not only in
research, of “the spiral pattern of
observe-guess-predict-check-observe-guess-predict-check”;
explicit statements, in definite and quantitative
terms, and records of clinical predictions in
a form which may later be checked by
others; actual checking of predictions against
outcome and examination of errors in prediction;
and short-term predictions, which are
easier, more profitable, and allow more prompt
checking and revision.

1 This paper is one of a series of papers in a symposium
presented at the 1951 annual meeting of the
American Psychological Association.

Psychologists have not been entirely remiss, 
however, in studies of prediction. In May
1946 Malamud (25) summarized a number of
studies designed to predict response to shock 
treatment; last year Rosenzweig (36) sug- 
gested a method of validation by successive 
clinical predictions; the Minnesota group 
developed an empirical scale of the MMPI (10) 
for selecting patients who respond successfully 
to convulsive shock therapy; statistically-oriented
clinicians and clinically-oriented
statisticians have explored new methods for 
treating clinical data (26, 45); and for some 
years Wittman (44) has, with her Prognostic 
Rating Scale, studied prognosis in hospitalized 
psychotics. Also, as Myers noted in his presi- 
dential address (29), Horst’s monograph “The 
Prediction of Personal Adjustment” (15), and 
Sarbin’s review of “The Logic of Prediction 
in Psychology” (37) present relevant lessons 
and rationale. 

The study of validation and prediction is 
not alone a concern of the psychologist, and 
other students of human behavior have turned 
to research in this area. Biochemistry, psy- 
chiatry, and other branches of medicine now 
are investigating predictive indices, as witness 
the Cornell group’s “A Clinical Guide to Progno- 
sis in Stress Diseases” (3), Kline and Tenney’s 
study of Sheldon’s somatotypes (19), and 
stress studies (39) in various diagnostic groups. 
For example, the ultimate aim of biochemical 
studies concerned with the prediction of be- 
havior is to be able, from a single chemical 
finding or a pattern of chemical findings, to 
predict important and relevant behavioral 
characteristic(s) of the organism in question. 
The present paper gives a number of examples 
of the prediction of behavior in the clinical 
situation utilizing the interrelation of bio- 
chemical and physiological techniques, and 
their psychological and psychiatric correlates.
These are selected to illustrate the contribution
of predictive methods from the fields of
biochemistry, psychiatry, and psychology.

Before discussing any detailed experimental 
work, two examples will be presented to point 
out some of the problems involved in predic- 
tion. First, behavioral changes occurring in a 
severe hypoglycemic reaction in relation to a 
low blood sugar, resulting in coma, can be 
predicted with almost complete certainty. The 
individual in question will have seriously 
disturbed brain function manifesting itself in 
disorientation, confusion, and marked dis- 
turbances of behavior leading to coma. This 
is a very precise prediction in terms of the 
relation of blood sugar level to behavioral 
disturbances. It is also an extremely important 
one, both in terms of therapy and in the in- 
ferences which may be drawn about metab- 
olism in the brain. In this example one makes a 
nearly exact individual prediction. 

The second example concerns the relation 
of the level of blood lipids to probability of 
recovery from a schizophrenic or manic- 
depressive psychosis. As will be discussed later, 
it is known that, in general, high blood lipid 
level is associated with a high probability and 
low blood lipids with a low probability of 
recovery from either of these diseases (13).
However, while the relationship between high
blood lipids and recovery is highly significant,
in schizophrenia only 55 per cent of those
with high blood lipids were well five years 
after they were first examined. Consequently, 
for any given schizophrenic patient with high 
lipid level one cannot predict whether or not 
he will recover, although the probability is 
much greater than if he had a low lipid level. 
This is in sharp contrast to the first example 
of extremely low blood sugar in which prac- 
tically every case affected could be predicted. 
Since a 100 per cent predictive accuracy is rare, 
the problem that arises is one of evaluating the 
usefulness and significance of the tests that 
can give an answer only in terms of more 
probable or less probable. 
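
The contrast between the two examples can be put in rough numerical terms. The sketch below is an illustration only, assuming that the best single-case prediction is simply the more probable outcome; the 55 per cent figure is the one quoted above for schizophrenic patients with high blood lipids, and the function name is hypothetical.

```python
def best_single_case_accuracy(p_outcome):
    """Expected hit rate when the more probable outcome is predicted for every case."""
    if not 0.0 <= p_outcome <= 1.0:
        raise ValueError("p_outcome must be a probability between 0 and 1")
    return max(p_outcome, 1.0 - p_outcome)


# High-lipid schizophrenic patients: 55 per cent were well after five years.
print(best_single_case_accuracy(0.55))
```

Predicting recovery for every such patient would be correct in only a little more than half of the cases, which is why a highly significant group relationship may still say little about the individual.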

There is a second major problem: What 
kind of behavior are we going to predict? In 
the first example of low blood sugar, an almost 
perfect predictive accuracy was achieved if we 
defined the behavior to be predicted as disorientation,
confusion, and marked disturbances
in behavior leading to coma. On the
other hand, if we wished to predict whether
the patient during his behavioral disturbance 
will hallucinate, dance a jig, curse the boss, 
lose his temper, or speak lovingly to someone 
he ordinarily hates (all these behaviors have 
been observed in hypoglycemic reactions), we 
cannot do so accurately from the level of the 
blood sugar. We must, therefore, be careful to 
define exactly what we wish to predict. 

Various experiments concerned with differ- 
ent kinds of prediction will be discussed. These 
kinds of prediction include: (a) Does a given 
test predict that a certain disease or behavior 
not present now will occur in the future? 
(b) Does a given test predict that a certain 
disease or behavior is present at the time of 
the test? (c) Does a given test predict recovery 
or failure to recover from a given condition? 
(d) Can we predict the effects of lack of or 
excess of a given drug or other chemical sub- 
stance? It may be seen that diagnosis and 
treatment are involved here as well as prog- 
nosis. 


TOXIC AND NUTRITIONAL FACTORS IN
BEHAVIOR


Table 1 summarizes the behavioral effects
of a lack of specific nutritional factors and of
an excess of certain drugs. These effects are
variable, but they generally lead to psychotic
behavior. What does this mean in terms of
prediction? It can be predicted that the majority
of individuals on an experimentally
deficient thiamin diet will develop the indicated
symptoms (17, 42, 43), that a very small
percentage will develop behavioral symptoms
of B12 lack (1) (assuming the brain lesions of
pernicious anemia are due to vitamin B12
lack), and that an appreciable but unknown
percentage will develop symptoms of nicotinic
acid lack (6, 18). Further, early replacement
of the missing substance will relieve the symptoms
in most cases. In addition, these findings
are important for research in that they give a
lead concerning the chemical metabolism of
the brain. For example, all the vitamins mentioned
are important coenzymes in carbohydrate
metabolism (23). This is, therefore,
another bit of evidence concerning the crucial
importance of carbohydrate metabolism in
brain function.


TABLE 1
THE EFFECTS OF SOME TOXIC DRUGS AND LACK OF
CERTAIN VITAMINS ON BEHAVIOR IN MAN

SPECIFIC VITAMIN LACKED       BEHAVIORAL RESULT
Thiamin                       Fatigue, nervousness, work
                              inefficiency, palpitation
Niacin                        Delirious psychosis*
Vitamin B12                   Delirious psychosis

SPECIFIC DRUG ADMINISTERED    BEHAVIORAL RESULT
Bromides                      Delirious psychosis
Atropine                      Delirious psychosis
Mescaline                     Hallucinations, delusions,
                              anxiety, finally delirious
                              psychosis
Alcohol                       Inebriation

* Delirious psychosis = disorientation, confusion, delusions,
hallucinations, unusual behavior, and (in some cases) stupor.

In the second group of cases, the majority 
of subjects will develop symptoms after a 
sufficient dose of bromides (21), alcohol (14), 
atropine (20), or mescaline (40). It is also 
true that a certain small number of subjects 
will develop symptoms on a low dosage of 
these drugs. However, since little is known of 
the mechanism of action of these drugs, we 
cannot predict which subjects will be the ones 
to develop symptoms on a low dosage, but 
can only predict that the great majority of 
subjects will develop symptoms if given dos- 
ages above a certain minimal level. 


SOME BIOCHEMICAL ASPECTS OF MENTAL
DEFICIENCY


Two diseases are presented here in illustration:
kernicterus and phenylpyruvic oligophrenia.
Kernicterus, which is associated with
mental deficiency in many cases, is the
neuropsychiatric complication of erythroblastosis
fetalis (9, 41). Erythroblastosis fetalis has an
immunochemical basis. The most common
(among over 90 per cent of patients)
immunologic constellation leading to the disease
is an Rh-positive father and child and an
Rh-negative mother. The Rh-negative mother
becomes immunized during pregnancy by the
Rh antigen present in the Rh-positive fetus.
As a result, the mother forms Rh antibodies
which pass through the placenta to cause
erythroblastosis fetalis in the child (4). About
10 per cent of children with erythroblastosis
fetalis develop kernicterus (41). It cannot be
predicted which of the children will develop
kernicterus, but as a result of the immunologic
studies (the absence of the proper immunologic
constellation) it can be stated that certain
children will not develop erythroblastosis
fetalis, and therefore cannot develop its
complication, kernicterus.


TABLE 2
STUDIES OF PALMAR SWEATING

                                    SWEATING RESPONSE*
SUBJECTS†                  N        STRONG (%)   WEAK (%)
Psychiatric patients       100      84           16
Nonpsychiatric patients    1160     83           17
Hospital personnel         21517:   32           68

                                    MEAN AMOUNT OF
SUBJECTS‡                  N        SWEAT (mg.)
Anxiety neurosis patients  33       19.1
Normal soldiers            54       18.0
Hospital personnel         30       9.1

* All differences shown are significantly different, p < .05, except
between samples of anxiety neurosis patients and normal
soldiers.

† Silverman and Powell (38).

‡ Cohen (7).

The other disease, phenylpyruvic oligophrenia,
is a very satisfying one so far as
predictive ability is concerned. Without knowing
or seeing the patient, one can predict the
presence of mental deficiency in every case in
which a chemical test for phenylpyruvic acid in
the urine is positive (30). Yet we cannot predict
which unborn children will have the disease,
nor can anything be said about the relation of
the chemical to the brain abnormality or to the
severity of mental deficiency.


STUDIES ILLUSTRATING A RELATIONSHIP BETWEEN
BIOCHEMICAL FINDINGS AND
PERSONAL, SUBJECTIVE FEELINGS

In the upper part of Table 2 are shown some 
early results in a study of palmar sweating (38) 
among patients in an army hospital. The 
sweating was measured by a nonquantitative 
method, which indicated that psychiatric 
patients and nonpsychiatric medical or surgi- 
cal patients had more palmar sweating than 
did hospital personnel. In the lower part of 
Table 2 is shown another study (7) with 




essentially the same results. This latter study 
extended the results of the earlier study in 
three ways: a quantitative method was used, 
the psychiatric patients were carefully selected 
with a single diagnosis, and normal soldiers 
rather than other patients were used as control 
subjects. The differences between patients 
and soldiers were not significant, but those 
between either of these groups and hospital 
personnel were significant. It is commonly 
stated that patients with anxiety neurosis 
have sweaty palms. However, these studies 
show that nonpsychiatric patients and normal 
soldiers have the same amount of sweatiness. 
On the other hand, hospital personnel did not 
show this phenomenon. It may be inferred, 
therefore, that sweaty palms do not neces- 
sarily signify anxiety neurosis, but may be 
related to such personal feelings as familiarity 
of the situation. These studies indicate the 
possible future value of biochemical studies in 
analyzing and describing subtle psychologic 
states. 


THE USE OF RESPONSES TO AUTONOMIC DRUGS
IN PREDICTING IMPROVEMENT FOLLOWING
ELECTROSHOCK THERAPY (EST)


The next two studies present a more con- 
ventional kind of prediction than the preceding 
ones. A group of 100 patients, chiefly schizo- 
phrenic and manic-depressive, composed the 
experimental group (11). Before and after a 
course of EST they were tested on separate 
days with intravenous epinephrine and intra- 
muscular mecholyl. Their blood pressure, 
psychologic responses, and somatic responses 
(pallor, sweating, and so on) were followed. 
The patients were classified in two different 
ways: (a) Seven different groups according to 
their systolic blood pressure response to mech- 
olyl and epinephrine. Groups I to VI were 
classified on the basis of marked, moderate, or 
mild reactions to epinephrine and mecholyl. 
For example, Group I had a marked reaction 
to epinephrine (marked increase in blood 
pressure) and a mild reaction to mecholyl 
(very slightly decreased blood pressure). 
Group VII had a chill after the administration 
of mecholyl. (b) Four different groups accord- 
ing to whether anxiety was produced or re- 
lieved and whether a shaking chill after mecho- 
lyl occurred (Table 3). The favorable groups 
were VI, anxiety precipitated by mecholyl 


ELI ROBINS AND IVAN N. MENSH


TABLE 3
USE OF PSYCHOLOGIC AND PHYSIOLOGIC RESPONSES
TO MECHOLYL AND EPINEPHRINE TO PREDICT
RESULTS OF EST*

                                                IMPROVED
                                                AFTER
RESPONSE TO DRUGS                        N      EST (%)
1. Blood pressure changes
   a. Groups I-V                         47     13
   b. Group VI                           34     82
2. No anxiety precipitated               6
3. Anxiety precipitated by
   a. epinephrine only                   16     19
   b. mecholyl only                      10     100
4. Anxiety decreased by mecholyl         25     88
5. Chill after mecholyl (Group VII)      13     100

* Funkenstein, Greenblatt, and Solomon (11).


only, anxiety decreased by mecholyl, and chill 
after mecholyl (Group VII). These results are 
interesting and provocative; however, they 
have not yet been confirmed in other labo- 
ratories, nor has there been reported a follow- 
up study long enough to be reliable. 


PREDICTION OF RECOVERY IN SCHIZOPHRENIA
AND MANIC-DEPRESSIVE PSYCHOSIS UTILIZING
THE MEASUREMENT OF BLOOD
LIPIDS AND BODY BUILD


An early study of Gildea et al. (12) showed 
a significant association in normal men of a 
pyknic physique, high serum fatty acids, and a 
high energy output, and a significant asso- 
ciation of a leptosomic physique, low serum 
fatty acids, and low energy output. Gildea and 
Man (13) then followed this with a prognostic 
study of schizophrenia and manic-depressive 
psychosis. Table 4 shows that the patient ma- 
terial was divided into four groups on the 
basis of a five year follow-up. There was a sick 


TABLE 4
PREDICTION OF CAPACITY FOR RECOVERY IN SCHIZOPHRENIC
AND MANIC-DEPRESSIVE PATIENTS (13)*

                   HEALTH                       SERUM FATTY    SERUM
                   AFTER               PYKNIC   ACIDS          CHOLESTEROL
                   FIVE                PHYSIQUE >12.5 meq./L   >200 mg.%
DIAGNOSIS          YEARS      N        (%)      (%)            (%)
Schizophrenia      sick       30       10       7              3
Schizophrenia      well       22       23       55             45
Manic-depressive   sick       44       44       6              6
Manic-depressive   well       59       59       73             91

* All differences shown are significant, p < .05.


PREDICTION IN THE CLINICAL METHOD 


and a well schizophrenic group and a sick and a 
well manic-depressive group at the end of this 
time. Within the diagnostic groups there was a 
significant association of pyknic physique, 
high serum fatty acids, and high serum cho- 
lesterol with a high capacity for recovery. 
This was a striking statistical phenomenon, 
although in an individual case it might well 
fail to predict capacity for recovery accurately. 
There were only a few exceptions to the rule 
that patients with high blood lipids had a high 
capacity for recovery. 


CHEMICAL CHANGES IN THE BRAIN ITSELF 


An implicit assumption underlies these bio- 
chemical studies of behavior—behavior, sub- 
jective or otherwise, is associated with bio- 
chemical changes in the brain. So far, the chief 
evidence that this is a fact rather than only an 
assumption is contained in a series of papers 
(8, 31, 32, 33) published since 1948 (Table 5). 
These papers present a series of experiments 
performed on rats which were “emotionally 
excited" by inducing repetitive loss of support 
for about four minutes. The whole brain was 
then analyzed for the chemical constituents 
listed in the table. There were significant 
changes in three components: lactic acid, 
acetyl choline, and phosphocreatine. The other 
components did not show an appreciable 
change. These changes do not seem to form 
any coherent pattern that allows unequivocal 
inferences concerning the mechanisms involved 
in the brain during emotional excitement. One 
difficulty in this kind of study is that the whole 
brain must be used for analysis. Since the 
brain is so complex anatomically, the use of 
the whole brain may obscure crucial changes 
within a small group of cells, and consequently 
makes the interpretation of any changes found 
uncertain. 

One approach used to analyze smaller pieces 
of brain tissue is that initiated by Caspersson 
(5) in Sweden, and applied to psychiatric 
problems by Hydén (16). This method achieves 
ultra-violet absorption spectrophotometry of 
individual cells. By means of a complex microscopal
setup, ultra-violet light is thrown on
various parts of the cell. The resulting absorp- 
tion spectrum results from the presence of 
proteins and nucleic acids in the cell. Hydén 
(16) reported significant changes in the absorp- 
tion spectrum of nerve cells in schizophrenia. 




TABLE 5
EFFECTS OF EMOTIONAL EXCITEMENT ON VARIOUS
CHEMICAL CONSTITUENTS OF WHOLE RAT BRAINS

                                   VALUE     VALUE
CHEMICAL CONSTITUENT               mg.%      mg.%
Lactic acid (33)                   8.8       37.5*
Acetyl choline (µg/gm) (31)        1.25      0.87*
Phosphocreatine (8)                .8        11.1*
Ammonia (32)                       0.28      0.22
Glutamine (32)                     79.0      78.0
Inorganic phosphate (8)            13.8      14.0
Adenosine triphosphate (8)         14.9      14.9
Hexose phosphate (8)               16.8      16.7
Total phosphate (8)                55.3      56.7

* These differences are significant, p < .05.


Although this work has been criticized by 
Meyer (27) and MacKinnon (24), this, of 
course, does not mean that the cytochemical 
method as such cannot be applied to psychi- 
atric problems. 

The chief difficulty of this spectrophoto- 
metric method of Caspersson is its limitation 
to the measurement of only proteins, some 
lipids, and nucleic acids. Another approach to 
the examination of small pieces of brain tissue 
is that initiated by Linderstrøm-Lang (22)
for the stomach. These methods are being 
elaborated for the brain in the Department of 
Pharmacology, Washington University School 
of Medicine and in the Department of Neuro- 
psychiatry (34, 35). The techniques depend 
upon the possibility of analyzing minute bits 
(micrograms) of brain tissue for chemical 
constituents, including enzymes. Because only 
minute amounts of tissue are necessary, a 
relatively precisely defined part of the brain 
may be studied, and, therefore, the objection 
of working on heterogeneous areas of the brain 
is overcome. For example, the layers of the 
cerebellum (34) can be analyzed separately 
so that it is possible not only to compare one 
layer with another, but individual layers of 
this region can be compared with histologically 
similar layers of other areas of the brain. It 
is our hope eventually to apply these methods 
to the study of human psychiatric disease. 


SOME METHODOLOGIC CONSIDERATIONS INVOLVED
IN PREDICTING BEHAVIOR


Biochemical tests generally give precise, 
quantitative, and reproducible results. The use 


2 Lowry, O. H., et al. Unpublished data.




of such valid and quantitative tests in investi- 
gating human behavior does not lessen the 
need for careful, controlled observation of the 
behavior. Since few biochemists are trained in 
the observing and interpreting of human be- 
havior, the importance of this aspect of the 
study is often overlooked by them. This idea 
that the use of a biochemical test in a study 
automatically insures the excellence of the 
study is held not only by biochemists but also 
by psychologists and psychiatrists. For this 
reason, certain methodologic problems should 
be emphasized. Without attention to these 
problems, the interpretation of the biochemical 
test may become meaningless in terms of its 
relation to human behavior. 

The first problem arises in selecting the 
experimental group. The group must be se- 
lected by criteria defined in such a fashion 
that they are precisely delineated, understand- 
able, and reproducible by other workers who 
may desire to select a similar group in order 
to repeat or extend the experiments. The 
important issue is that in speaking of diag- 
nosis, psychologic state, physiologic state, or 
socioeconomic state, we define them in an 
unequivocal fashion that is so clear that other 
workers can read what we have done and re- 
produce the work. This means we must sacrifice 
a certain intuitive subtlety and study a “nu- 
clear” group, i.e., one that fits our definition 
unequivocally, even though this excludes 
borderline cases who will some day be known 
to belong to the same group when an objective, 
quantitative method of selection is discovered. 

The second problem arises in the selection 
of criteria for classifying the predicted behavior.
This has already been mentioned above
in the discussion of low blood sugar. There we 
contrasted our excellent ability to predict the 
occurrence of a confusional psychosis with our 
much inferior ability to predict precisely what 
the behavior of a given patient may be while 
confused because of a low blood sugar. We 
must define our criteria quite carefully, there- 
fore; otherwise the data obtained can be dis- 
torted to support or attack any idea we may 
like or dislike at the moment. As a part of the 
problem of the definition of the predicted be- 
havior, considerations about the significant 
aspects of the behavior in time are relevant. A 

follow-up study on such serious and chronic 




diseases as schizophrenia or manic-depressive 
psychosis that follows the patients only one 
month or even six months may be of limited 
value, but more probably is even worse than 
being of limited value—it may actually be mis- 
leading. In the case of psychiatric disease and 
psychologic behavior of normal subjects, so 
little is known about the natural history of 
psychiatric disease or of normal behavior that 
it is often difficult to decide how long a follow- 
up is indicated. In this circumstance it seems 
wiser to use much longer follow-up studies 
than are now used in most instances. 

The third problem concerns the predictive 
technique utilized. This requires the use of a 
technique that gives unequivocal, consistent 
results which do not require special interpre- 
tation in the light of the predictive study being 
done. This, it would seem, would preclude the 
use of uncontrolled clinical observation, and 
would preclude the use of complicated tests 
that have not been carefully validated. 

The fourth consideration involves the 
method used in collecting the data about be- 
havior. The chief point of emphasis is that 
poorly planned experiments and poorly col- 
lected data do not lend themselves to precise 
interpretation or to valid statistical analysis. 
For example, when the statement that a certain 
per cent of a given group show “hostility” is 
made, unless there was a standard reproducible 
way of deciding whether “hostility” was 
present or not, and unless this method was 
used routinely on every patient, the calculation 
of a percentage is misleading. Finally, there is 
the area (2, 28) of the often omitted but 
crucial factors involved in cross validation and 
validity generalization. It is this application 
of an apparently substantiated predictive 
hypothesis to additional samples of the re- 
search population which provides the most 
rigorous test of the validity of our predictive 
hypothesis. 
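
The cross-validation step described above can be illustrated with a minimal Python sketch. Everything in it is hypothetical: the scores, the outcomes, and the helper names `fit_cutoff` and `hit_rate` are invented for illustration and are not taken from any study cited here. A cutoff is chosen on a derivation sample and then applied, unchanged, to a second sample.

```python
from typing import List, Tuple

# Hypothetical record layout: (test score, improved at follow-up?)
Case = Tuple[float, bool]


def hit_rate(cutoff: float, cases: List[Case]) -> float:
    """Fraction of cases in which 'score >= cutoff' agrees with the outcome."""
    hits = sum((score >= cutoff) == improved for score, improved in cases)
    return hits / len(cases)


def fit_cutoff(cases: List[Case]) -> float:
    """Choose the observed score that gives the best hit rate on this sample."""
    candidates = sorted({score for score, _ in cases})
    return max(candidates, key=lambda c: hit_rate(c, cases))


# Invented data: the rule is derived on one sample ...
derivation = [(1.0, False), (2.0, False), (3.0, True), (4.0, True)]
# ... and tested on a fresh sample drawn from the same population.
holdout = [(1.5, False), (2.5, True), (3.5, True), (2.8, False)]

cutoff = fit_cutoff(derivation)
derivation_rate = hit_rate(cutoff, derivation)
holdout_rate = hit_rate(cutoff, holdout)
print(cutoff, derivation_rate, holdout_rate)
```

In this invented example the rule fits the derivation sample perfectly but is less accurate on the second sample, which is the kind of shrinkage that cross validation is meant to expose.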


SUMMARY
A series of illustrations of predictive meth- 
ods reviews some of the techniques of bio- 
chemistry and physiology in their application 
to psychological and psychiatric problems. The 
contributions of these interrelationships are 
indicated, together with methodologic prob- 

lems of experimental control. 


REFERENCES

1. ADAMS, R. D., & KUBIK, C. S. Subacute degeneration
of the brain in pernicious anemia. N. Eng.
J. Med., 1944, 231, 1-9.

2. AULD, F., JR., & ERON, L. D. The use of Rorschach
scores to predict whether patients will continue
psychotherapy. J. consult. Psychol., 1953, 17,
104-109.

3. BERLE, B. B., PINSKY, R. H., WOLF, S., & WOLFF,
H. G. A clinical guide to prognosis in stress
diseases. J. Amer. med. Ass., 1952, 149,
1624-1628.

4. CAPPELL, D. F. The mother-child incompatibility
problem in relation to nervous sequelae of
haemolytic disease of the newborn. Brain, 1947,
70, 486-494.

5. CASPERSSON, T. O. Cell growth and cell function:
a cytochemical study. New York: Norton, 1950.

6. CLECKLEY, H. M., SYDENSTRICKER, V. P., &
GEESLIN, L. E. Nicotinic acid in the treatment
of atypical psychotic states associated with
malnutrition. J. Amer. med. Ass., 1939, 112,
2107-2110.

7. COHEN, M. E. Studies of palmar hand sweat in
healthy subjects and in patients with neurocirculatory
asthenia (anxiety neurosis, neurasthenia,
effort syndrome) with a description of a
simple quantitative method. Amer. J. med. Sci.,
1950, 220, 496-507.

8. DAWSON, R. M. C., & RICHTER, D. Effect of stimulation
on the phosphate esters of the brain.
Amer. J. Physiol., 1950, 160, 203-211.

9. EVANS, P. R., & POLANI, P. E. The neurological
sequelae of Rh sensitization. Quart. J. Med.,
1950, 19, 129-149.

10. FELDMAN, M. J. A prognosis scale for shock
therapy. Amer. Psychologist, 1948, 3, 348.
(Abstract)

11. FUNKENSTEIN, D. H., GREENBLATT, M., & SOLOMON,
H. C. A test which predicts the clinical
effects of electric shock treatment on schizophrenic
patients. Amer. J. Psychiat., 1950, 106,
889-901.

12. GILDEA, E. F., KAHN, E., & MAN, E. B. The relationship
between body build and serum lipoids
and a discussion of these qualities as pyknophilic
and leptophilic factors in the structure of the
personality. Amer. J. Psychiat., 1936, 92, 1241-60.

13. GILDEA, E. F., & MAN, E. B. Methods of estimating
recovery in patients with manic-depressive
and schizophrenic psychoses. Amer. J.
Psychiat., 1943, 99, 496-506.

14. GOLDBERG, L. Quantitative studies on alcohol
tolerance in man. Acta physiol. Scand., 1943, 5,
Suppl. 16, 1-128.

15. HORST, P. (Ed.) The prediction of personal adjustment.
Soc. Sci. Res. Coun. Bull., 1941, No. 48.

16. HYDÉN, H., & HARTELIUS, H. Stimulation of the
nucleoprotein-production in nerve cells by
malonitrile and its effect on psychic functions in
mental disorders. Acta psychiat., Kbh., 1948,
Suppl. 48, 1-117.




17. JOLLIFFE, N., GOODHART, R., GENNIS, J., & CLINE,
J. K. The experimental production of vitamin B1
deficiency in normal subjects: the dependence of
urinary excretion of thiamin on the dietary intake
of vitamin B1. Amer. J. med. Sci., 1939, 198,
198-211.

18. JOLLIFFE, N., WORTIS, H., & STEIN, M. H. Alcoholic
encephalopathies and nutrition. In E. M.
Jellinek (Ed.), Alcohol addiction and chronic
alcoholism. New Haven: Yale Univer. Press,
1942.

19. KLINE, N. S., & TENNEY, A. M. Constitutional
factors in the prognosis of schizophrenia. Amer.
J. Psychiat., 1950, 43, 240-258.

20. KRANTZ, J. C., JR., & CARR, C. J. The pharmacologic
principles of medical practice. Baltimore:
Williams & Wilkins, 1949.

21. LEVIN, M. Experimental bromide intoxication: a
criticism. Amer. J. Psychiat., 1951, 108, 309-310.

22. LINDERSTRØM-LANG, K., HOLTER, H., & OHLSEN,
A. S. Studies in enzymatic histochemistry: XIII.
The distribution of enzymes in the stomach of
pigs as a function of its histological structure.
Compt. rend. Trav. Lab. Carlsberg, Serie Chim.,
1935, 20, 66-125.

23. LIPMANN, F. Biosynthetic mechanisms. Harvey
Lect., 1948, 24, 99-123.

24. MACKINNON, I. H., HOCH, P. H., CAMMER, L., &
WAELSCH, H. B. The use of malonitrile in the
treatment of mental illness: preliminary report.
Amer. J. Psychiat., 1949, 105, 686-688.

25. MALAMUD, D. I. Objective measurement of clinical
status in psychopathological research. Psychol.
Bull., 1946, 43, 240-258.

26. MENSH, I. N. Statistical techniques in present-day
psychodiagnostics. Psychol. Bull., 1950, 47,
475-492.

27. MEYER, A., & MEYER, M. Nucleoprotein in nerve
cells of mental patients: a critical remark. J.
ment. Sci., 1949, 95, 180-181.

28. MOSIER, C. I. Problems and designs of cross-validation.
Educ. psychol. Measmt, 1951, 11, 5-11.

29. MYERS, C. R. Prediction in clinical psychology.
Canad. J. Psychol., 1950, 4, 97-108.

30. PENROSE, L. S. The biology of mental defect. New
York: Grune & Stratton, 1949.

31. RICHTER, D., & CROSSLAND, J. Variation in acetyl
choline content of the brain with physiological
state. Amer. J. Physiol., 1949, 159, 247-255.

32. RICHTER, D., & DAWSON, R. M. C. Brain metabolism
in emotional excitement and in sleep.
Amer. J. Physiol., 1948, 154, 73-79.

33. RICHTER, D., & DAWSON, R. M. C. The ammonia
and glutamine content of the brain. J. biol.
Chem., 1948, 176, 1199-1210.

34. ROBINS, E., & SMITH, D. E. A quantitative histochemical
study of eight enzymes of the cerebellar
cortex and subjacent white matter. Proc. Ass.
Res. nerv. ment. Dis., 1953, 32, 305-327.

35. ROBINS, E., SMITH, D. E., & MCCAMAN, R. E.
Microdetermination of purine nucleoside phosphorylase
activity in brain and its distribution
within the monkey cerebellum. J. biol. Chem.,
1953, 204, 927-937.




36. ROSENZWEIG, S. A method of validation by successive
clinical predictions. J. abnorm. soc.
Psychol., 1950, 45, 507-509.

37. SARBIN, T. R. The logic of prediction in psychology.
Psychol. Rev., 1944, 51, 210-228.

38. SILVERMAN, J. J., & POWELL, V. E. Studies on
palmar sweating: III. Palmar sweating in an
army general hospital. Psychosom. Med., 1944,
6, 243-249.

39. STEIN, M., RONZONI, E., & GILDEA, E. F. Physiologic
response to heat stress and ACTH of
normal and schizophrenic subjects. Amer. J.
Psychiat., 1951, 108, 450-455.

40. STOCKINGS, G. T. A clinical study of the mescaline
psychosis, with special reference to the mechanism
of the genesis of schizophrenic and other
psychotic states. J. ment. Sci., 1940, 86, 29-47.

41. VAUGHAN, V. C., III, ALLEN, F. H., JR., & DIAMOND,
L. K. Erythroblastosis fetalis: IV. Further
observations on kernicterus. Pediatrics, 1950, 6,
706-716.

42. WILLIAMS, R. D., & MASON, H. L. Further observations
on induced thiamine (vitamin B1) deficiency
and thiamine requirement of man. Proc.
Staff Meet. Mayo Clin., 1941, 16, 433-438.

43. WILLIAMS, R. D., MASON, H. L., WILDER, R. M., &
SMITH, B. F. Observations on induced thiamine
(vitamin B1) deficiency in man. Arch. intern.
Med., 1940, 66, 785-799.

44. WITTMAN, PHYLLIS. Diagnostic and prognostic
significance of the shut-in personality type as a
prodromal factor in schizophrenia. J. clin.
Psychol., 1948, 4, 211-214.

45. ZUBIN, J. Introduction: symposium on statistics
for the clinician. J. clin. Psychol., 1950, 6, 1-6.


Received February 21, 1953. 


AFFECTIVE TONE AND VISUAL RECOGNITION THRESHOLDS 


LEONARD D. GOODSTEIN 
State University of Iowa 


McGinnies (5) and others (12) have
invoked a concept of perceptual defense
to account for the heightened
recognition thresholds for negatively valued 
and affectively charged words. Perceptual 
defense supposedly operates as a mechanism of 
resistance to the recognition of such anxiety- 
provoking stimuli. 

McGinnies’ interpretation, however, has 
been severely criticized by Howes and Solomon 
(2) on the grounds that it did not take into 
account the relative frequency of the stimulus 
words as they occur in printed material. These 
critics pointed out that the heightened thresh- 
olds of the words with negative affective value 
can be more parsimoniously explained in terms 
of their associative strength or relative famil- 
iarity. Additional evidence of the importance of 
the associative value of the stimulus words was 
found in several subsequent studies (3, 5, 6, 7,
8, 9, 10).

The purpose of the present note is to demon- 
strate that the recently reported results of 
Aronfreed, Messick, and Diggory (1) can also 
be interpreted as a function of the relative 
associative strength of the stimulus words. 
These investigators studied tachistoscopic 
visual recognition thresholds and galvanic skin 
responses as functions of the affective value of 
stimulus words. The 25 words employed had
previously been scaled from pleasant to unpleasant
by the method of equal-appearing
intervals. The words were placed in three
categories: pleasant (N = 5), neutral (N = 5),
and unpleasant (N = 15). The authors reported
that the mean visual recognition threshold for 
the unpleasant words was significantly higher 
(p < .01) than the means for the neutral and 
the pleasant words. There were no significant 
differences in recognition thresholds between 
the pleasant and neutral words.¹ The authors
interpreted their threshold data as confirming
McGinnies’ (5) previous finding that “the affective
tone of the stimulus words has had an
effect on mean recognition time ...” (1, p. 525).

¹ The GSR data are difficult to interpret because of
a sex and instructional set interaction, and will not be
considered in the present paper.


TABLE 1
MEANS AND STANDARD DEVIATIONS FOR THE
LOG WORD FREQUENCIES OF THREE
CATEGORIES OF STIMULUS WORDS

                         GROUP
STATISTIC    PLEASANT    NEUTRAL    UNPLEASANT
N            5           5          15
Mean         2.85        2.63       1.22
SD           .83         .92        .50


Examination of their list revealed that, of the 
ten pleasant and neutral words, five were 
among the 500 most common words in the 
English language as determined by the Thorn- 
dike-Lorge word count (11); seven of the ten 
were among the 1,000 most common words. 
Only one of the 15 unpleasant words, however, 
was among the 1,000 most common words. 

The relative frequency of each of the 25 
words was determined by use of the Thorn- 
dike-Lorge general word count (11). The means 
and standard deviations of the logs of these 
frequencies for the three categories of words 
are presented in Table 1. The differences between
the mean log frequencies of the unpleasant
and neutral words (t = 3.53, df =
18, p < .01) and between the unpleasant and
pleasant words (t = 2.68, df = 18, p < .02) are
highly significant. There is no significant difference
between the means of the neutral and pleasant
words (t = .39, df = 8, p > .60).

The correlation between the pleasantness
ratings of the stimulus words and their recognition
thresholds is .53 (p < .01).² When, however,
the variance attributable to the associative
value (relative frequency) is partialled out,
this correlation is reduced to .18 (p = .34
approx.).
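
The partialling-out step can be sketched with the standard first-order partial correlation formula. The note reports only the zero-order correlation (.53) and the partial correlation (.18); it does not report the correlation between the pleasantness ratings and log frequency, so the two intermediate values used below are hypothetical, as is the function name `partial_corr`.

```python
import math


def partial_corr(r_xy: float, r_xz: float, r_yz: float) -> float:
    """First-order partial correlation of x and y with z held constant."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# x = pleasantness rating, y = recognition threshold, z = log word frequency.
r_xy = 0.53    # reported in the note
r_xz = -0.65   # hypothetical, not reported
r_yz = -0.58   # hypothetical, not reported

r_partial = partial_corr(r_xy, r_xz, r_yz)
print(round(r_partial, 2))
```

With intermediate correlations of this size and sign the partial correlation falls well below the zero-order value, which is the pattern the note describes.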
² I thank Dr. James Diggory for [...] data [...] that
were not reported in the published paper.

The associative value of the stimulus words
is not the only factor that can modify a
perceptual defense interpretation. The effects
of response suppression, i.e., the voluntary refusal
to report emotionally charged stimulus
words, have also been clearly shown (7). 
Aronfreed, Messick, and Diggory attempted 
to study the effects of response suppression by 
giving one group of Ss information about the 
nature of the words that would be used as 
stimuli. The correlation between the mean log 
frequencies and recognition thresholds for the 
group receiving this information is not 
significantly different from that for the non- 
informed group. The correlations were —.62 
and —.54, respectively. This finding would 
strongly suggest that the authors’ attempts 
to prevent response suppression did not reduce 
the importance of the associative value of the 
stimulus words. 
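
The note does not say which test was used to compare the two correlations. One common choice is Fisher's z transformation, sketched below under two loudly stated assumptions: that each correlation rests on the 25 stimulus words (n = 25), and that the two correlations can be treated as independent, which is doubtful here because the same words underlie both. The function name `fisher_z_difference` is illustrative.

```python
import math


def fisher_z_difference(r1: float, n1: int, r2: float, n2: int):
    """Two-sided normal test for the difference between two independent r's."""
    z1, z2 = math.atanh(r1), math.atanh(r2)       # Fisher z transforms
    se = math.sqrt(1 / (n1 - 3) + 1 / (n2 - 3))   # standard error of z1 - z2
    z = (z1 - z2) / se
    p = math.erfc(abs(z) / math.sqrt(2))          # two-sided normal p value
    return z, p


# Reported correlations; n = 25 is an assumption (the number of words).
z, p = fisher_z_difference(-0.62, 25, -0.54, 25)
print(round(z, 2), round(p, 2))
```

Under these assumptions the difference is far from significant, in line with the statement in the text.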

These findings clearly confirm the results of 
those studies that have found no differences in 
recognition thresholds for stimulus words of 
purportedly differential affective value when 
the relative frequency of the stimulus words is 
controlled (7, 8, 9). 


REFERENCES 


1. ARONFREED, J. M., MESSICK, S. A., & DIGGORY,
J. C. Re-examining emotionality and perceptual
defense. J. Pers., 1953, 21, 517-528.

2. HOWES, D. H., & SOLOMON, R. L. A note on McGinnies’
“Emotionality and perceptual defense.”
Psychol. Rev., 1950, 57, 229-234.

3. HOWES, D. H., & SOLOMON, R. L. Visual duration
threshold as a function of word probability.
J. exp. Psychol., 1951, 41, 401-410.

4. MCGINNIES, E. M. Emotionality and perceptual
defense. Psychol. Rev., 1949, 56, 244-251.

5. MCGINNIES, E. M., COMER, P. B., & LACEY, O.
L. Visual recognition thresholds as a function
of word length and word frequencies. J. exp.
Psychol., 1952, 44, 65-69.

6. MURRAY, D. B. Recognition and recall as a function
of frequency, shock and individual differences
variables. Unpublished doctor's dissertation,
State Univer. of Iowa, 1953.

7. POSTMAN, L., BRONSON, WANDA S., & GROPPER,
G. L. Is there a mechanism of perceptual defense?
J. abnorm. soc. Psychol., 1953, 48, 215-224.

8. POSTMAN, L., & SCHNEIDER, B. M. Personal values,
visual recognition, and recall. Psychol. Rev.,
1951, 58, 271-284.

9. SOLOMON, R. L., & HOWES, D. H. Word frequency,
personal values, and visual duration thresholds.
Psychol. Rev., 1951, 58, 256-270.

10. SOLOMON, R. L., & POSTMAN, L. Frequency of
usage as a determinant of recognition thresholds
for words. J. exp. Psychol., 1952, 43, 195-201.

11. THORNDIKE, E. L., & LORGE, I. The teacher's word
book of 30,000 words. New York: Teachers
Coll., Columbia Univer., 1944.

12. VANDERPLAS, J. M., & BLAKE, R. R. Selective
sensitization in auditory perception. J. Pers.,
1949, 18, 252-256.


Received February 15, 1954. Prior publication. 


SOME EFFECTS OF SITUATIONAL THREAT ON GROUP BEHAVIOR¹


JOHN T. LANZETTA 
Crew Research Laboratory, Air Force Personnel and Training Research Center 


DON HAEFNER, PETER LANGHAM, AND HOWARD AXELROD 
University of Rochester


The effects of threat, or other anxiety-evoking
situations, on behavior have received
increasing attention from social
scientists in recent years. Interest has been 
evidenced both by an increase in empirical 
studies (5, 7, 8, 9, 10) and renewed attempts 
to incorporate the concept of anxiety within 
the framework of contemporary learning 
theory (13, 15, 18). The majority of the experi- 
ments have concerned themselves with the 
effects of situationally produced anxiety on 
performance and have focused on individuals 
either acting alone or coacting. Such studies of 
necessity ignore the area of interpersonal re- 
lations, an area which clinical and theoretical 
literature would indicate is particularly sensi- 
tive to changes in anxiety level. The present 
study was an attempt to determine the effects 
of situational threat on the behavior of inter- 
acting individuals. As such it can contribute to 
the growing body of empirical evidence on the 
functioning of small groups as well as shed 
light on the experimentally neglected area of 
anxiety and interpersonal relations. 

The broad question to be examined is this: 
How does variation in threat to a group re- 
flect itself in changes in group member be- 
havior? Variation in threat can occur along 
several dimensions; nature and/or intensity, 
locus in time, locus in "psychological" space, 
and target. The latter two are peculiar to a 
concern with group processes and have no 
counterpart if the focus were the individual 
acting alone. “Locus in psychological space”²
refers to the origin of the threat: whether it 


¹ The work described in this paper was done under a
contract between the Office of Naval Research and the 
University of Rochester. 

² In some respects this dimension when limited to
the dichotomy “no-locus” (objectless, unspecified, 
etc.) and “specific-locus” parallels the distinction 
made between anxiety and fear. We are interested only 
in situationally produced threat and thus according to 
some theorists are dealing with fear. We concur with 
the opinion of Miller and Dollard (14) and will not 
attempt to distinguish between fear and anxiety. 


arises from some internal group process or 
originates from an external-to-the-group source.
Target is orthogonal to this dimension; either 
the threat is directed to the group as a whole 
or to individual group members. These two 
dichotomized dimensions were the independent 
variables for this study. 


METHOD 


The basic method used was that reported in previous 
studies by this laboratory (2, 3). Groups composed of 
three subjects (Ss) worked on a task called “intercept” 
in a laboratory setting. The Ss were observed by four 
observers (Os), who immediately classified their ongoing
behavior in terms of a category system. After
the experimental session, each S was rated on a number 
of traits and both Os and Ss filled out a questionnaire 
designed to assess various "group" characteristics. 
The Ss also responded to an adjective check list and a 
social-sensitivity questionnaire. 

Design. The main independent variable was threat. 
Within the threat condition, a distinction was made 
between locus and target of threat, each of these being 
dichotomized: locus into external and internal to the 
group, and target into group or individually directed. 
A 2 × 2 factorial design was utilized for the threat
conditions with nine groups assigned to each of the 
cells. Nine groups were also assigned to a control 
(nonthreat) condition. 

Experimental conditions. Threat was produced by 
structuring the sessions as evaluation tests. Previous 
studies of leadership at the laboratory have utilized 
ROTC trainees, and subsequent inquiries indicated that 
they perceived the project staff as an evaluative group. 
By securing the cooperation of the AFROTC and NR- 
OTC commanding officers, we could utilize this orien- 
tation to create our threat. The Ss were informed by 
letter that they were to be evaluated and that this 
assessment would become part of their record. This 
was reinforced by several impersonal letters, rather 
curt notices as to scheduled sessions, and a formal and 
impersonal attitude on the part of the experimenter.
Within this context of external threat, target and
locus were varied by varying instructions as to the 
method of evaluation. Locus was manipulated by vary- 
ing the stated source of the evaluation ratings with 
either Os rating Ss (external threat), or Ss rating themselves
(internal threat). Target was manipulated by
informing Ss that they were being evaluated as indi- 
viduals (individual threat), or that their evaluation 
reports would be identical and based on ratings the 
group received (group threat). 




After the instructions were read, Ss were told to 
proceed to the task instructions, and E left the room. 
He did not re-enter the room again until completion 
of the task, at which time Ss were asked to fill out 
some questionnaires and then allowed to go. 

The Ss under control conditions were instructed 
by a different E. They were informed that they were 
being used for a study necessary to complete requirements
for a graduate course. The E attempted to set
an informal, relaxed atmosphere when Ss first entered 
and usually spent five to ten minutes interacting with 
them before the session began. The Ss were asked to 
do their best, the task instructions were pointed out, 
and they were told to proceed. 

Subjects. The Ss for the threat condition were 108 
AFROTC and NROTC students in freshman and 
sophomore classes. They comprised about 60 per cent 
of the total enrollment of these groups. The 108 Ss 
were randomly assigned to groups of three, and the 
groups were then randomly assigned to cells within the 
threat condition. 

The 27 Ss for the control condition were volun- 
teers from the freshman and sophomore non-ROTC 
student population. They were randomly assigned to 
nine groups of three members each. 
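The two-stage random assignment (Ss to groups of three, then groups to cells) can be sketched as follows. The function name `assign_groups`, the integer subject identifiers, and the fixed seed are assumptions made for the illustration; the report does not say how the randomization was actually carried out.

```python
import random

THREAT_CELLS = [
    ("external", "individual"),
    ("external", "group"),
    ("internal", "individual"),
    ("internal", "group"),
]

def assign_groups(subject_ids, cells, group_size=3, seed=0):
    """Randomly form groups, then randomly allot equal numbers of groups to cells."""
    rng = random.Random(seed)          # seeded only so the sketch is repeatable
    ids = list(subject_ids)
    if len(ids) % group_size:
        raise ValueError("subjects do not divide evenly into groups")
    rng.shuffle(ids)                   # stage 1: Ss to groups
    groups = [ids[i:i + group_size] for i in range(0, len(ids), group_size)]
    if len(groups) % len(cells):
        raise ValueError("groups do not divide evenly into cells")
    rng.shuffle(groups)                # stage 2: groups to cells
    per_cell = len(groups) // len(cells)
    return {
        cell: groups[i * per_cell:(i + 1) * per_cell]
        for i, cell in enumerate(cells)
    }

assignment = assign_groups(range(1, 109), THREAT_CELLS)
```

With 108 Ss this yields 36 groups, nine in each of the four threat cells.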

Task. The task utilized was called “intercept” and 
was a modification of a task developed by Schutz³ for
the Tufts Systems Analysis project. The essential 
elements were a 4 X 4-foot board ruled into 11 X 11 
squares each identified by coordinates, and enemy and 
friendly planes of three different colors. The group's 
job was to protect a "carrier" (the center square on the 
board) from being hit by “enemy” planes. The mem- 
bers accordingly had to plan a strategy for deployment 
of their planes against the enemy attack. Since but 
one of their planes could move in response to each 
enemy move, they were required to decide jointly on 
the most effective move to make. Several factors were 
introduced to make such choices a possible source of 
conflict: the scoring was so arranged that there were 
penalties and rewards for the group as a whole as well 
as for individual members of the group; a different sug- 
gested "best" strategy was imbedded in each S's 
instructions; the “radar” reports for each member were 
usually different. There were many occasions in the 
action where the individual was presented the alterna- 
tive of behaving in his own or the group interest. 

The full task consisted of three phases. In Phase 1 
Ss worked on the intercept problem for 15 minutes. 
Phase 2 was a 10-minute discussion of a “best” strategy 
based on Ss’ experience in Phase 1. In Phase 3 they 
again worked for 15 minutes on the intercept problem 
(a new set of enemy moves), utilizing the strategy 
decided upon in Phase 2. 

All instructions for the task were typed, and those 
for the various phases were placed in separate en- 
velopes. The Ss were informed when to move to a new 
phase by a buzzer signal. The E was not present at 


LANZETTA, HAEFNER, LANGHAM, AND AXELROD


any time after the reading of the initial instructions 
discussed in the preceding section. 

Instruments. The primary data-collection technique 
was the recording of ongoing behavior in terms of an 
adaptation of the system proposed by Bales (1). The 
system was elaborated by the use of subcategories 
used by Bales as significata for his main categories. A 
full description of the categories is presented by Bales 
(1, pp. 177-195). 

One possible criticism of an analysis based on cate- 
gorizations of ongoing behavior is that indices based 
on a simple sum of behaviors in certain categories ig- 
nore the differential intensity of similarly classified 
acts and the integrated, holistic character of an in- 
dividual’s behavior. Since to a certain extent this 
criticism may be valid, global judgments were made of 
behavioral characteristics of individuals-in-situation 
by means of rating scales. These ratings consisted of 
judgments about the cooperativeness, initiative, effec- 
tive leadership, goal orientation, etc. of individuals. 
In some cases the ratings paralleled indices derived 
from the interaction categories, and thus served as a 
check on them. 

Another area in which global judgments seemed 
necessary was that of group properties. Properties 
such as the morale or cohesiveness of a group are prob- 
ably the synthesis of certain interaction patterns; but, 
because concise behavioral definitions of the terms are 
lacking, it was difficult to arrive at an index based on 
the categories alone. For this purpose, a postmeeting 
reaction questionnaire was developed. This question- 
naire, with the addition of certain satisfaction and 
sociometric items, was also filled out by Ss. The Ss 
were also asked to check an adjective check list⁵ and a
social-sensitivity inventory.⁶

The dependent variables measured by
use of these instruments can be classified into three
broad areas. The rational classes, which crosscut the
instruments, are:

1. Variables attempting to assess the nature of the 
interpersonal relations between group members. The 
denotata of this class are aggression, sociability, co- 
operativeness, autocratic behavior, solidary (friendly, 
praising, etc.) behavior, antagonistic acts, striving for 
group acceptance, the degree of informal friendliness 
exhibited by the group, and the extent of personality 
conflict. 

2. Variables concerned with utilization of the actors’ 
and other group members’ resources in attempted prob- 
lem solution. These are sometimes difficult to distin- 
guish from 1 and in fact are not considered completely 
independent. The differentiating criterion is whether 
problem solution or another group member is considered 
the primary target of action. Included in this class are 
such variables as leadership; self-oriented behavior; 
achievement motivation; giving of suggestions, opin- 
ions, and orientations; asking for opinions, suggestions, 
or orientations; disagreements and agreements; goal


3 We gratefully acknowledge the aid of Dr. W. C. 
Schutz in making the task available to us. 

4 A complete description of the task is unfeasible
in the short space available. A fuller description and 
instructions to Ss can be obtained from the authors. 


5 Developed by Helen and Vincent Nowlis at the
University of Rochester, from a list utilized by Gough 
at the University of California. 

6 Developed by Urie Bronfenbrenner at Cornell
University. 


SITUATIONAL THREAT AND GROUP BEHAVIOR 


orientation of the group; competition in the group;
and the degree of equal participation of group members 
in problem solution. 

3. Variables that attempt to assess the effectiveness 
of the group or the individual in achieving specified 
ends. Adaptability, efficiency, productivity, and com- 
munication effectiveness in the group are included in 
this class. 

The first two classes require inferences about moti- 
vational states from observation of ongoing behavior 
and, as mentioned, the distinguishing criterion is the 
target of action. The last class requires judgments about 
the extent to which behavior resulted in certain specified
effects.


RESULTS 


Measures obtained from records of ongoing 
behavior, ratings of each individual, ratings of 
the group, Ss’ responses to a questionnaire 
concerned with aspects of their group’s func- 
tioning, and Ss’ responses to an adjective 
check list were available for analysis. For 
present purposes a “group” score for each 
variable was obtained by summing over ob- 
servers. Where measures on individuals were 
obtained, scores were also summed over the 
three individuals composing a group. For the 
Bales category system, the group score on each 
category of behavior was then expressed as a 
proportion of the total number of acts recorded 
for the group. Such a proportion index elim- 
inates the possibility of obtaining significant 
differences between treatments due to a 
difference in activity rate. Its use was based on 
the assumption that change in activity rate 


TABLE 1

INTEROBSERVER RELIABILITIES FOR THE BALES CATEGORIES
(N = 108)

CATEGORY                            r*
I     Shows solidarity             .64
II    Shows tension release        .65
III   Agrees                       .83
IV    Gives suggestion             .89
V     Gives opinion                .91
VI    Gives orientation            .87
VII   Asks orientation             .12
VIII  Asks opinion                 .80
IX    Asks suggestion              .46
X     Disagrees                    .76
XI    Shows tension                .10
XII   Shows antagonism             [illegible]

* Corrected for double length since the sum of the two Os’
categorizations was used in analysis.


TABLE 2

INTEROBSERVER RELIABILITIES FOR TRAIT RATINGS
(N = 108)

                                     AVERAGE RELIABILITY OF
RATING                               SINGLE OBSERVER RATING*
 1. Cooperativeness                  .46
 2. Efficiency                       .66
 3. Leadership                       [illegible]
 4. Aggression                       .53
 5. Autocratism                      .62
 6. Sociability                      .38
 7. Striving for group acceptance    .34
 8. Adaptability                     [illegible]
 9. Self-oriented needs              .43
10. Achievement motivation           .65

* See text for discussion of technique used in obtaining these
reliabilities.


would be proportionately distributed over the 
categories in the system. 
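The scoring rule for the interaction categories (sum over observers, then divide by the group's total acts) can be illustrated with a short sketch. The tallies below are hypothetical and the function name `proportion_index` is introduced only for this example; it shows why a uniform change in activity rate leaves the index unchanged.

```python
from collections import Counter

def proportion_index(observer_counts):
    """Sum category tallies over observers and express each as a proportion of all acts."""
    total = Counter()
    for counts in observer_counts:
        total.update(counts)           # adds the tallies category by category
    n_acts = sum(total.values())
    return {category: n / n_acts for category, n in total.items()}

# Hypothetical tallies from two observers of the same group
obs_a = {"gives opinion": 40, "gives orientation": 25, "agrees": 10}
obs_b = {"gives opinion": 36, "gives orientation": 27, "agrees": 12}

slow = proportion_index([obs_a, obs_b])

# The same group acting twice as fast, with acts spread over categories as before
fast = proportion_index([
    {category: 2 * n for category, n in obs_a.items()},
    {category: 2 * n for category, n in obs_b.items()},
])
```

Here `slow` and `fast` are identical, which is the sense in which the index removes differences due to activity rate alone. If a treatment changed activity unevenly across categories, the proportions would of course differ.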

The interobserver reliabilities for the Bales 
categories are shown in Table 1. The inter- 
observer reliabilities for the subcategories were 
considerably lower than those for the main 
categories, and thus only the main categories 
were used for analysis. 
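The note to Table 1 says the coefficients were "corrected for double length." The sketch below assumes this refers to the Spearman-Brown formula, which the report does not name explicitly; the function names are hypothetical.

```python
def spearman_brown(r, k=2):
    """Reliability of a sum of k parallel measures, given single-measure reliability r."""
    return k * r / (1 + (k - 1) * r)

def single_from_stepped_up(r_k, k=2):
    """Invert the formula: single-measure reliability implied by a stepped-up value."""
    return r_k / (k - (k - 1) * r_k)

# An uncorrected correlation of .70 between two observers
doubled = spearman_brown(0.70)

# The single-observer value implied by a corrected coefficient of .83
implied = single_from_stepped_up(0.83)
```

Under this assumption a corrected value such as .83 corresponds to an uncorrected interobserver correlation of roughly .71.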

The average observer reliabilities of the 
trait ratings are shown in Table 2. The best 
technique to use in determining multijudge 
reliability is difficult to decide upon (6, 17), 
and estimates obtained by various methods are 
not exactly comparable. The reliabilities shown 
were obtained by an analysis of variance 
technique described by McNemar (11, p. 280) 
and are based on an error of measurement 
estimate which includes an adjustment for 
differing observer (form) means. They are un- 
derestimates of the "true" reliabilities since 
scores utilized for analysis were based on 
ratings summed over four Os as well as three 
Ss. 
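A single-observer reliability that adjusts for differing observer means can be obtained from a two-way analysis of variance. The sketch below uses the familiar consistency form (between-subjects mean square against residual mean square); it is offered as an assumption about the general approach and is not claimed to reproduce McNemar's procedure exactly. The ratings are hypothetical.

```python
def single_rater_reliability(ratings):
    """ratings[i][j] is the rating of subject i by observer j (complete table).

    Observer main effects are removed from the error term, so observers who
    differ only by a constant do not lower the coefficient.
    """
    n, k = len(ratings), len(ratings[0])
    grand = sum(map(sum, ratings)) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (ms_rows + (k - 1) * ms_error)

def stepped_up(r, k):
    """Spearman-Brown estimate for the sum of k observers' ratings."""
    return k * r / (1 + (k - 1) * r)

# Hypothetical case: four observers who differ only in leniency
true_scores = [50, 55, 60, 65]
leniency = [0, 2, -1, 3]
ratings = [[s + bias for bias in leniency] for s in true_scores]
r_single = single_rater_reliability(ratings)

# Cooperativeness in Table 2 (.46), stepped up to a sum over four Os
r_four = stepped_up(0.46, 4)
```

In the hypothetical case the coefficient is 1, since the observers disagree only in level. The stepped-up value for cooperativeness, about .77, illustrates why the tabled single-observer figures understate the reliability of the summed scores; it ignores the further summing over three Ss.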

In general, results will be presented in terms
of measuring instruments and then organized 
with respect to the areas of functioning dis- 
cussed previously. 

As the statement of the problem indicated, 
the study was concerned with two main ques- 
tions: (a) What are the effects of situational 
threat on certain aspects of group functioning? 
(b) To what extent do locus and target of
threat affect group member behavior? 




Although the former question has logical 
priority, the design dictated an analysis for 
locus and target as a first step. 


Locus and Target of Threat 


Table 3 indicates the mean percentage of be- 
havior falling in the various Bales categories 
for the four experimental conditions. The per- 
centages are believed to be quite stable, the 
number of units on which they are based rang- 
ing from 1,100 to 1,300. 
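A rough sense of the stability claimed here can be had from the standard error of a proportion. The sketch treats acts as independent binomial trials, which is an assumption of this example only (successive acts in a discussion are certainly not independent), so the figures are best read as lower bounds on the sampling error.

```python
from math import sqrt

def proportion_se(p, n):
    """Binomial standard error of a proportion p based on n units."""
    return sqrt(p * (1 - p) / n)

# Proportions spanning the range seen in the categories, at the two extremes of n
for p in (0.01, 0.05, 0.25, 0.37):
    for n in (1100, 1300):
        print(f"p = {p:.2f}, n = {n}: SE = {100 * proportion_se(p, n):.2f} percentage points")
```

Even for the most frequent category the standard error under this assumption stays below about one and a half percentage points.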

From the total column one can note con- 
siderably more of some kinds of behavior 
recorded than others. Most of this difference 
reflects the nature of the task and the composi- 
tion of the group, but some of it probably re- 
sults from the differential difficulty of observ- 
ing various kinds of behavior. The distribution 
obtained is similar to one obtained by Bales 
for a thesis discussion group and has a family 
resemblance to those obtained for his standard 
chess task (1). 

The F ratios were computed for main effects 
and interaction effects for each category, by use 
of analysis of variance techniques. None of the 
F ratios for any category on either instrument 
was significant. The Bartlett test for homo- 


TABLE 3
MEAN PERCENTAGE OF BEHAVIOR IN THE BALES CATEGORIES
FOR THE FOUR EXPERIMENTAL CONDITIONS

                                 EXTERNAL              INTERNAL
CATEGORY                    INDIVIDUAL   GROUP    INDIVIDUAL   GROUP     TOTAL
I     Shows solidarity         2.04       1.20       1.72       1.56      1.63
II    Shows tension release     .88        .60        .80   [illegible]    .75
III   Agrees                   6.67       6.50       6.73       7.24      6.79
IV    Gives suggestions        9.21       9.16       7.59       9.59      8.89
V     Gives opinions          37.74      37.24      37.61      34.72     36.83
VI    Gives orientation       22.30      23.84      24.76      25.74     24.16
VII   Asks for orientation     5.23       5.41       5.24       5.42      5.33
VIII  Asks for opinion         4.62       5.08       4.87       4.94      4.88
IX    Asks for suggestion       .82        .97        .68        .89       .84
X     Disagrees                4.09       4.26       3.84       2.63      3.71
XI    Shows tension            3.78       3.17       3.43       4.30      3.67
XII   Shows antagonism         2.61       2.62       2.69       2.24      2.54

Total percentage             100.00     100.06      99.96      99.98    100.02
Total number of acts           1249       1247       1277       1176      1237




geneity of variance indicated significant cor- 
rected chi squares for four of the categories. 

Similar analysis for observer trait ratings, 
observer ratings of the group, and Ss’ ratings 
of their groups also yielded nonsignificant F 
ratios for main effects and interaction on all of 
the variables. For these instruments three of 
the corrected chi squares were significant.
The general lack of significance for locus and 
target did not warrant transformations for 
those few variables that were nonhomogeneous 
since no justifiable conclusions could be based 
on these variables alone. 
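The form of the locus-by-target analysis can be sketched for a single variable as a balanced 2 X 2 analysis of variance with nine groups per cell. The scores below are hypothetical and were constructed only to exercise the arithmetic; the function name `two_by_two_anova` is likewise an assumption of this example. Each effect has 1 degree of freedom, so the F ratio is the effect sum of squares over the within-cell mean square.

```python
from itertools import product
from statistics import mean

def two_by_two_anova(cells):
    """Balanced 2 x 2 between-groups ANOVA.

    cells maps (locus, target) to a list of group scores, equal n per cell.
    Returns the three F ratios and the within-cell degrees of freedom.
    """
    loci = sorted({locus for locus, _ in cells})
    targets = sorted({target for _, target in cells})
    n = len(cells[(loci[0], targets[0])])
    cell_mean = {key: mean(scores) for key, scores in cells.items()}
    grand = mean(list(cell_mean.values()))   # valid because the design is balanced
    locus_mean = {a: mean([cell_mean[(a, b)] for b in targets]) for a in loci}
    target_mean = {b: mean([cell_mean[(a, b)] for a in loci]) for b in targets}

    ss_locus = n * len(targets) * sum((m - grand) ** 2 for m in locus_mean.values())
    ss_target = n * len(loci) * sum((m - grand) ** 2 for m in target_mean.values())
    ss_inter = n * sum(
        (cell_mean[(a, b)] - locus_mean[a] - target_mean[b] + grand) ** 2
        for a, b in product(loci, targets)
    )
    ss_within = sum(
        (x - cell_mean[key]) ** 2 for key, scores in cells.items() for x in scores
    )
    df_within = len(cells) * (n - 1)
    ms_within = ss_within / df_within
    return {
        "locus": ss_locus / ms_within,
        "target": ss_target / ms_within,
        "interaction": ss_inter / ms_within,
        "df_within": df_within,
    }

# Hypothetical scores: nine groups per cell, a small locus difference, no target effect
spread = range(-4, 5)
cells = {
    ("external", "individual"): [10 + d for d in spread],
    ("external", "group"): [10 + d for d in spread],
    ("internal", "individual"): [12 + d for d in spread],
    ("internal", "group"): [12 + d for d in spread],
}
result = two_by_two_anova(cells)
```

With nine groups per cell each F is referred to 1 and 32 degrees of freedom. The sketch omits the Bartlett test mentioned in the text.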


Threat versus Control (Nonthreat) 


Since there were no significant differences be- 
tween the various threat treatments, all threat 
groups were combined for comparison with the 
control groups. The t's for all of the variables
were computed, using the technique described
in McNemar (11, p. 223) for samples with
unequal N's. The results of the comparisons
for the interaction categories, trait ratings, and 
observer ratings of the group are shown in 
Tables 4, 5, and 6. In those cases where the F 
ratio indicated a significant difference in 
variance between the two samples, Cochran 
and Cox's correction, as given in Edwards (4, 


TABLE 4
SIGNIFICANCE OF DIFFERENCES BETWEEN THREAT AND CONTROL CONDITIONS
IN MEAN PERCENTAGE OF BEHAVIOR IN THE BALES CATEGORIES*

                               THREAT (N = 36)     CONTROL (N = 9)
CATEGORY                         M        σ²         M        σ²        F†       t†
I     Solidarity                1.63      .85       1.89      .52      1.61     <1
II    Tension release            .77      .32        .74      .50     -1.58     <1
III   Agrees                    6.79     3.18       7.24     2.80      1.14     <1
IV    Gives suggestions         8.89     4.28      11.04     2.57      1.67    -3.39§
V     Gives opinion            36.83    22.06      37.82    16.16      1.37     <1
VI    Gives orientation        24.16    15.69      22.74    10.24      1.14     <1
VII   Asks orientation          5.33     2.43       4.62     1.15      2.12     5.08§
VIII  Asks opinions             4.89     2.43       4.06     2.03      1.20     1.52
IX    Asks suggestions           .84      .21        .89      .20      1.05     <1
X     Disagrees                 3.71     4.38       2.81     2.32      1.89     1.45
XI    Shows tension             3.67     2.38       3.20     2.33      1.00     <1
XII   Shows antagonism          2.54     3.05       2.92     4.41     -1.45     <1
      Total activity            1237   23,950       1392    9,152      2.62    -3.77§

* Number of acts in a category were summed over Os and Ss
in a group to give a group score. These scores were then expressed
as percentage of total acts recorded for the group.

† Positive values of F and t indicate threat > control; negative
values indicate control > threat.

§ Significant at the .01 level or beyond.




TABLE 5
SIGNIFICANCE OF DIFFERENCES BETWEEN THREAT AND CONTROL CONDITIONS
IN MEAN RATINGS OF INDIVIDUALS*

                                     THREAT (N = 36)     CONTROL (N = 9)
RATING                                 M        σ²         M        σ²        F†        t†
 1. Cooperation                      55.78    63.38      53.22    46.94      1.35      <1
 2. Efficiency                       51.75    62.42      56.00    14.75      4.23‡    -2.31‡
 3. Leadership                       49.56    41.34      52.11     8.61      4.80‡    -1.76
 4. Aggression                       35.81   116.16      43.00   153.00     -1.32     -1.60
 5. Autocratism                      39.86    80.01      47.00   108.00     -1.35     -1.88
 6. Sociability                      44.92    92.02      43.56   105.02     -1.14      <1
 7. Striving for group acceptance    43.72    44.03      40.00    10.25      4.30‡     2.42‡
 8. Adaptability                     54.08    63.51      57.89     2.36     26.91§    -2.68‡
 9. Self-oriented needs              42.03    92.83      41.67    69.50      1.34      <1
10. Achievement motivation           59.33    65.71      64.56    54.54      1.21     -1.86

* Ratings were summed over the four Os and three Ss in each
group to give a group rating.

† Positive values of F and t indicate threat > control; negative
values indicate control > threat.

‡ Significant at .05 level or beyond.

§ Significant at .01 level or beyond.


TABLE 6
SIGNIFICANCE OF DIFFERENCES BETWEEN THREAT AND CONTROL CONDITIONS
IN OBSERVER RATINGS OF GROUP CHARACTERISTICS*

                                       THREAT (N = 36)     CONTROL (N = 9)
INDEX                                    M        σ²         M        σ²        F†        t†
1. Goal orientedness                   91.36   211.84      93.67   195.75      1.08      <1
2. Productivity                        13.89   115.30      14.67     5.00     23.06§     <1
3. Degree of equal participation       39.61   100.07      42.44   120.53     -1.20      <1
4. Degree of personality conflict      44.58   231.45      46.33   219.25      1.06      <1
5. Amount of informal friendliness     51.44   167.97      51.00   277.25     -1.65      <1
6. Communication effectiveness         48.81   189.96      52.67    41.50      4.51‡    -1.23
7. Competition between members         43.61   207.44      48.33   195.50      1.06      <1
8. Cohesiveness                        64.17   259.00      64.00   148.75      1.74      <1

p. 168), was used in evaluating the significance
of the obtained t.
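The computations behind a row of these tables can be sketched from the summary statistics alone. The report does not print its formulas, so the following are assumptions: the t is taken as the mean difference over the square root of s²/N summed for the two samples, and the Cochran and Cox correction is taken in its usual form as a weighted average of the two tabled critical values. The function names are hypothetical. The figures are those of the "Gives suggestions" row of Table 4, and the critical values 2.030 (35 df) and 2.306 (8 df) are the standard two-tailed .05 points of t.

```python
from math import sqrt

def signed_variance_ratio(var_threat, var_control):
    """Larger variance over smaller; positive when the threat variance is larger."""
    if var_threat >= var_control:
        return var_threat / var_control
    return -(var_control / var_threat)

def t_separate_variances(m1, var1, n1, m2, var2, n2):
    """t for two independent samples without pooling the variances."""
    return (m1 - m2) / sqrt(var1 / n1 + var2 / n2)

def cochran_cox_critical(var1, n1, t1, var2, n2, t2):
    """Approximate critical t when variances differ: tabled values weighted by s^2/N."""
    w1, w2 = var1 / n1, var2 / n2
    return (w1 * t1 + w2 * t2) / (w1 + w2)

# "Gives suggestions": threat M = 8.89, variance 4.28, N = 36;
# control M = 11.04, variance 2.57, N = 9
f_ratio = signed_variance_ratio(4.28, 2.57)
t_value = t_separate_variances(8.89, 4.28, 36, 11.04, 2.57, 9)
t_needed = cochran_cox_critical(4.28, 36, 2.030, 2.57, 9, 2.306)
```

Under these assumptions the F comes to about 1.67 and the t to about -3.38, close to the tabled 1.67 and -3.39, which suggests the unpooled form is at least near what was used. The corrected critical value falls between the two tabled points, nearer the one for the smaller sample.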

Of the 31 comparisons shown in the three 
tables, two were significant at the .01 level or 
beyond, three were significant between the .01 
and .05 level, and four between the .05 and .10 
level. With 31 comparisons, less than three 
would be expected to be significant at the .05 
level or beyond by chance alone. 
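The chance expectation cited here is simply 31 times .05. A binomial tail gives a rough idea of how unusual five such results would be. The calculation treats the 31 comparisons as independent, which they are not (all are computed on the same 45 groups), so the probability is illustrative only. The function name `prob_at_least` is hypothetical.

```python
from math import comb

def prob_at_least(k, n, p):
    """Probability of k or more successes in n independent trials with success rate p."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))

n_comparisons = 31
alpha = 0.05

expected_by_chance = n_comparisons * alpha

# Five comparisons reached the .05 level or beyond (two at .01, three between .01 and .05)
p_five_or_more = prob_at_least(5, n_comparisons, alpha)
```

The expected number is 1.55, in line with the statement that fewer than three would be expected by chance. Under the independence assumption, five or more would arise by chance only about twice in a hundred such sets of comparisons.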

When we relate these differences to changes 
in the broad areas of functioning with which 
the study was primarily concerned, i.e., inter- 
personal relations, utilization of actors' and 
other group members’ resources in problem 
solution, and effectiveness of the group or in- 
dividual in achieving specified goals, the re- 
sults might be ordered as follows: 

Nature of interpersonal relations. Groups
under threat showed a greater concern with 
group acceptance and showed less autocratic 
and aggressive behavior. Other differences for 
variables included in this class, although nonsignificant,
are consistent with these results.
Threat groups were more sociable, more co- 
operative, showed less antagonism, were less 
competitive, less dominant, showed less ag- 
gression, showed a greater degree of informal 
friendliness, and were characterized by less 
conflict. A difference not consistent with the 
above is the greater amount of solidary be- 


* The index for each characteristic consisted of from three to
six items, with a mean of 4.1 items per index. Each item was rated
on a five-point scale and the item scores summed to give an index
score. Ratings by four Os were summed to give a group score on
each index.

† Positive values of F and t indicate threat > control; negative
values indicate control > threat.

‡ Significant at .05 level or beyond.

§ Significant at .01 level or beyond.


havior shown by control groups. Thus, for the 
12 variables that reflected differences in the 
nature of interpersonal relations, 11 are con- 
sistent, although only three of the differences 
were significant. 

It might be inferred that these differences 
indicate that threat results in attempts to 
maintain interpersonal tensions at a low level. 
These attempts are characterized by both a 
reduction in aggressive, antagonistic, com- 
petitive behaviors, and an increased striving 
for group acceptance. 

Utilization of resources in problem solution.
Groups under threat were less achievement 
motivated, showed less leadership behavior, 
gave fewer suggestions, and were less active 
than control groups. Threat groups asked for 
orientation from each other more than control 
groups. As before, other variables reflecting 
differences in this area of functioning, although 
nonsignificant, seem consistent with the above. 
Control groups were more goal oriented, 
showed a greater degree of equal participation 
in task solution, gave more suggestions and 
opinions, and asked less for other group mem- 
bers’ opinions. 

These differences would appear to indicate 




TABLE 7
SIGNIFICANCE OF DIFFERENCES BETWEEN THREAT AND CONTROL CONDITIONS
IN SUBJECTS’ RESPONSE TO AN ADJECTIVE CHECK LIST*

                               THREAT (N = 36)     CONTROL (N = 9)
INDEX                            M        σ²         M        σ²        F†       t†
I      Anxiety                  6.97    15.11       7.11    12.86      1.18     <1
IIa    Elated                   8.00    14.40       9.33    25.00      1.74     <1
IIb    Depressed                1.42     5.62        .67     1.75      3.21     1.27
III    Hostile aggression       3.17     9.91       4.00     7.75      1.28     <1
IV     Organic symptoms         2.11     4.04       1.00     1.00      4.04‡    2.36‡
V      Relaxation               8.95    10.91       9.78    20.44      1.87     <1
VI     Positive aggression     21.11    40.96      23.00    35.75      1.15     <1
VIIa   Cooperative             18.72    29.06      19.44    36.78      1.27     <1
VIIb   Obstructive              6.75    10.31       6.56    10.53      1.02     <1
VIIIa  Dominant                13.25    27.85      14.00    20.25      1.38     <1
VIIIb  Submissive               5.47    10.31       5.00     2.75      3.75     <1
IX     Task involved           20.92    62.08      21.44    52.28      1.19     <1

* There were 114 adjectives, which defined nine indices. An index
score was obtained by summing the number of adjectives
checked for the particular index. Index scores for the three group
members were summed to give a group score.

† Positive values of F and t indicate threat > control; negative
values indicate control > threat.

‡ Significant at .05 level or beyond.


TABLE 8
SIGNIFICANCE OF DIFFERENCES BETWEEN THREAT AND CONTROL CONDITIONS
IN SUBJECTS’ RATINGS OF THEIR GROUPS*

                                       THREAT (N = 36)     CONTROL (N = 9)
CHARACTERISTIC RATED                     M        σ²         M        σ²        F†       t†
1. Goal orientedness                   75.03    19.74      74.44    17.03      1.16     <1
2. Productivity                        10.53     1.68      10.33     1.00      1.68     <1
3. Degree of equal participation       34.44    23.63      35.11    25.61     -1.08     <1
4. Degree of personality conflict      23.06    23.37      24.00    31.75     -1.36     <1
5. Informal friendliness               52.03    19.91      54.89    10.36      1.92     2.20‡
6. Communication effectiveness         47.36    29.61      49.11    17.86      1.66     1.05
7. Competition between members         26.67    22.97      25.00    12.75      1.80     1.16
8. Cohesiveness                        63.14    30.47      63.33    30.00      1.00     <1


that threat groups are less task oriented and are 
less forceful, initiating, and active in their 
attack on the problem than control groups. 

Effectiveness of group performance. Groups 
under threat were less efficient and adaptable 
than control groups. Supportive, though non- 
significant, is the greater productivity and 
communication effectiveness of control groups. 

An inspection of the tables indicates another 
finding of considerable interest. Threat groups 
showed significantly greater variability than 
control groups in efficiency, productivity, com- 
munication effectiveness, adaptability, leader- 
ship behaviors, and striving for group accept- 
ance. Although F tests for other variables 
yielded nonsignificant differences in variance, 
once again there was a consistency that appears 
important to consider. For 24 of the 31 vari- 
ables, threat groups showed greater variability 
than control groups. Of the seven variables on 
which control groups showed greater vari- 
ability, five are concerned with interpersonal 
relations. It would appear that threat results 
in considerable variability in task-oriented be- 
haviors and large variability in group effective- 
ness, but also in consistency of behavior in the 
interpersonal sphere.
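The two consistency counts reported above (11 of 12 interpersonal variables in the same direction, and greater threat-group variability on 24 of 31 variables) invite a sign-test reading. The sketch below computes the one-tailed binomial probabilities against an even split. It assumes the variables are independent, which they are not, since all are measured on the same groups and many overlap in content, so the results should be taken as suggestive at most. The function name `sign_test_tail` is hypothetical.

```python
from math import comb

def sign_test_tail(k, n):
    """One-tailed probability of k or more agreements in n if each direction is equally likely."""
    return sum(comb(n, i) for i in range(k, n + 1)) / 2**n

p_direction = sign_test_tail(11, 12)     # interpersonal variables consistent in direction
p_variability = sign_test_tail(24, 31)   # variables with greater variance under threat
```

Both probabilities fall well below .01 under the stated assumption, which is the sense in which the pattern of nonsignificant differences "appears important to consider."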

The data presented above are concerned 
with overt behavioral consequences of exposure 
to threat. Data concerned with the phenomenal 


TABLE 8 (continued)

9. Satisfaction with session           46.53    18.66      49.33    30.25     -1.62     1.42

* Index scores similar to those obtained from observer ratings
were obtained (see Table 6) and summed over the three group
members to give a group index score.

† Positive values of F and t indicate threat > control; negative
values indicate control > threat.

‡ Significant at .05 level or beyond.


state of Ss and their reactions to their groups 
are presented in Tables 7 and 8. Of the 12 
comparisons of adjective check list responses, 
only one was significant at the .05 level or 
beyond. Threat groups checked more ad- 
jectives describing organic symptoms than 
control groups. Although nonsignificant, it is 
interesting to note that threat groups describe 
themselves as less elated, more depressed, and 
less relaxed than control groups. These differ- 
ences would tend to confirm our unsystematic 
observations that threat, as operationally de- 
fined, did affect the emotional and/or motiva- 
tional state of Ss. 

Though nonsignificant, differences for other 
variables on the adjective check list appear 
congruent with the differences noted in overt 
behavior. Threat groups described themselves 
as less hostile, less positively aggressive, less 
dominant, and less task involved, as well as 
more submissive. Since the adjectives were 
checked after the session, these differences 
might simply indicate that Ss use the same be- 
havioral cues as Os in evaluating their motiva- 
tional state; or, they might be considered as
reflecting emotional concomitants of the be- 
havior changes. Until more is known concern- 
ing the determinants of Ss' responses to various 
self-description questionnaires, there is little
basis for selecting between the alternative
hypotheses. 

Of the nine indices concerned with group 
characteristics only one achieved significance 
at the .05 level or beyond. Control Ss rated 
their groups as being more friendly than did threat
Ss. Other differences were small and incon- 
sistent. It should be noted that this instrument 
appears inadequate because of the extremely 
small variance obtained as a result of the re- 
stricted range of the response scale used by Ss. 


DISCUSSION


The failure to obtain significant results for 
locus and target can be interpreted in several 
ways. It might be a reflection of a “real” lack 
of effect of such variables but, in view of the 
apparent significance of such factors in non- 
laboratory situations, it is more likely a function 
of the design.⁷ The necessity for imposing an
external threat in order to obtain Ss might 
have precluded effective manipulation of locus 
and target by a simple variation in instructions 
at the time of evaluation. It is also possible that 
the true difference to be expected is small, and 
a more sensitive design, one in which inter- 
group variation is reduced, would have re- 
sulted in significant differences. 

The results obtained for threat versus con- 
trol conditions might be broadly summarized 
as follows: 

1. Threat appears to result in a reduction of 
forceful, assertive, aggressive, interpersonal, as 
well as task-directed behavior, and in a 
greater concern with group acceptance. These 
changes might be considered to reflect a ten- 
dency to keep tensions within the group at a 
low level even at the expense of effective task 
activity. 

These results are similar to those obtained 
by Lanzetta (7), who used a situationally pro- 
duced stress of a quite different nature than the 
one utilized in this study, and two dissimilar 
tasks. Analogous results were also obtained by 
Wright (20) in a study of children's social rela- 
tions under frustration. 

2. Threat results in poorer group effectiveness
and greater variability in effectiveness.
This result is similar to those obtained in


7 Two more sensitive designs, although for our purposes
they appeared experimentally unfeasible, would
be a repeat measure or matched group design since 
they would tend to reduce the variance attributable to 
group differences. 




studies of individual performance under stress. 
In their recent review of the literature, Lazarus, 
Deese, and Osler (8) note that the majority of 
studies have shown impairment of verbal and 
perceptual motor performance under stress, 
and several have found greater variability 
between subjects as a major effect. 

Attempts to fit these results into the frame- 
work of a contemporary theory are probably 
premature at this time, but some tentative 
comments can be offered concerning the pos- 
sible direction such an assimilation might take. 

It might be postulated, first, that increase in 
anxiety is the primary concomitant of threat, 
and that, further, anxiety will accentuate security
operations in an attempt to terminate
or diminish the anxiety. The latter postulate
has frequently been stated by learning and
personality theorists (12, 14). The nature of 
the security operations is a function of many 
aspects of the total stimulus situation other 
than the cues which, themselves, evoke the 
anxiety. We would postulate that one of the 
most important aspects is the extent to which 
potential support is present; more specifically, 
support of other individuals or a group. If such 
potential support is perceived, then it seems 
plausible that an individual will behave in a 
fashion that will be most likely to assure this 
support. He will inhibit behaviors that are 
likely to alienate him from, or cause isolation 
by, other group members. Thus, we can expect 
an inhibition of aggressive, self-oriented be- 
havior, which has probably been negatively 
reinforced, and an increased striving for ac- 
ceptance. 

The assumption of anxiety as a drive that 
will primarily elicit responses that tend to re- 
duce the drive can also be used to explain the 
obtained decrease in effectiveness of the group. 
In the particular task utilized, rapid decisions 
by the group were required for effective action. 
When group consensus is required, rapid de- 
cision is sometimes best achieved by forceful, 
initiating behavior by individual group mem- 
bers. Such responses would be incompatible 
with the responses evoked by the anxiety drive, 
and thus, if the anxiety drive is strong, would 
be inhibited. Since ineffective task solution 
would not reduce the drive for task solution, a 
complete inhibition of task responses would 
not be expected. Rather, there would occur 
alternation of such responses with the anxiety- 
produced responses, owing to an oscillation in
the relative strength of the anxiety and task-
oriented drives. Since the anxiety drive is as- 
sumed to be relatively weak under nonthreat 
conditions, response interference would not be 
expected; thus, effectiveness should be greater. 

The greater variability in effectiveness under 
threat may be due to the differential strength 
of the evoked anxiety drive; or, more probably, 
it may reflect variation in the kind of responses 
that have been reinforced by anxiety reduction. 
Rosenzweig’s (16) distinction between need- 
persistive and ego-defensive reactions to frus- 
tration comes to mind here. If need-persistive 
(goal-oriented) responses are high in the habit 
hierarchy, rather than ego-defensive (pro- 
tective) reactions, increased anxiety might 
actually lead to more effective performance. 

Recently, Waterhouse and Child have 
postulated an essentially similar formulation 
with respect to frustration. Their proposition 
is: “Frustration will produce a decrease in the 
quality of on-going performance, to the extent 
that the frustration evokes other responses 
which interfere with that on-going perform- 
ance” (19, p. 298). Substitution of the concept 
“anxiety” for frustration and additional 
postulates about the specific nature of the 
competing responses would allow a posteriori 
deduction of the obtained results from their 
proposition. Since they have recently presented 
experimental support for this proposition, it 
might tentatively be assumed to have general- 
ity for all anxiety-evoking situations. 


SUMMARY AND CONCLUSIONS 


Thirty-six threat and nine control (non- 
threat) groups of three members each worked 
in a laboratory setting on a task called “inter- 
cept.” Threat was imposed by structuring the 
sessions as evaluation tests, the results of which 
would become part of Ss’ ROTC records. The 
36 threat groups were further randomly di- 
vided into four classes of nine groups each, 
each class being assigned to a different cell of a 
2 X 2 factorial. The variables of main con- 
cern were locus of threat (whether external or 
internal to the group) and target of threat 
(whether threat was directed to the group as a 
whole or to the individual group member). 

Records of the ongoing behavior of group 
members, trait ratings of each member, ratings 
of group characteristics, and Ss’ perception of 
their group and their own motivational state
were obtained. Analysis of the results was
primarily by analysis of variance and Stu-
dent's t techniques.

Locus and target did not prove to be signifi- 
cant variables, very likely because of the in- 
sensitivity of the design. All threat groups were 
thus combined for comparison with control 
groups. Threat affected behavior in three 
main areas of functioning. 

1. Interpersonal relations: Threat groups 
showed less behavior that would result in 
tension and friction within the group and 
more behavior oriented toward group accept- 
ance. 

2. Utilization of actors’ and other group
members’ resources: Threat groups were less 
task oriented and were less forceful, assertive, 
and active in their attack on the task problem. 
They also showed greater variability in such 
behavior. 

3. Effectiveness of the group: Threat groups 
were both less efficient and adaptable and more 
variable in effectiveness. 




An attempt was made to interpret these re- 
sults within the framework of a competing 
response theory of anxiety. 

REFERENCES 


1. Bales, R. F. Interaction process analysis. Cambridge, Mass.: Addison-Wesley, 1950.

2. Carter, L., Haythorn, W., Shriver, Beatrice M., & Lanzetta, J. The behavior of leaders and other group members. J. abnorm. soc. Psychol., 1951, 48, 589-595.

3. Carter, L., Haythorn, W., Meirowitz, Beatrice, & Lanzetta, J. The relation of categorizations and ratings in the observation of group behavior. Hum. Relat., 1951, 4, 239-284.

4. Edwards, A. L. Experimental design in psychological research. New York: Rinehart, 1950.

5. Estes, W. K. An experimental study of punishment. Psychol. Monogr., 1944, 67, No. 3 (Whole No. 263).

6. Kogan, L. S., & Hunt, J. McV. Problems of multijudge reliability. J. clin. Psychol., 1950, 6, 16-19.

7. Lanzetta, J. Group behavior under stress. Hum. Relat., in press.

8. Lazarus, R. S., Deese, J., & Osler, Sonia F. The effects of psychological stress upon performance. Psychol. Bull., 1952, 49, 293-317.

9. Lindsley, D. B., & Others. A study of performance under speed stress. (OSRD, 1945; Publ. Bd., No. 18381.) Washington, D. C.: U. S. Dep. Commerce, 1946.

10. McClelland, D. C., & Apicella, F. S. A functional classification of verbal reactions to experimentally induced failure. J. abnorm. soc. Psychol., 1945, 40, 376-390.

11. McNemar, Q. Psychological statistics. New York: Wiley, 1949.

12. May, R. The meaning of anxiety. New York: Ronald, 1950.

13. Miller, N. E. Learnable drives and rewards. In S. S. Stevens (Ed.), Handbook of experimental psychology. New York: Wiley, 1951. Pp. 435-472.

14. Miller, N. E., & Dollard, J. Social learning and imitation. New Haven: Yale Univer. Press, 1941.

15. Mowrer, O. H. Learning theory and personality dynamics. New York: Ronald, 1950.

16. Rosenzweig, S. An outline of frustration theory. In J. McV. Hunt (Ed.), Personality and the behavior disorders. Vol. 1. New York: Ronald, 1944. Pp. 379-388.

17. Shriver, E. L., & Shriver, Beatrice M. A demonstration of a modified analysis of variance used to obtain comparable reliability measures of observers, groups, and individual subjects in group dynamics data. Paper read at Eastern Psychol. Ass., Boston, April 1953.

18. Taylor, Janet A., & Spence, K. W. The relationship of anxiety level to performance in serial learning. J. exp. Psychol., 1952, 44, 61-64.

19. Waterhouse, I. K., & Child, I. L. Frustration and the quality of performance: III. An experimental study. J. Pers., 1953, 21, 298-311.

20. Wright, M. E. The influence of frustration upon the social relations of young children. Charact. & Pers., 1943, 12, 111-122.


Received April 3, 1954. Prior publication. 


A STUDY IN LANGUAGE AND COGNITION¹


ROGER W. BROWN 
Harvard University 


AND ERIC H. LENNEBERG 


Center for International Studies, 
Massachusetts Institute of Technology 


It is popularly believed that reality is pres-
ent in much the same form to all men of
sound mind. There are objects like a house
or a cat and qualities like red or wet and events
like eating or singing and relationships like
near to or between. Languages are itemized
inventories of this reality. They differ, of
course, in the sounds they employ, but the in-
ventory is always the same. The esthetic predi-
lections of the Italian lead him to prefer
euphonious vowels, while the German is ad-
dicted to harsh consonant groupings, but the
things and events named are the same in both
tongues. We are confirmed in this view by our
linguistic education, which requires us to mem-
orize lists of French or German or Latin words
and their exact English equivalents.

There are, of course, poetic persons who
claim to find in each language some special
genius that peculiarly fits it for the expression
of certain ideas. But the majority of us are at
a loss to understand how this can be, since
there is apparently a relationship of mutual
translatability among the languages we learn.
To be sure, we can see that one language
might contain a few items more than another.
If the Germans were to invent a new kind of
automobile and we had not yet thought of such
a machine, their dictionary would have one
entry more than ours until we heard of the dis-
covery and named it for ourselves. But these
inequalities are in the lexical fringe. They do
not disturb the great core of common inven-
tory.


THE WHORF THESIS


This linguistic ethnocentrism will be seri-
ously disturbed by the study of languages that
lie outside the Indo-European group. It has
not prepared us for finding that there is a lan-
guage in which noun and verb categories ap-
parently do not exist, or that there is another
in which the colors gray and brown are called
by the same name. Such data from the study
of American Indian tongues led Whorf (18) to
reject the usual view of the relationship be-
tween language and thought. He suggested
that each language embodies and perpetuates
a particular world view. The speakers of a lan-
guage are partners to an agreement to see and
think of the world in a certain way—not the
only possible way. The world can be structured
in many ways, and the language we learn as
children directs the formation of our particular
structure. Language is not a cloak following
the contours of thought. Languages are molds
into which infant minds are poured. Whorf thus
departs from the common sense view in (a)
holding that the world is differently experi-
enced and conceived in different linguistic
communities and (b) suggesting that language
is causally related to these psychological differ-
ences.²

Other authors have believed that the rela-
tionship between language and thought is
somewhat as proposed by Whorf. Cassirer (2),
the distinguished philosopher, maintained that
language is the direct manifestation of knowl-
edge; he explicitly denied a form-content rela-
tionship between words or language structure
and isolates of knowledge. In this he was in
agreement with such other German writers as
Wundt (19) and Bühler (1). Orwell (15) in his
novel Nineteen Eighty-Four describes a totali-
tarian England of the future. The really effi-
cient dictatorship of that day invents a lan-
guage—Newspeak—in which it is impossible
not only to express, but even to think, a rebel-
lious thought. An equally great faith in the
causal efficacy of language lies behind the
General Semantics movement. Korzybski (8),
for instance, holds that clear thinking and so-
cial progress are to be attained through the
reform of language.


¹ This project was carried out under the joint
auspices of the Communications Program of the Center
for International Studies, Massachusetts Institute of
Technology, and the Laboratory of Social Relations,
Harvard University.

² While this seems a fair statement of Whorf's usual
views, he occasionally took a somewhat more conserva-
tive position.


Cognitive Differences between Linguistic Communities

The first tenet of the Whorf thesis is that the 
world is differently experienced and conceived 
in different linguistic communities. The evi- 
dence presented in support of this claim is 
entirely linguistic. It will be helpful to distin-
guish between the conclusions based on lexical
features of two languages and those based on
structural features.

Lexical features. In the Eskimo lexicon there 
are three words to distinguish three varieties of 
snow. There are no single-word equivalents for 
these in English. The word “snow” would be 
used to describe all three. What psychological 
conclusions can be drawn from these data? 
Does the Eskimo see differences and similari- 
ties that we are unable to see? 

Psychologists ordinarily infer perceptual dis- 
crimination when a subject is consistently able 
to respond differently to distinctive stimulus 
situations. The subject may be rat, dog, or 
man. The response may be running, salivation, 
or—speech. Words are used meaningfully when 
they are selectively employed with reference to 
some kind of environment—whether physical, 
social, or linguistic. The linguist in the field 
may discover the referent of a term by noting 
the pattern of its usage. The Eskimo’s three 
“snows” are sufficient evidence from which to 
infer that he discriminates three varieties of 
snow. These selective verbal responses satisfy 
the conditions for inferring perceptual discrimi- 
nation. 

What can be said of the English speaker’s 
ability to distinguish the same three kinds of 
snow? When different stimuli do not elicit 
differential responses, the stimuli may or may 
not be discriminated. A subject may be per- 
fectly able to distinguish two situations and 
still not care to do anything about it. Conse- 
quently the fact that English speakers do not 
have different names for several kinds of snow 
cannot be taken to mean that they are unable 
to see the differences. It would seem, then, that
all such comparisons are psychologically in- 
conclusive. The Eskimo and American may or 
may not see the world differently. 

There is, however, other evidence to indicate
that the speaker of English can classify snows
as the Eskimo does. If we listen to the talk of
small boys, it is clear that they perceive at 
least two kinds of snow—good-packing snow 
and bad-packing snow. This is a distinction of 
the greatest importance to anyone interested 
in making snowballs. This discrimination is 
evidenced by differential response—not dis- 
tinct lexical items but combinations of items— 
“good-packing snow” and “bad-packing 
snow.” Whorf himself must have been able to 
see snow as the Eskimos did since his article 
describes and pictures the referents for the 
words. Since both Eskimo and American are 
able to make differential responses to snows, 
we must conclude that both are able to see 
differences. This seems to lead us to the conclu- 
sion that the Eskimo and American world 
views do not differ in this regard. 

Although the three kinds of snow are nam- 
able in both Eskimo and English, each of them 
requires a phrase in ordinary English, whereas 
single words will do it for the Eskimo. Zipf (20) 
has shown that there exists a tendency in Peip- 
ing Chinese, Plautine Latin, and American and 
British English for the length of a word to be 
negatively correlated with its frequency of 
usage. This is true whether word length is 
measured in phonemes or syllables. It is not 
difficult to find examples of this relationship in 
English. New inventions are usually given long 
names of Greek or Latin derivation, but as the 
products become widely known and frequently 
used in conversation the linguistic community 
finds shorter tags for them. Thus the “automo-
bile” becomes the “car” and “television”
shrinks first to “video” and eventually to
“TV.” Three-dimensional movies are predicta-
bly described as “3-D.”

Doob (3) has suggested that this principle 
bears on Whorf's thesis. Suppose we generalize 
the findings even beyond Zipf’s formulation 
and propose that the length of a verbal expres- 
sion provides an index of its frequency in 
speech and that this, in turn, is an index of the 
frequency with which the relevant perceptual 
judgments of difference and equivalence are 
made. If this is true, it would follow that the 
Eskimo distinguishes his three kinds of snow 
more often than Americans do. It would mean 
—to cite another example—that the Hopi is 
less often called upon to distinguish airplanes, 
aviators, and butterflies than is the American, 
since the Hopi has but a single name for all
three of these. Such conclusions are, of course,
supported by extralinguistic cultural analysis, 
which reveals the importance of snow in the 
Eskimo's life and the comparative indifference 
of the Hopi to airplanes and aviators. 

We will go further and propose that increased 
frequency of a perceptual categorization will 
mean a generally greater “availability” of that 
category. In the experimental study of memory 
we are accustomed to think of the methods of 
recall, recognition, and relearning as increas- 
ingly sensitive indices of retention. In the 
experimental study of categorizing behavior 
there are two principal methods: (a) Gold- 
stein’s (6) technique of presenting a subject 
with an array of objects and asking him to 
group them, and (b) Hull's (7) discrimination
learning technique. Hull's method seems to be
the more sensitive of the two. We should guess 
that when the Eskimo steps from his igloo in 
the morning and is confronted by a snowy 
world, these snows will fall into named cate- 
gories for him in a way that they will not for 
the American. If, however, the American were 
subjected to a discrimination learning experi- 
ment, or if the perceptual structure were other- 
wise made worth his while, he could see snow 
as does the Eskimo. We think, really, that more 
namable categories are nearer the top of the 
cognitive “deck.” 

Structural features. Members of structural 
categories have no phonetic common denomi- 
nator. They are grouped together because they 
have the same structural relations with other 
forms in the language. In English, nouns con- 
stitute a structural category; its members can 
appear with definite and indefinite articles, can 
form plurals in certain ways, etc. In French all 
nouns of the feminine gender belong to one 
structural category since they all require the 
feminine articles and suffixes. 

Whorf generally assumes that structural 
categories are also symbolic categories. When 
he finds structural differences in languages he 
concludes that there are parallel cognitive 
differences. There are in Hopi two structural 
categories showing some similarity to our verb 
and noun categories, with the difference that
one of the Hopi classes includes only the names
for such short-term events as lightning, flame,
and spasm, while the other includes only such
long-term events as man, house, and lifetime. 
Whorf concludes that the Hopi organizes his
world on a dimension we usually overlook.
When the structural class has such obvious 
semantic properties, Whorf’s conclusions have 
a kind of plausibility. 

However, very few structural classes have 
such clear and consistent meanings. In the lan- 
guages we know best, those of the Indo-Euro- 
pean family, there are many structural cate- 
gories with no discernible meaning. In French, 
for instance, it is not clear that the gender of a 
form signifies anything to a speaker. Certainly 
it is difficult to find any common attributes in 
the references for French nouns of feminine 
gender. Not even the majority of them mani- 
fest feminine sexuality—even in the extended 
sense of Freud. The French speak of “le bal- 
con” in spite of their saying, “Elle a du bal- 
con.” The linguist Charles Fries (5) has shown 
how difficult it is to describe a semantic for the 
English “parts of speech.” If the noun can be 
defined as “the name of a person, place, or 
thing,” this is only because “thing” is left un- 
explicated. It serves handily to designate 
whatever is nominalized and yet neither person 
nor place. 

Even where the ethnolinguist can discover 
consistent structural meanings, it does not 
follow that these meanings are present to the 
native speakers of a language. Suppose that a 
subject in the laboratory were required to sig- 
nal his recognition of each of ten different 
musical chords by raising that one of his ten 
fingers which has been designated for each
chord. If all extraneous sensory information
were excluded, his ability to pattern correctly 
the movements of his fingers would be evidence 
of his ability to identify the chords. The ex- 
perimenter might introduce a potential struc- 
tural meaning by ruling that the fingers of the 
right hand would always be raised for chords 
in the major mode and the fingers of the left 
hand for minor chords. The subject’s responses 
might follow this pattern and yet he need never 
have detected the major and minor modes. 
Similarly, even if there were some semantic to 
French gender, one could speak the language 
without detecting it. “La fille” and “la femme” 
could be learned without noticing that both 
are in the feminine mode. No safe inferences 
about cognition can be made on the basis of 
the simple existence of the structural classes
described by Whorf. The structural evidence is
extremely difficult to interpret, and it seems
clear that psychological data are needed to
supplement those of the linguist.

Language in Causal Relation to Cognition

The second major tenet of Whorf’s thesis is 
that language causes a particular cognitive 
structure. In what way can this occur? There 
seem to be two possibilities. Suppose that the 
colors red and green are not “given” categories 
but must be learned. A father who has formed 
these categories may play a game with his child 
that will teach the categories. The green blocks 
are to be used for building a house and the red 
ones for a barn. The child cannot properly 
pattern the blocks without learning to make 
the visual distinctions involved. Notice that 
the barn and house are not essential here. A 
father could ask his child to tell him whether 
each block is red or green. In learning this 
game, too, the child necessarily would learn to 
perceive the colors. Because words have sym- 
bolic properties, because their usage is pat- 
terned with reference to the total environment, 
language can cause a cognitive structure. To 
the degree that children are motivated to speak 
a language as it is spoken in their community 
they are motivated to share the world view of
that community. To be sure, linguistic training 
is not the only means of procuring cognitive 
socialization; the house-barn game demon- 
strates that. The word game has the tremen- 
dous advantage that it can be played con- 
stantly and concurrently with many other 
activities. The child and his adult tutor can 
chatter together whether they are walking or 
riding, playing or working. In this chatter more 
is taught than a simple motor skill involving 
the muscles of articulation. A total culture is 
internalized. 

There is a second, more dubious, avenue for 
the influence of language on thought. If life is 
a river, speech is a babbling brook whose course 
parallels that of the river. The brook is smaller 
and simpler than the river. A child can learn 
the phonemic structure of his language fairly 
easily. He will also realize that as the phonemic 
patterns he hears spoken change there are im-
portant changes in the nonlinguistic world.
There is, for instance, an important difference 
that goes with the shift of speech from “father” 
to “mother.” When, on the other hand, com- 
binations of phonemes are repeated, two situa- 
tions are equivalent in some important way. 




Consider the “strike” and the “ball” in base-
ball. These are rather difficult categories. The
differences between them are subtle and com- 
plex. A naive observer of a baseball game 
would have a difficult time learning these cate- 
gories by simply observing the game. It makes 
a great difference that the umpire calls out 
“strike!” each time a member of that category 
occurs and “ball!” to identify an instance of 
the other category. The umpire’s shout directs 
us to look here and now to discover something 
of importance. The word spotlights a moment 
of consciousness and puts it in connection with 
other events similarly spotlighted. The various 
“strikes” are equivalent in some way and dis- 
tinct as a category from the events labelled 
“ball.” The babbling brook can, then, be a 
guide to the structure of the more complex but 
also more interesting river. 

All of our reasoning cannot be said to prove 
the validity of any set of psychological conclu- 
sions. It does, however, point the direction for 
such a proof and suggests empirical steps that 
will advance our knowledge of this problem. 
We have made a small beginning in this work. 

Our findings bear on only one of the claims 
made by Whorf—that there are cognitive 
differences correlated with lexical differences. 
We have developed lexical differences into the 
variable of “codability” and attempted to 
spell out the relationship between this variable 
and a single cognitive performance—recogni- 
tion. 


THE EXPERIMENT


Sensory psychologists have described the 
world of color with a solid using three psycho- 
logical dimensions: hue, brightness, and satura- 
tion. The color solid is divisible into millions of 
just noticeable differences; Science of Color (14) 
estimates 7,500,000. The largest collection (4, 
11) of English color names runs to less than 
4,000 entries, and of these only about 8 occur 
very commonly (17). Evidently there is con- 
siderable categorization of colors. It seems 
likely to us that all human beings with normal 
vision will be able to make approximately the 
same set of discriminations. This ability ap- 
pears to depend on the visual system, which is 
standard equipment for the species. Whatever 
individual differences do exist are probably not
related to culture, linguistic or extralinguistic. 
It does not follow that people everywhere
either see or think of the color world in the
same way. Cultural differences probably op- 
erate on the level of categorization rather than 
controlled laboratory discrimination. 

Our explorations in the Yale Cross-Cultural 
Index turned up many reports of differences 
on this level. Seroshevskii (16), for instance, 
has reported that in the Iakuti language there 
is a single word for both green and blue. This is
the kind of language difference discussed in the 
first section of this paper. A region of experi- 
ence is lexically differentiated in one culture 
but undifferentiated in another. Color cate- 
gories differ from such categories as snows in 
that they have boundaries that can be plotted 
on known dimensions. Color categories, fur- 
thermore, are continuous with one another, 
sharing their boundaries. Consider for a mo- 
ment the single dimension of hue taken at a 
high level of saturation and brightness. Native 
speakers of English could be shown various 
shades and asked to give the usual color name 
for each stimulus presented. For each common 
color name there would be some shades in- 
variably called by that name. There would be 
other shades sometimes associated with one 
name, sometimes with another. When the re- 
sponses are divided about equally between two 
or more names, we should have boundaries 
between categories. If a native speaker of 
Iakuti were asked to provide the usual color
names for the various shades, we should antici- 
pate a somewhat different pattern. English 
speakers would have trouble naming the hues 
in the boundary region between green and blue. 
Probably they would hesitate, disagree among 
themselves, and sometimes use phrases or such 
combination names as "greenish blue." For 
the Iakuti, on the other hand, this region is 
right in the center of a category and would be 
named with great ease. 
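
The naming procedure described above can be sketched in code. This is only a minimal illustration and not part of the authors' procedure: the function name `classify_shade`, the three labels, and the `tolerance` cutoff used to decide when responses are "divided about equally" are assumptions introduced here.

```python
from collections import Counter


def classify_shade(responses, tolerance=0.15):
    """Label one shade from the color names that subjects gave it.

    responses: list of names, one per subject (hypothetical input).
    tolerance: assumed cutoff for "divided about equally"; the shares
        of the two leading names may differ by at most this much.
    """
    if not responses:
        raise ValueError("at least one naming response is required")
    counts = Counter(responses).most_common()
    top_name, top_n = counts[0]
    if len(counts) == 1:
        # Every subject used the same name for this shade.
        return ("invariable", top_name)
    second_name, second_n = counts[1]
    if (top_n - second_n) / len(responses) <= tolerance:
        # The two leading names split the responses about evenly.
        return ("boundary", tuple(sorted((top_name, second_name))))
    # One name dominates, but usage is not unanimous.
    return ("mixed", top_name)
```

Under these assumptions, a shade that English speakers name "green" and "blue" about equally often would be labeled a boundary, whereas the same shade named with a single term by every speaker of another language would be labeled invariable.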

Of course, our example is greatly simplified 
over the actual case since we have dealt with 
the single dimension of hue whereas the color 
lexicon is actually patterned with respect to all 
of the three dimensions of visual experience. 
When these are considered, the range of ap- 
plicability of a color term is a space within the 
color solid rather than a distance along a line. 
The simplification was for expository purposes 
and does not alter the logic of the argument. 

This example of a cultural difference serves 
to introduce the variable codability. Certain
colors are differentially codable in the Iakuti
and English languages. So long as the data col- 
lected are of the usual linguistic variety, this 
difference of codability will be manifest in only 
one way—environmental distinctions expressed 
lexically in one language are expressed with 
word combinations in another language. Our 
reasoning led us to expect differential avail- 
ability of reference categories in such a case. 
We undertook experimental work to discover 
additional behavioral indices of codability, and 
hoped to find one more sensitive than that 
which can be teased out of linguistic data. If 
we found such an index, we would go on to 
explore the behavioral consequences of differ- 
ential availability of cognitive categories. 

There are differences of codability within 
English itself. Some shades fall safely within 
the province of a given name while others lie 
within boundary regions. Here it is a matter 
of comparing the English codability of one 
region of visual experience with another region, 
whereas the ethnolinguist has usually com- 
pared the codability of one region of experience 
in several languages. If we explore the codabil- 
ity variable in English, it seems likely that our 
discoveries will apply to the cultural differences 
with which the inquiry began. If a general law 
can be found relating codability to availability, 
individual cultures may conform to the law 
though they differ among themselves in the 
values these variables assume in particular 
regions of experience. 


Measurement of Codability 


The entire series of Munsell colors for the highest 
level of saturation (“chroma” as Munsell calls it) was
mounted on cards in systematic fashion. Five judges 
were asked to pick out the best red, orange, yellow, 
green, blue, purple, pink, and brown from these 240 
colors. These names are the most frequently appearing 
color terms in English (17). For each name the color 
chip most often selected was added to our test list. 
Agreement among judges was high, and it is quite clear, 
therefore, that there is in this series one particular color 
chip with the best claim to each color name. The number 
of colors was then raised to 24 by adding chips that 
would, in combination with the first 8, provide as even 
a coverage of the color space as practicable. These 
colors are specified in Table 1. One set of the 24 chips 
was mounted on white 3 X 5 cards, one chip to a card. 
Another set was arranged randomly on a single large 
card. 

To expose the single small cards a drop shutter was
mounted in a 3 X 2-foot gray (Munsell neutral value 6,
reflectance 30 per cent) board. The board was about
three feet from the subject’s (S’s) eyes and was
illuminated from above and behind by a General Elec-
tric standard daylight fluorescent lamp.


TABLE 1
THE MUNSELL NOTATION AND SCORES FOR DISCRIMINABILITY,
CODABILITY, AND RECOGNITION FOR THE 24 TEST COLORS

                                                      RECOGNITION
                                                      (GROUP C,
MUNSELL          DISCRIMINABILITY   CODABILITY        TABLE 3)
NOTATION*        SCORE    RANK      SCORE    RANK     SCORE    RANK

2.5R 7/8         38       2         18       9.5      .875     8
2.5R 5/10        21.5     6         7        18.5     .694     11
5R 4/14          23       10.5      19       7.5      1.020    5
7.5R 8/4         18       15        7        18.5     .236     18
2.5YR 6/14       38       2         29       1.5      1.499    2
5YR 3/4          24       9         26       3        .972     7
7.5YR 5/8        26       7.5       8        16       136      9
2.5Y 7/10        12       19        3        24       .486     13
5Y 8/12          37       4         25       4        2.450    1
7.5Y 6/8         13       17        4        23       .250     17
3GY 7.5/11.2     23       10.5      14       12       1.222    4
7.5GY 3/4        9.5      23        14       12       0.000    23.5
2.5G 5/8         18.5     14        23       6        .986     6
7.5G 8/4         17.5     16        19       7.5      .167     19
5BG 3/6          4.5      24        12       15       MI       22
10BG 6/6         21       12        7        18.5     .458     14
8.5B 3/6.8       38       2         13       14       0.000    23.5
2.5PB 7/6        19       13        18       9.5      .436     16
5PB 4/10         10.5     21        29       1.5      .695     10
10PB 5/10        12       19        7        18.5     .125     20.5
5P 8/4           12       19        14       12       7        12
10P 3/10         10       22        24       5        44       15
5RP 6/10         26       7.5       6        21.5     .125     20.5
BRP 3.4/12.1     31       5         6        21.5     1.464    3

* For conversion to C.I.E. Tristimulus values and Source C,
C.I.E. chromaticity coordinates see Nickerson, Tomaszewski, and
Boyd (13).


The Ss were 24 Harvard and Radcliffe students who 
spoke English as a native language and had no particu-
lar training in distinguishing colors. They were screened
for color blindness with the standard Pseudo-Isochromatic
Plates.

The Ss were first shown the 24-color random chart 
for about five minutes. After the chart was removed, 
they were told that each of the colors on the chart 
would appear individually in the tachistoscope and that 
S's task was to give the name of each as it appeared. 
"Name" was defined as the word or words one would 
ordinarily use to describe the color to a friend. The 
Ss were urged to be both quick and accurate.

The 24 colors were presented in a predetermined 
random order for each S. No order was repeated. Each 
color was exposed until S had named it. In our trial 
procedure we used a voice key and chronoscope to 
measure the reaction time. The scope was activated by 
the opening shutter of the tachistoscope and stopped 
by S's first vocalization. This method proved to be 
unsuitable since Ss would frequently burst out with 
something other than a color name, which, of course, 
stopped the undiscriminating chronoscope. Conse-
quently, we abandoned this technique and used the
stop watch. The watch was started as the experimenter
(E) dropped the shutter and stopped at first mention of
a color name.

The variable of codability was measured in five ways.
(a) The average length of naming response to each color
was obtained by counting syllables. (b) The average 
length was also obtained by counting words. (c) The 
average reaction time for each color was obtained by 
ranking all of the reaction times of an individual S and 
taking the mean rank across Ss for every color. (d) The 
degree to which Ss agreed with one another in naming a 
color was assessed as follows: We counted the total 
number of different responses to a color (DR) and also 
the number of Ss who agreed on whatever response was 
most often given to a particular color (CR). The first 
value was subtracted from the second and a constant of 
20 added to keep the results positive (CR — DR + 20). 
Color 18, for example, was given the following eight 
different names: gray-blue, blue, light gray-blue, light 
blue, very pale blue, light blue-gray, pale blue, and 
powder blue. Of these, the single-word response “blue” 
occurred most often—six times. Color 18, then, scored 
6 — 8 + 20, or 18. (e) The degree to which Ss agreed 
with themselves from one time to another in naming a 
color was calculated as follows: Five Ss were recalled 
after a period of one month and subjected to a repeti- 
tion of the naming procedure. When an S gave identical 
responses to a color on the two occasions, we counted 
one agreement. We determined the number of agree- 
ments for each S and considered that to be unity. Each 
individual agreement was then given the appropriate 
fractional value. Suppose an S had eight agreements. 
If he agreed in his name for Color 11, he would add 1/8 
to the score for that color. The agreement score is, then, 
the sum of the individual performances weighted for 
each individual's over-all tendency to agreement. 
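
The interpersonal-agreement index (d) can be illustrated with a short Python sketch. The function name and the per-name tallies are hypothetical: the text reports only that Color 18 drew eight different names from the 24 Ss and that "blue" was given six times, so the remaining counts are invented purely to make the example run.

```python
from collections import Counter


def interpersonal_agreement(responses, constant=20):
    """Return CR - DR + constant for the naming responses to one color.

    DR is the number of different names given to the color;
    CR is the number of Ss who gave the most common name.
    """
    counts = Counter(responses)
    dr = len(counts)
    cr = max(counts.values())
    return cr - dr + constant


# Hypothetical tallies for Color 18 (only "blue" = 6 and the
# total of eight different names come from the text).
color_18 = (
    ["blue"] * 6
    + ["light blue"] * 5
    + ["gray-blue"] * 4
    + ["pale blue"] * 3
    + ["powder blue"] * 2
    + ["light gray-blue"] * 2
    + ["very pale blue"]
    + ["light blue-gray"]
)

score = interpersonal_agreement(color_18)
print(score)  # 6 - 8 + 20 = 18
```

Because only the modal count and the number of distinct names enter the score, any other split of the remaining 18 responses across the seven other names would give the same value.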

In Table 2 the intercorrelations of scores on these 
five measures appear. All correlations are in the predicted 
direction and most of them are significant, with .355 the 
smallest. With a single iteration this matrix yielded a 
general factor which we call codability. No correlations 
over .113 remain after the extraction of this single fac- 
tor. Our fourth index, the degree of agreement between 
Ss, has by far the largest factor loading. It was selected 
as the measure of codability for the second phase of the 


TABLE 2 
CORRELATION MATRIX FOR FIVE INDICES OF CODABILITY 

MEASURE 1 2 3 4 5 

Number of syllables 
.368 .486* .537* 
Reaction time 
Interpersonal agreement 
Intrapersonal agreement 
.864* 
.649* .773* 
k from second factoring 
Communality from first factoring 
.589 .195 
.403 .653 

* p ≤ .05. 




experiment. The obtained codability values for the 24 
colors are listed in Table 1. 


Codability and Recognition. 

Once the codability variable suggested by Whorf’s 
ethnolinguistic observations had been operationalized, 
it remained to relate this variable to some nonlinguistic 
behavior which might be considered an index of availa- 
bility. We selected the recognition of colors. 

From the 240 Munsell chips taken at highest satura- 
tion we selected out alternate chips, taking care to in- 
clude the 24 colors for which codability data had been 
collected. The resultant collection of 120 colors was 
systematically mounted on a white card. Hue varied 
along the vertical dimension of the card and brightness 
on the horizontal dimension. Since there were 20 steps of 
hue and only 6 of brightness, we divided the total colors 
in half and mounted one half above the other so as to 
make a more manipulable display. 

New Ss were screened, as before, for color blindness 
and language background. The basic procedure was to 
expose simultaneously 4 of the 24 colors, remove them, 
and ask Ss to point to the colors just seen, on the large 
chart of 120. Neither E nor S mentioned any color 
name during the session. The recognition score for a 
color was computed as follows: We determined the 
number of correct identifications made by each S and 
considered this number to be unity. Each individual 
correct identification was given the appropriate fractional 
value. Suppose, for instance, that an S who correctly 
identified a total of six colors recognized Color 24. 
This recognition would have counted as 1/6 on the total 
recognition score for that color. Another S for whom 
Color 24 was one of eight correctly identified colors 
would have contributed 1/8 to the score for Color 24. In 
other words, the recognition score for a color is the sum 
of the individual performances weighted for each S's 
over-all ability to recognize colors. The scores for the 
24 colors appear in Table 1. 
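
The weighting rule for the recognition score can be sketched in Python as follows. The function name, the subject labels, and the particular sets of colors are hypothetical; only the rule itself (each S's correct identifications together count as unity) and the six-color and eight-color cases for Color 24 come from the text.

```python
from fractions import Fraction


def recognition_scores(correct_by_subject):
    """Sum, for each color, every S's correct identification of it,
    weighted by 1 / (number of colors that S identified correctly)."""
    scores = {}
    for colors in correct_by_subject.values():
        if not colors:
            continue  # an S with no correct identifications adds nothing
        share = Fraction(1, len(colors))
        for color in colors:
            scores[color] = scores.get(color, Fraction(0)) + share
    return scores


# Hypothetical Ss: one identified six colors, the other eight;
# both recognized Color 24.
correct = {
    "S1": {24, 1, 2, 3, 4, 5},
    "S2": {24, 1, 2, 3, 4, 5, 6, 7},
}

scores = recognition_scores(correct)
print(scores[24])  # 1/6 + 1/8 = 7/24
```

Exact fractions are used so that each S's contributions sum to exactly one, which is the property the scoring rule is designed to have.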

In trial runs, Ss were asked how they managed to 
retain the four colors in memory after they were re- 
moved from sight. Most Ss reported that they named 
the colors when they were exposed and "stored" the 
names. It seemed likely, therefore, that those colors 
that are quite unequivocally coded would be most 
likely to be recognized. When a color elicits a consider- 
able range of names, the chances of recovering the color 
from the name would be reduced. This expectation was 
fulfilled by a rank-order correlation of .415 between 
codability and recognition scores. 

There is, however, another variable that influenced 
recognition. The 120 colors used are not perceptually 
perfectly equidistant. The manufacture of equidistant 
color chips is technically difficult and expensive and, 
indeed, above a certain level of saturation, impossible. 
Since we were unable to control experimentally the 
variable "discriminability," we must ask whether or not 
our findings were due to a positive correlation between 
codability and discriminability. Could it be that our 
codable colors were so distant, perceptually, from their 
nearest neighbors that their superior recognizability 
was actually due to these better discrimination conditions? 
To obtain an answer to this question we determined 
the true perceptual distance between each of the 
colors used from the Newhall, Nickerson, and Judd 
(12) charts. These charts convert every Munsell book 
notation into a renotation which is the specification of 
a true perceptual locus of each color within the Munsell 
coordinate system. The difference between two renota- 
tions expresses quantitatively the perceptual distance 
between the colors. 

For each of the 24 test colors we computed a discrimi- 
nability score which describes its distinctiveness from 
the colors surrounding it. The difference between two 
renotations yields three numbers, one for each dimen- 
sion. To make these numbers perceptually commen- 
surable (i.e., to reduce them to a common denominator), 
the Optical Society of America Subcommittee on the 
Spacing of the Munsell Colors suggests the values 3, 2, 
and 1 for hue, chroma, and value, respectively. Since 
every color has two neighbors on each of the three di- 
mensions, a total of six numbers will express, in a 
rough way, the discriminability of that color. The sum 
of these yields the unadjusted discriminability score. 
Adjustments of this score are necessary (a) because if 
a color appears on the margin of our chart it has a lower 
chance of being recognized correctly and (b) because a 
color that has a very close neighbor on one side and dis- 
tant neighbors on three others might come out with a 
good discriminability score although the close contigu- 
ity on one side would hinder correct recognition con- 
siderably. Consequently, colors appearing on the 
margin of our chart had the constant 3 subtracted from 
their unadjusted discriminability score, and colors with 
a close neighbor had the constant 6 subtracted. 

Our scoring method is to a certain degree arbitrary, 
to be sure, but since the equation of perceptual distances 
on different visual dimensions is an unsolved problem, 
there seems to be no more objective method available. 
In addition, of course, all decisions were made without 
knowledge of recognition scores. 

Since we were unable to control discriminability 
experimentally, we controlled it statistically. The par- 
tial correlation between codability and recognition, 
with discriminability constant, is .438. Furthermore, 
the correlation between codability and discriminability 
is .074, which is not significant. Evidently the relation 
between codability and recognition is not a consequence 
of variations in discriminability. 
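
As a check on the arithmetic, the partial correlation can be recomputed from the three pairwise coefficients. This is a minimal sketch that assumes the standard first-order partial correlation formula was applied directly to the rank-order coefficients; the source does not state which formula was used, and the function name is hypothetical. The discriminability-recognition value is the Group C entry of Table 4.

```python
import math


def partial_correlation(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y with z held constant."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz**2) * (1 - r_yz**2))


r_cr = 0.415  # codability with recognition (reported in the text)
r_cd = 0.074  # codability with discriminability (reported in the text)
r_dr = 0.503  # discriminability with recognition (Group C, Table 4)

partial = partial_correlation(r_cr, r_cd, r_dr)
print(round(partial, 3))  # 0.438
```

Under that assumption the result agrees with the reported value of .438. Because codability and discriminability are nearly uncorrelated, holding discriminability constant removes very little from the numerator while shrinking the denominator, so the partial coefficient is slightly larger than the raw one.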

Since the reports of our early Ss indicated that colors 
were stored in linguistic code, it seemed likely that 
color codability would increase in importance as the 
storage factor was maximized in the recognition situa- 
tion. Discriminability, on the other hand, should remain 
at the same level of importance or possibly decline 
somewhat. If, for example, a single color were exposed, 
removed, and then identified with minimal delay, Ss 


TABLE 3 
RECOGNITION PROCEDURES 

                 NUMBER OF COLORS
GROUP    N       ORIGINALLY EXPOSED    LENGTH OF INTERVAL    CONTENT OF INTERVAL

A        9       1                     7 seconds 
B        9       4                     7 seconds 
C        16      4                     30 seconds 
D        9       4                     3 minutes             Tasks 


Note.—Exposure time for all groups was 3 seconds. 




TABLE 4 
CORRELATIONS INVOLVING SCORES ON CODABILITY (C), 
DISCRIMINABILITY (D), AND RECOGNITION (R) 
WITH FOUR EXPERIMENTAL CONDITIONS FOR 
RECOGNITION 

                                     C WITH R,
GROUP    C WITH R    D WITH R        D CONSTANT

A        .248        .540*           .248 
B        .411        .460*           .426* 
C        .415        .503*           .438* 
D        .487*       .505*           [illegible] 

* p ≤ .05. 


might retain some direct memory of the color, perhaps 
as a visual image. In this situation discriminability 
would be a determinant of recognition but codability 
would not be. However, when the number of colors is 
increased and the interval prolonged and filled with 
activity, the importance of linguistic coding should 
increase. Table 3 describes the experimental variations 
we used. Groups A, B, C, and D are arranged in what 
we believed to be an order of increasingly difficult stor- 
age of colors. Group C is our major group, for which 
results have already been described. The tasks which 
filled the interval for Group D were simple but absorb- 
ing—the kind of thing often used in experiments on the 
Zeigarnik phenomenon. 

It can be seen from the data in Table 4 that the cor- 
relation between recognition and codability scores does 
increase as the importance of storage in the recognition 
task increases.³ The particular order obtained would 
occur by chance only once in 24 times. 
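
The figure of one in 24 follows from counting the possible orderings of four groups. A brief sketch (variable names are illustrative only):

```python
from fractions import Fraction
from itertools import permutations

groups = ["A", "B", "C", "D"]

# Every possible rank order of the four groups' correlations.
orderings = list(permutations(groups))

# Only one ordering matches the predicted sequence A < B < C < D.
predicted = ("A", "B", "C", "D")
p_chance = Fraction(orderings.count(predicted), len(orderings))

print(len(orderings), p_chance)  # 24 1/24
```

This treats all 4! = 24 orderings as equally likely under chance, which is the assumption implicit in the statement.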

Table 4 also shows that discriminability is most 
closely related to recognition in Group A, for which the 
possibility of some direct memory of the color is maxi- 
mized. The importance of discriminability declines 
slightly but not significantly as the recognition is made 
more difficult. Our expectations with regard to both 
codability and discriminability are generally confirmed. 


In the first section of this paper we con- 
cluded our discussion of lexical differences be- 
tween languages with the prediction that a 
given set of cognitive categories will be more 
available to the speakers of a language that 
lexically codes these categories than to the 
speakers of a language in which the categories 
are not represented in the lexicon. Lexical 
differences have been expanded into the varia- 
ble of codability, and category availability has 
been operationalized as a recognition score. 
We found that differences in the English coda- 
bility of colors are related to differences in the 
recognition of these colors. We expected these 


3 Kurtz and Hovland (9) have shown that verbalization 
during observation of stimulus objects facilitates 
recognition under certain circumstances. 




results to apply to the cross-cultural case, and 
some confirmation of this expectation is avail- 
able in the results of a study by Lenneberg and 
Roberts (10). This study of Zuni Indians used 
a field adaptation of our methods and appara- 
tus. The Zuni color lexicon codes the colors we 
call orange and yellow with a single term. 
Monolingual Zuni Ss in their recognition task 
frequently confused the orange and yellow 
colors in our stimulus set. Our English-speak- 
ing Ss never made this error. It is a distinction 
which is highly codable in English and highly 
uncodable in Zuni. Interestingly, bilingual 
Zunis who knew English fell between the 
monolingual Zuni and the native speaker of 
English in the frequency with which they made 
these errors. 

The Whorf thesis claims more than a simple 
relationship between language and cognition; 
language is held to be causally related to cog- 
nitive structure. Our correlational evidence 
does not, of course, establish the direction of 
causality. If we may be permitted a guess it is 
that in the history of a culture the peculiar 
features of the language and thought of a 
people probably develop together. 

In the history of an individual born into a 
linguistic community the story is quite differ- 
ent. The patterned responses are all about him. 
They exist before he has the cognitive struc- 
ture that will enable him to pattern his behav- 
ior in the approved fashion. Simple exposure 
to speech will not shape anyone’s mind. To the 
degree that the unacculturated individual is 
motivated to learn the language of a commu- 
nity, to the degree that he uses its structure as 
a guide to reality, language can assume a for- 
mative role. 


SUMMARY 


The Whorf thesis on the relationship be- 
tween language and thought is found to involve 
the following two propositions: (a) Different 
linguistic communities perceive and conceive 
reality in different ways. (b) The language 
spoken in a community helps to shape the cog- 
nitive structure of the individuals speaking 
that language. The evidence for the first propo- 
sition derives from a comparison of the lexical 
and structural characteristics of various lan- 
guages. The linguistic comparisons alone do 
not establish the proposition. They need to be 
complemented with psychological data. The 




second proposition is not directly supported 
by any data. However, it is clear that language 
can be described as a molder of thought since 
speech is a patterned response that is learned 
only when the governing cognitive patterns 
have been grasped. It is also possible that the 
lexical structure of the speech he hears 
guides the infant in categorizing his environment. 
These matters require empirical exploration. 

An experiment is described which investi- 
gates a part of proposition a—the idea that 
lexical differences are indicative of cognitive 
differences. Whorf reports many cases in which 
a given range of experience is lexically differ- 
entiated in one language whereas the same 
discriminations can only be described with 
phrases in another language. Rather than com- 
pare members of different linguistic communi- 
ties, we chose to work with native speakers of 
English and to compare their linguistic coding 
of two regions of experience. Within the realm 
of color vision there are colors that can be 
named with a single word and others that 
require a phrase. This kind of linguistic differ- 
ence in the length of name (measured by 
words or syllables) was found to be correlated 
with the latency of the naming response and 
the reliability of the response from person to 
person within the linguistic community and 
from time to time in one person. A factor 
analysis of these measures yielded a single 
general factor—codability. The measure car- 
rying the largest factor loading was the relia- 
bility of naming response between individuals 
who speak the same language. This variable— 
the codability of a color—proved to be related 
to Ss' ability to recognize colors. Codability 
accounted for more variance in the recognition 
task as the task was delayed and complicated 
to increase the importance of the storage 
factor. Data obtained from the Zuni Indians 
show a similar relationship between codability 
and recognition. It is suggested that there may 
be general laws relating codability to cognitive 
processes. All cultures could conform to these 
laws although they differ among themselves 




in the values the variables assume in particu- 
lar regions of experience. 


REFERENCES 


1. BÜHLER, K. Sprachtheorie. Jena: G. Fischer, 1934. 

2. CASSIRER, E. The philosophy of symbolic forms, Vol. 1. Language. New Haven: Yale Univer. Press, 1953. 

3. DOOB, L. W. Social psychology. New York: Holt, 1952. 

4. EVANS, R. M. An introduction to color. New York: Wiley, 1948. 

5. FRIES, C. C. The structure of English. New York: Harcourt, Brace, 1952. 

6. GOLDSTEIN, K. Language and language disturbances. New York: Grune & Stratton, 1948. 

7. HULL, C. L. Quantitative aspects of the evolution of concepts. Psychol. Monogr., 1920, 28, No. 1 (Whole No. 123). 

8. KORZYBSKI, A. The role of language in the perceptual processes. In R. R. Blake & G. V. Ramsey (Eds.), Perception; an approach to personality. New York: Ronald, 1951. Pp. 170-205. 

9. KURTZ, K. H., & HOVLAND, C. I. The effect of verbalization during observation of stimulus objects upon accuracy of recognition and recall. J. exp. Psychol., 1953, 45, 157-164. 

10. LENNEBERG, E. H., & ROBERTS, J. M. The denotata of color terms. Paper read at Linguistic Society of America, Bloomington, Indiana, August, 1953. 

11. MAERZ, A., & PAUL, M. R. A dictionary of color. New York: McGraw-Hill, 1930. 

12. NEWHALL, S. M., NICKERSON, D., & JUDD, D. B. Final report of the OSA subcommittee on the spacing of the Munsell colors. J. opt. Soc. Amer., 1943, 33, 385-418. 

13. NICKERSON, D., TOMASZEWSKI, J. J., & BOYD, T. F. Colorimetric specifications of Munsell repaints. J. opt. Soc. Amer., 1953, 43, 163-171. 

14. OPTICAL SOCIETY OF AMERICA, COMMITTEE ON COLORIMETRY. The science of color. New York: Crowell, 1953. 

15. ORWELL, G. Nineteen eighty-four. New York: Harcourt, Brace, 1949. 

16. SEROSHEVSKII, V. R. Jakuti. St. Petersburg: Royal Geographical Society, 1896. 

17. THORNDIKE, E. L., & LORGE, I. The teacher's word book of 30,000 words. New York: Teachers Coll., Columbia Univer., 1944. 

18. WHORF, B. L. Four articles on metalinguistics. Washington: Foreign Service Institute, 1950. 

19. WUNDT, W. Völkerpsychologie. Vol. 1. Die Sprache. Leipzig: Engelmann, 1900. 

20. ZIPF, G. K. The psycho-biology of language. Boston: Houghton Mifflin, 1935. 




Received April 12, 1954. Prior publication. 


CRITIQUE AND NOTES 


SOME PSYCHODYNAMIC ASPECTS OF FELT PARENTAL 
ALLIANCE IN YOUNG CHILDREN¹ 


EDWARD M. BENNETT AND DOROTHEA E. JOHANNSEN 
Tufts College 


THE second card of the Children’s Apperception 
Test (1, 2) shows a small bear with a large bear 
pulling on a rope 
against another large bear. In a study of 58 
diabetic children between the ages of seven 
and nine, some children used this card to 
verbalize a conflict, friendly or otherwise, 
among the members of a family group. Thirty- 
seven children, 13 boys and 24 girls, specified 
that the child bear joined either the mother or 
the father in this conflict. Twenty-four of 
these 37 children indicated specifically whether 
the child's side won or lost. 

No attempt is made to generalize this pro- 
jective alliance beyond the possibility that it 
reflects some form of felt alliance on the part of 
the child. It is the relationship of this felt 
alliance to other factors that is presented here, 
as a research finding needing further, more 
direct, study for any real generalization. 

The findings reported in Table 1 reflect the 
statistically significant relationships disclosed 
when the children who allied with the parent 
of the opposite sex are compared with the 
children who allied with the parent of the same 
sex. All tests of significance are based on two 
by two chi square. 
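
For readers who want to see the computation, a two-by-two chi square can be sketched as below. The cell counts are hypothetical (the paper reports only the resulting chi-square values, not the cell frequencies), the function name is illustrative, and the sketch omits any continuity correction because the source does not say whether one was applied.

```python
def chi_square_2x2(a, b, c, d):
    """Pearson chi square for the table [[a, b], [c, d]],
    computed without a continuity correction."""
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        raise ValueError("every row and column must contain at least one case")
    return n * (a * d - b * c) ** 2 / denom


# Critical value of chi square with 1 degree of freedom at the 5 per cent level.
CRITICAL_05_DF1 = 3.841

# Hypothetical counts for 37 children:
# rows = opposite-sex vs. same-sex allied, columns = high vs. low on a rating.
chi2 = chi_square_2x2(12, 6, 6, 13)
print(round(chi2, 2), chi2 > CRITICAL_05_DF1)
```

With one degree of freedom, any value above 3.841 is significant at the 5 per cent level, which is the criterion the note to Table 1 refers to.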

For the 24 children who indicated whether 
the child’s side won or lost the conflict, Finding 
1 reflects the presence of a significant tendency 
for the opposite-sex allied children cognitively 
to win their symbolic feud, while the same-sex 
allies show more of a tendency to lose. 

Findings 2, 3, 4, and 5 in Table 1 are based 
on ratings or rankings in which the subject 
was classified as being high or low on the vari- 
able in question on the basis of his being above 
or below the median of the total 58 children in 
the major research sample. During the course 
of the summer, the children were observed by 
camp counselors and an in-field psychological 
assistant. Findings 2 and 3 refer to mean 


1 The research findings are the result of an investi- 
gation on the psychodynamic aspects of diabetes in 
children, sponsored by the Diabetes Camp, Home and 
Hospital Fund, and the Joslin Clinic, Boston. 


ratings, based on five-point scales, by counse- 
lors and the psychological assistant. Finding 4 
refers to the mean proportional ranking of the 
subject, when ranked weekly with the other 
members of the research sample, by the psy- 
chological assistant. Finding 5 is based on the 
rating of the psychological assistant alone, and 
is open to experimental error. 

Using the general wording of the rating and 
ranking schedules, Finding 2 indicates that 
opposite-sex allied children appear more conforming 
to that which is expected of them. 
Finding 3 reflects the tendency for the op- 
posite-sex allied children to appear more willing 
to accept blame for doing something wrong. 
Finding 4 indicates that the opposite-sex allied 
children appear to rank as “passive and to give 
in easily with adults,” when compared to the 
same-sex allied children who rank as appearing 
more “aggressive and demanding with adults.” 
Finding 5 indicates greater physical activity 
on the part of the opposite-sex allied children. 

Although the data are presented more as a 
research curiosity than as definitive of any 
specific theory, one conclusion might tenta- 
tively be made. The profile of the opposite-sex 
allied children appears to be one of physical 
activity, personal blame-acceptance, con- 
formity, and passivity with adults. The same- 
sex allies appear physically inactive, blame- 
avoiding, nonconforming, and aggressive with 
adults. Perhaps it may be concluded that the 


TABLE 1 
RELATED FACTORS IN FELT PARENTAL ALLIANCE 

FINDING                                            CHI SQUARE 

When compared to same-sex allied children, 
the opposite-sex allied children: 

1. tend to win the conflict more                   4 
2. appear more conforming                          4.00 
3. appear more blame-accepting                     4.00 
4. appear more passive with adults                 4.00 
5. appear more physically active                   3 



Note.—All chi squares are significant at less than the 5 per cent 
level. For Finding 1, N = 24; for all other findings, N = 37. 




opposite-sex identifiers are a little more satis- 
factorily adjusted to the characteristic child- 
hood role. This hypothesis may be supported 
partially by the opposite-sex allied children's 
more frequent success in the symbolic familial 
conflict. 




REFERENCES 


1. BELLAK, L., & BELLAK, SONYA S. Children’s Apperception 
Test. New York: C. P. A. Co., P. O. Box 42, 
Gracie Station, 1949. 

2. TAT NEWSLETTER, J. proj. Tech., 1951, 15, (4). 


Received May 27, 1953. 


AUTOKINETIC JUDGMENTS AS A MEASURE OF INFLUENCE¹ 


HARRIET B. LINTON 
Yale University 


SINCE Sherif (5) made the valuable discovery 
that the instability of judgments of autoki- 
netic movement creates a useful experimental situa- 
tion for the study of social influence, this technique 
has been used in a large number of experiments. 
In many cases, change of autokinetic judgments 
under social influence has been used as a criterion 
for other kinds of behavior, such as test responses, 
which purport to measure suggestibility (3, 6). 
A survey of the literature, however, reveals not 
only that the results are often negative, which is 
to be expected, but also that different experiments 
dealing with similar problems often produce con- 
tradictory results. 

It seems likely that one reason for the equivocal 
results often obtained by using the autokinetic 
situation lies in the measures that are used. In 
most experiments a given degree of “influence” 
is applied to all Ss; e.g., a planted series of judg- 
ments averaging five inches above the S’s initial 
norm. The most frequent measure used is the 
mean number of inches (or other unit) that each 
S's judgments increase under the influence of the 
planted series, with this measure used in subse- 
quent manipulations of the data as a continuous 
distribution. The method proposed here differs in 
two respects: (a) a measure of statistical significance 
(the t ratio) is used for each S rather than 
the actual magnitude of increase in judgments; 
and (b) the sample is dichotomized. 

1. In most experimental work, psychologists are 
aware that a large difference between two groups 
of measures does not in itself give reliable evidence 
of a real difference. The significance of a difference 
likewise depends on the variability of the measures, 
a fact that is taken into account by the statistical 
techniques for evaluating the significance of a dif- 
ference. In the autokinetic experiment, an increase 
of 1.5 inches from one series of 20 judgments to 
another cannot have the same interpretation for 


1 This study was done while the writer was a Public 
Health Research Fellow of the National Institute of 
Mental Health. 


an S whose judgments have a sigma of 5.0 inches 
in each series that it has for an S whose judgments 
have a sigma of 0.5 inches. The estimates of the 
latter have clearly shifted in range, while those 
of the former have not necessarily done so. 
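
The contrast between the two Ss can be made concrete with a short Python sketch. It assumes a simple unpooled standard error of the difference between two series means and treats the stated sigmas as the standard deviations of both series; the source does not give the exact formula used for its t ratio, so the function and its arguments are illustrative only.

```python
import math


def t_ratio(mean_increase, sd_first, sd_second, n_first, n_second):
    """Difference between two series means divided by the
    (unpooled) standard error of that difference."""
    se = math.sqrt(sd_first**2 / n_first + sd_second**2 / n_second)
    return mean_increase / se


# The same 1.5-inch increase between two series of 20 judgments each.
t_variable = t_ratio(1.5, 5.0, 5.0, 20, 20)  # S with sigma = 5.0 inches
t_steady = t_ratio(1.5, 0.5, 0.5, 20, 20)    # S with sigma = 0.5 inches

print(round(t_variable, 2), round(t_steady, 2))  # 0.95 9.49
```

Under these assumptions the identical raw increase yields a ratio ten times larger for the S whose judgments are ten times less variable, which is the point of the argument.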


A phenomenological basis for choosing the most valid 
measure can be sought in the statements of Ss about the 
nature of influence. In a series of experiments (2; 3 
[wherein the procedure is described]), 14 Ss who felt 
that they had been influenced by a confederate's re- 
sponses were asked what the increase in their judgments 
represented. Nine felt that the movement of the light 
had not increased, but stated that the confederate had 
influenced them to change their scale; four felt that the 
light moved more, but not as much as the increase in 
their judgments, and also reported a change in scale; 
only one reported an increase in movement without a 
change in scale. In other words, all but one of the Ss 
who felt that they were influenced spontaneously stated 
that the influence caused them to change the scale with 
which they estimated movement. This evidence sup- 
ports the use of a / ratio rather than the actual magni- 
tude of the increase, since it is only by using a measure 
of statistical significance that a change in the range of 
judgments can be identified. 


In the study reported, the criterion for influence 
was a t of at least 4.50, derived from a comparison 
of an initial norm based on 10 trials and the mean 
of a series of 40 "influence trials." The actual 
criterion used in any particular experiment natu- 
rally depends on the number of trials on which the 
measure is based. It should be noted, however, 
that statistical significance in itself is not an ade- 
quate criterion for change of scale. Many Ss who 
showed none of the other attributes which were 
found to be typical of Ss classified as "influenced" 
showed a statistically significant increase in judgments. 
The criterion should be high enough that 
it can be assumed to represent a genuine change 
in the scale used by the S. 

2. The practice of dichotomizing has both theo- 
retical and practical support. In the experiment 
cited above, Ss were first divided into three groups: 




those with t > 4.50, those with t < 4.50 and > 2.00 
(significant at 5% level), and those with t < 2.00 
or an actual decrease in judgments. On the basis 
of interviews with Ss about their reactions to the 
autokinetic situation, to other experimental meas- 
ures, and to personality tests, it was found that the 
two lower groups did not differ in any discernible 
respect, but that the first group was clearly differ- 
ent from the other two. It was then decided that 
the criterion of t > 4.50 represented, under existing 
conditions, a change of scale, and that the varia- 
bility within the two resulting groups probably re- 
sulted from other causes. The practice of dichoto- 
mizing is further supported by evidence in other 
experiments which indicates that influence from 
another person tends to be reacted to in an all-or- 
none fashion, and hence that a bimodal rather 
than a normal distribution of responses results (4). 
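
The three-way grouping and the final dichotomy can be expressed as a small Python sketch. The labels, function names, and t values are hypothetical, and the handling of the exact boundaries (treating "at least 4.50" as inclusive and assigning a t of exactly 2.00 to the lowest group) is an assumption the text does not settle.

```python
def classify(t, criterion=4.50, significance=2.00):
    """Assign an S to one of the three preliminary groups by t ratio."""
    if t >= criterion:
        return "influenced"
    if t > significance:
        return "significant increase only"
    return "no reliable increase"  # includes actual decreases


def dichotomize(t, criterion=4.50):
    """Final two-way split: influenced (True) versus all other Ss (False)."""
    return t >= criterion


# Hypothetical t ratios for four Ss.
t_values = {"S1": 6.2, "S2": 3.1, "S3": 0.8, "S4": -1.4}

groups = {s: classify(t) for s, t in t_values.items()}
influenced = {s: dichotomize(t) for s, t in t_values.items()}
print(groups)
print(influenced)
```

The dichotomy simply merges the two lower groups, reflecting the finding that they did not differ from each other in any discernible respect.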


There is still the question of why one-third of the 
present sample showed a statistically significant increase 
in their judgments, but did not show the other charac- 
teristics of Ss who were influenced. The explanation is 
probably to be found in changes in the autokinetic 
phenomenon itself. Graybiel and Clark (1) have found 
that over a long series of judgments Ss show consistent 
changes in their reports, some Ss increasing consistently, 
while others decrease. It is likely that the reliable in- 
creases shown by Ss in the middle group reflect the 
same phenomenon (a few Ss in the lowest group showed 
reliable decreases in judgments). 


3. When different measures of change of judg- 
ments, based on the same raw data, are used to 
compute correlations with other variables, they 
lead to different results. The change in autokinetic 
judgments was correlated with 14 dichotomous 
Rorschach-test variables that were considered to 
reflect aspects of personality relevant to suggesti- 
bility (cf. 3 [wherein eleven variables are de- 
scribed]). To test the usefulness of different tech- 
niques for measuring autokinetic change, the 
Rorschach variables were correlated with three 
autokinetic measures: (a) biserial r with mean 
number of inches each S increased his estimates 
(the measure used most often in other studies), 
(b) biserial r with each S's t ratio, and (c) tetrachoric 
r with the sample dichotomized at t = 4.50. 
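
To show what a tetrachoric coefficient on two dichotomies looks like in practice, here is a sketch using the well-known cosine-pi approximation. This is a stand-in technique: the source does not say how its tetrachoric r values were obtained, and the cell counts below are hypothetical.

```python
import math


def tetrachoric_cos_pi(a, b, c, d):
    """Cosine-pi approximation to tetrachoric r for the table
    [[a, b], [c, d]], where a and d are the concordant cells."""
    if b * c == 0:
        raise ValueError("approximation is undefined when b or c is zero")
    return math.cos(math.pi / (1 + math.sqrt((a * d) / (b * c))))


# Hypothetical counts: rows = influenced vs. not influenced (t ratio split
# at 4.50), columns = Rorschach variable present vs. absent.
r_tet = tetrachoric_cos_pi(20, 5, 5, 20)
print(round(r_tet, 3))  # 0.809
```

The approximation is most trustworthy when both variables are split near their medians; with very uneven splits a table-based or iterative estimate would be preferable.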

Using biserial r with mean increase in judgments 
produced only three correlations of .30 or more, all 
less than .50 (median = .13); biserial r with the t 
ratio produced nine correlations of .30 or more, two 
of them higher than .50 (median = .37); tetrachoric 
r produced eleven correlations of .30 or more, seven of 
them higher than .50 (median = .51). Another way of 
comparing the three methods is in terms of the statistical 
significance of the obtained correlations: of the biserial 
r's with mean increase, four were significant at 
the 10% level, three at the 5% level, and one at the 1% 
level; of the biserial r's with the t ratio, nine reached the 
10% level, eight reached the 5% level, and four were 
at the 1% level; of the tetrachoric r's, eleven were significant 
at the 10% level, nine at the 5% level, and five at 
the 1% level. It should be noted that the criteria for 
significance are more stringent for tetrachoric r than 
for biserial r. 


Comparison of the three methods for evaluating 
autokinetic change shows, then, that higher correlations 
are obtained by using the t ratio than by 
using the actual increase in judgments, and that 
these correlations become higher when the t ratio is 
dichotomized. The possibility that the results are 
an artifact of the particular variables on which the 
correlations are based is reduced by the fact that 
the decision to use the dichotomized t ratio was 
made on the basis of other experimental measures 
before the Rorschach data were obtained. 

Personality test measures, then, are highly re- 
lated to change of autokinetic judgments when a 
measure of significance is used and dichotomized, 
while the relationships become much lower when 
the magnitude of the actual increase in judgments 
is used. This suggests that, in the latter instance, 
potential correlations are obscured by extraneous 
variability, resulting from factors with little or 
no relationship to the personality factors being 
considered. It is suggested that the extraneous 
variability is a function of variations in actual 
autokinetic perception, while the process known 
as “suggestibility” affects the scale with which the 
S judges movement (i.e., modifies his conceptual 
and verbal behavior) without necessarily modifying 
his actual perception. 


CONCLUSION 


It seems likely that in experiments using the 
autokinetic phenomenon to study suggestion the 
data reflect two different factors: (a) the actual 
perception of autokinetic movement, which is
subject to variability, but is probably influenced 
little, if at all, by social suggestion, and (b) a 
judgment process, which can often be influenced 
by social suggestion. The type of measure which 
has been here suggested attempts to evaluate 
changes in the judgment process, and to reduce the 
effect of perceptual changes on the measure used. 
This seems to be a desirable goal in experiments 
that attempt to use the autokinetic situation to 
study social processes. It seems likely that many
experiments that have reported negative results
could yield positive results if the data were analyzed
in this way.


REFERENCES 


1. GRAYBIEL, A., & CLARK, B. The autokinetic illusion 
and its significance in night flying. J. Aviat. Med., 
1945, 16, 111-151. 

2. LINTON, HARRIET B. Relations between mode of perception
and the tendency to conform. Unpublished
doctor's dissertation, Yale Univer., 1952.

3. LINTON, HARRIET B. Rorschach correlates of response
to suggestion. J. abnorm. soc. Psychol.,
1954, 49, 75-83.

4. MURPHY, G. Personality, a biosocial approach to
origins and structure. New York: Harper, 1947.




5. SHERIF, M. A study of some social factors in perception.
Arch. Psychol., 1935, No. 187.

6. STEISEL, I. M. The Rorschach test and suggestibility.
J. abnorm. soc. Psychol., 1952, 47, 607-614.


Received May 14, 1953. 


SOCIAL RESEARCH AND THE PROBLEM OF VALUES: A REPLY 


ELSE FRENKEL-BRUNSWIK 
University of California 


In his paper, “How Neurotic is the Authoritarian?”
Masling (8) raises the objection that
undue bias entered our description of the authori- 
tarian personality (1). Since I consider this problem 
to be very crucial and since the issue of undue in- 
fluence of the authors’ values in The Authoritarian 
Personality has been repeatedly raised, I accepted 
the suggestion of the editor of this journal and of 
Masling to discuss the latter’s paper. In doing so, 
I shall start with the more concrete issues that 
Masling raises and then proceed to a more theo- 
retical consideration of the problem involved. 

One of the major points in Masling’s paper is 
that the authoritarian personality is described in 
such “nasty” terms as submissiveness to authority, 
aggressiveness against the weak, superstition, 
stereotypy in thinking, conventionality, or cyni- 
cism. All these characterizations are said to pre- 
suppose maladjustment and pathological traits 
in the authoritarian personality, whereas, in 
reality, there is no evidence that the authoritarian 
is more neurotic than the equalitarian. Masling 
cites four independent studies that show no rela- 
tion between authoritarianism and mental health, 
a result which he assumes to contradict the major 
assumption of our book. 

In reply it must first be pointed out that there 
are explicit statements in the main body of our 
book to the effect that there is no direct relation- 
ship between authoritarianism and mental health. 
On the basis of my analysis of the interviews (1,
ch. IX-XIII), we had even come to the conclusion
that the use of projection mechanisms as mani- 
fested in prejudice helps rather than hampers the 
authoritarian in preserving his mental balance, 
especially in view of the fact that in this case de- 
fense mechanisms act through channels which are 
partially socialized (p. 480). The following quota- 
tion from the same context will complement this 
by showing that the opposite, equalitarian per- 
sonality type is by no means depicted as an ideal 
of mental health: 


While for the high scorer [high on explicit ethno- 
centrism] possible loss of energy is connected with his 
tendency toward rigid repression, the low scorer is 
apt to waste energy by indulging in often unfruitful 
introspection and by placing the blame for mishaps 
too much upon himself. In contrast to the high scorer’s 
tendency toward externalization, the typical low 
scorer is prone to internalize in an excessive manner, 
and this in turn may lead to open anxiety, feelings of 
guilt, and other neurotic features (p. 475). 


In discussing the greater readiness of the equali- 
tarian personality to face his conflicts openly, we 
stress that “temporarily, however, such frankness 
may well lead to increased anxieties and depres- 
sions.” 

Neurotic features are generally fairly common 
among low scorers on ethnocentrism, but they are 
especially frequent in a subtype that we have 
called the “rigid low.” To quote from our book: 
“It must be reiterated ... that there is a distinct
type among low scorers in whom the liberal ideology
becomes a cliché and who at the same time
shows rigidity in his personality make-up” (p.
481). This also is the subtype who is somewhat
predisposed toward certain forms of totalitarianism 
even though these may be of a variety opposite 
to Nazism or other rightist ideologies. 


With [them] . . . the struggle for minority rights . . .
contained features of compulsiveness, even of paranoid
obsession. . . . With respect to many of our variables,
especially rigidity and “total” thinking, [they] could
hardly be distinguished from some of our high extremes.
. . . The latter kind of low scorers are definitely disposed
towards totalitarianism in their thinking; what is acci- 
dental up to a certain degree is the particular brand of 
ideological world formula that they chance to come into 
contact with (p. 772). 


Instead of a continuum we here have partial curvi- 
linearity, with some overlap between the rigid low 
scorer and the high scorer; curvilinear relations are 
generally frequent in personality studies and may 
be subsumed under the heading of “closeness of 
opposites” (4). 




The findings of Maria Levinson, partly based 
on the Minnesota Multiphasic Personality In- 
ventory, to the effect that there is no clear-cut 
relation between authoritarianism and mental 
health (1, ch. XXII) are thus not in disagreement 
with the rest of our book. It seems that our rather 
dynamically conceived syndromes cut across the 
classifications prevalent in psychiatry, which are 
also used in scoring the MMPI. These classifica- 
tions may be too undifferentiated, or rather too 
oblique, for our purposes, covering as they do a 
variety of characteristics which very likely are in- 
homogeneous as far as our specific syndromes and 
their psychological dynamics are concerned. 

In a more recent paper Gough (6) even found 
some relationships between authoritarianism and 
the clinical scales of the MMPI, such as those for 
psychopathic deviate or for schizophrenia. More 
marked relationships were found between au- 
thoritarianism and personality clusters based on 
an analysis of individual items. Gough concludes 
that authoritarians tend to be resentful, harassed,
embittered, bewildered, and to display certain 
paranoid traits. 

Let us now turn to the results of Knutson as 
reported by Masling. There is an absence of re- 
lationship between the score on Sanford's (9) 
authoritarianism scale, which is a modification 
of our F scale, and the score on a “Personality 
Security Scale.” Such contradictions can usually
be reconciled by reference to different levels of 
personality structure. We found in the intensive 
clinical interviews with authoritarians that while 
on the surface they often profess self-assurance, 
on further probing they turn out to be rather in- 
secure. In a previous study on adolescents (2, p. 
243), I was able to demonstrate that both the most 
secure and the most insecure individuals tend to
score high on items that constitute a rather direct 
inquiry into feelings of security. Extreme overt 
self-satisfaction, far from being a dependable sign 
of adjustment, often just indicates that repressive 
mechanisms are at work. The curvilinear relation- 
ship obtained between self-reported adjustment 
and objective adjustment as rated by clinicians 
(2) allows us to expect that authoritarian indi- 
viduals will tend to score either quite high, or 
else quite low, on a security scale, depending upon 
the directness of the items. One of our most sig- 
nificant findings with authoritarian individuals 
was that they tend to have an explicit image of 
themselves which is in contradiction to the one 
revealed by a more searching depth evaluation of 
the interviews and projective techniques. This is 
only one of the many inconsistencies in the authoritarian
personality. Most of these inconsistencies
are the result of an inherent discrepancy
between surface and depth. This also holds for the 
principle of the closeness of opposites as it appears
in the coexistence of rigidity and extreme fluidity,
of overcautiousness and a tendency toward short 
cuts to action, of submission to and resentment of 
authority, of emphasis on control and an uncon- 
trolled yielding to impulses. 
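
The statistical point behind this argument can be illustrated with a minimal sketch in Python. The numbers are invented purely for illustration and are not data from any of the studies cited; `objective` and `self_report` are hypothetical variable names. The sketch assumes that self-reported security is high at both extremes of clinician-rated adjustment. Under that assumption a product-moment correlation computed over the whole range comes out at zero, although the two measures are closely related once distance from the midpoint is considered.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Hypothetical clinician ratings of adjustment, centered on the average (0).
objective = [-3, -2, -1, 0, 1, 2, 3]

# Hypothetical self-reported security: high at BOTH extremes (U-shaped).
self_report = [x * x for x in objective]

# Linear correlation over the whole range.
linear_r = pearson_r(objective, self_report)

# Correlation with distance from the midpoint, ignoring direction.
folded_r = pearson_r([abs(x) for x in objective], self_report)

print(f"linear r = {linear_r:.2f}, folded r = {folded_r:.2f}")
```

A zero linear coefficient of this kind is what a direct security questionnaire could be expected to yield if the relationship is in fact curvilinear, which is why an absence of correlation need not mean an absence of relationship.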

The qualifications that have just been made in 
connection with overt security scales also apply to 
the other structured, or semistructured, tests 
referred to by Masling. The elicited answers may 
have originated in the deeper layers; or they may 
have come from the surface layers, being largely
determined by the compensatory defense mecha- 
nisms which are so typical of the authoritarian 
personality. Relationships would thus tend to be 
obscured. The clinical interview, with its deeper 
probing and the opportunity it gives the subject 
for spontaneous formulations of the issues in ques- 
tion, provides us with sufficient cues for differen- 
tiation between the surface layers and the deeper 
layers. Probings of this kind may be able to give 
the key to the fact that, in a study on Navy re- 
cruits reported by Masling, no relationship could 
be found between authoritarianism scores and 
hospitalization. Hospitalization is a behavioral 
end effect and thus a surface phenomenon that 
may have a variety of dynamic causes. In describ- 
ing the personality patterns of the hospitalized 
recruits, Masling points out that they generally 
tended to display immature behavior and emo- 
tional instability in forms frequently leading to 
maladjustment. We can well imagine that in the 
nonauthoritarian a given Navy or Army situation 
could induce manifestations of anxiety and in- 
stability along the line of open resentment against 
authority figures and against the overstructured 
situation; in the authoritarian, on the other hand, 
it may be mainly the situational change as such 
and the temporary loss of the clear-cut structure 
provided by the family which is the source of 
maladjustment.

It was primarily the above-noted complexity 
of the relationships between the underlying states 
and the overt behavioral manifestations that 
prompted me in another context (5) to advocate 
the use of a different vocabulary for underlying 
aggression, anxiety, and so forth, on the one hand, 
and manifest aggression, anxiety, and so forth, 
on the other. The use of the same term for both 
layers creates the illusion that we are dealing with 
the same set of facts; in reality, however, there are 
—at least—two sets. These vary independently to 
a certain degree, and the curvilinear character of 
the relationships referred to above further be- 
speaks the complexity of the patterns involved. 

Let us now turn to what is ostensibly Masling’s 
main objection. This is to a certain degree independent
of the problem of the relationship between
authoritarianism and mental health. It concerns
the somewhat broader observation that
evaluative terms are used at all in our description
of the authoritarian personality. Masling is cer- 
tainly right in taking for granted that such terms 
as prejudice, aggression, and rigidity carry some 
negative connotations. Statements containing 
these and related terms are by no means necessarily 
value judgments in the technical sense, however. 
The question arises here whether or not as a matter 
of principle terms with evaluative connotations 
can or should be avoided at all in the social sciences. 
Almost all trait names are evaluative, imbued as 
they are with functional versus nonfunctional or 
malfunctional connotations. It would be extremely 
awkward to list each time a series of sober opera- 
tional specifications instead of using the emo- 
tionally somewhat loaded terms of the vernacular 
or of ordinary personality psychology. And even 
so, instead of saying that authoritarians are rigid, 
we would have to say that they tend to cling to 
the familiar and definite, that their approach
stresses detail and is generally stimulus-bound, 
that they have difficulty in changing their attitude 
or set, that they are apt to search for absolute 
solutions, that most of their actions stem from the 
need to protect themselves rather than being free 
and spontaneous, and that their feelings and 
thoughts are highly channelized, making them 
relatively inaccessible to new experience; ob- 
viously this attempt to eliminate the general term 
by giving behavioral specifications would not rid 
us of evaluative involvement. 

The reason is obvious; there are richer and 
poorer, more efficient and less efficient orientations 
or ways of dealing with reality, and as scientists 
we are not entitled to obscure or to circumvent 
this fact. Yet it is important to remember that 
statements asserting the presence of such survival- 
relevant traits as rigidity, conventionality, super- 
stition, or punitiveness in a person or group can 
fully be translated into more descriptive state- 
ments about ways of dealing with reality, and that 
such statements do not constitute value judgments 
in the sense of an expression of preference on the 
part of the investigator. In keeping with this, such 
terms as "aggression against the weak"—to which 
Masling objects as evaluative—acquire a different 
flavor as soon as we remember that “aggression” 
designates a certain way of functioning in a de- 
scriptive manner or as a dispositional term, and 
thus is a legitimate part of the behavioral vo- 
cabulary, and that “weak” is defined by a certain 
objective locus in the social hierarchy. Similarly, 
“superstitious” designates nothing but the pres- 
ence of a belief in supernatural explanations of 
natural phenomena. The sting that is bound to 
remain in any reference to such attitudes is given 
by the fact that they are in general ill suited for
coping adequately with the complexities of social
and cognitive problems. They tend to lead to con- 
sequences that are detrimental to society, to others, 
and often to their holder himself; but this tendency 
is empirically verifiable like any other factual 
trend. 

This situation is by no means unique to psy- 
chology. In chemistry such originally rather emo- 
tive terms as “noble” versus "base" have for long 
periods of time been used in the purely scientific 
characterization of certain properties of metals 
that are relevant to their own “survival,” such 
as their ability to resist corrosion. 

It must be granted, nonetheless, that as long 
as we allow ourselves to become enmeshed with 
adaptive versus maladaptive ways in the am- 
biguous sphere of emotions and social attitudes, 
there is danger that the values of the researcher 
will enter the field and that the facts will be over- 
interpreted in the direction of his prescientific 
value system. Awareness of this danger led me to 
undertake perceptual experiments with ethno- 
centric and nonethnocentric adolescents in which 
the reality, presented in the form of a story or a 
perceptual stimulus, was known beyond doubt, 
and thus the reality-adequate versus narrow or 
distortive handling of this reality could be ascer- 
tained in an objective, indisputable manner (4). 
The results of this study as well as those of sub- 
sequent independent investigations have con- 
firmed that the authoritarian personality tends to 
a more limited and narrow approach to reality, 
and at the same time distorts it to a greater de- 
gree than the equalitarian. 

As we have stressed again and again in our 
book, these nonoptimal forms of approaching 
reality are by no means to be taken for any gross 
maladjustment or mental deviation. In fact, as 
we have concluded on the basis of our interviews, 
the authoritarian may, and frequently does, func- 
tion extremely well in most routine contexts of 
life; he probably even betters the equalitarian 
under certain narrowly circumscribed conditions, 
such as when the culture provides socialization 
for the mechanisms involved. Nazi Germany is an 
excellent example of a people functioning well 
within a certain given framework. This remains 
true in spite of the fact that in due time the reality- 
inappropriate expectations of a final victory of the 
master race and the projective and thus distortive 
view of the democracies as effeminate societies resulted
in self-destruction. Catastrophes like this
have confirmed us in our assumption that the 
authoritarian syndrome possesses a reduced sur- 
vival value in the long run, adaptive as it may be 
for the short haul. 

With all this in mind, we are still far from denying
that the authors of The Authoritarian Personality
brought with them, as would most citizens
in general and most social scientists in particular, 
a definite set of values concerning the very issues 
that were under investigation. In our Introduction 
we explicitly recognized that our own values differ 
considerably from those expressed by our au- 
thoritarian subjects. In holding such preconcep- 
tions, social scientists differ from other scientists 
only in degree, not in kind. Choice of the field of 
study, and of the specific problems of research 
within this field, seem to be issues the social 
scientist shares with all other scientists. These 
choices are largely based on prescientific value 
decisions; there can be little doubt that they are 
determined at least in part by social background, 
personality, and other factors in the biography 
of the individual. 

Ideally, the only value system that should be 
involved while engaging in scientific work is what 
Spranger has called “theoretical values.” More 
aptly, it may be characterized as the value system 
of scientific objectivity. However, as we witness 
even in the most abstract and logically rigorous 
theoretical systems in psychology—say, Hull’s 
postulational theory of learning—or in physics, 
science as an actual human undertaking has always 
been full of subjective preconceptions. These pre- 
conceptions are often of an esthetic or formalistic 
character, containing a strong element of bias or 
distortion. Those denying the existence of such 
biases in their own work in most cases turn out 
just not to have made them sufficiently explicit 
to themselves. In this respect, the social scientist 
is as much a child of his period with its varying 
climates of conceptualization as are his colleagues 
in other areas of psychology. Over and beyond 
this, however, there is probably no way for the 
social scientist to fully escape infiltration of his 
outlook by the sociopolitical climate of opinion 
in which he lives, no matter how hard he may try. 
An analysis of the subtle effects of the attitudes 
involved would probably be as difficult as an 
analysis of the fact that, say, psychologists in some 
countries have tended to become Gestalt psychol- 
ogists while in others they became behaviorists. 
It is one of the foremost tasks facing the historian 
of science and culture, if not the social scientist, 
to make explicit and to trace the influences of the 
biases inherent in declared or tacit allegiance to 
any of these so-called “points of view.” 

The only important question in our context is 
whether or not the preconceived social and cogni- 
tive points of view were able to interfere with 
scientific rigor, that is, with the objective treat- 
ment of the specific problem under investigation. 
So far as our study of authoritarianism is concerned,
we must show that the personality relationships
that were presented as empirically established
are not merely the expression of wishfulness
on our part. Several arguments may be brought
forth to prove our original contentions.

First, we tried to make use of the most objective 
and quantitative techniques available in the social 
sciences today. In the evaluation of the interviews 
we tried to reduce bias to a minimum by the 
adoption of a "blind" scoring procedure and by 
employing a number of independent raters. A 
confirmation of the interview interpretations is 
given by the fact that the reactions on our F scale, 
purporting to measure in an indirect, projective 
way the personality traits associated with authori- 
tarianism, show that our authoritarian subjects 
tended to present themselves precisely as we ex- 
pected them to do according to our theory based 
on the more interpretative approach. 

Second, as mentioned above, the reactions of 
authoritarian children to such clearly defined 
stimuli as perceptual configurations, stories, or 
cognitive tasks show tendencies of rigidity, dis- 
tortion of reality, and intolerance of ambiguity 
similar to those found in the more controversial 
social and emotional area (4). 

Third, in a separate California study on preju- 
dice in children and adolescents (3), it was demon- 
strated even more directly that many of the 
tendencies that were found to be accompaniments 
of authoritarianism and ethnic prejudice at all 
age levels—such as superstition, intolerance of 
ambiguity, tendency to dichotomize, admiration
of strength—are at the same time more frequent 
in chronologically earlier stages of development. 
In other words, the direction of the difference be- 
tween prejudiced and unprejudiced subjects in 
general, with age held constant, is the same as 
that between the younger and the older child or 
adolescent. The assertion that the authoritarian 
personality is one of relative immaturity is meant 
in this strictly descriptive rather than in an evaluative
sense.

Fourth, one of the most decisive arguments 
may be derived from a comparison of the results 
on the same topic of social scientists holding dif- 
ferent prescientific preferences. From the purely 
argumentative point of view it is unfortunate that 
research concerning war and prejudice is in this 
country engaged in predominantly by those who 
do not like war or prejudice. Under this same as- 
pect we must welcome the fact that we have avail- 
able in the writings of Jaensch (7) a very articulate 
description of the ideal personality type from the 
Nazi point of view. While my own work on the 
perceptual reactions of authoritarian and nonauthoritarian
children was in many ways influenced
by the German typologists, among whom
Jaensch had long been a prominent figure, his more 
specific writings on the Nazi ideal of personality 
did not reach me until most of the work had been 
done. Jaensch finds, as we do, the Nazi personality 
rigidly stimulus-bound. But unlike ourselves, 
Jaensch evaluates this rigidity as a positive trait 
and looks down upon the intellect in particular, 
and upon “adaptability” in general, as expressions 
of degeneration and morbidity. In this and many 
other ways a comparison between his and our
findings reveals that investigators with opposite
evaluative attitudes toward certain social phenomena
may well reach the same conclusions concerning
the factual aspects of the phenomena concerned.

Situations such as this, along with logical con- 
siderations, led Stevenson (11) to suggest a clear- 
cut distinction between value judgment, on the 
one hand, and scientific or factual judgment, on 
the other. A similar point of view has been taken 
by most logical positivists. Ethical judgments, 
that is, statements containing such terms as “good,” 
“desirable,” “ought to,” etc., are considered to be 
emotive expressions in which preferences are stated. 
Stevenson is especially impressed by the fact that 
there can be ethical disagreement—“disagreement 
in attitude"—without a disagreement about facts, 
that is, without “disagreement in belief.” Thus, 
he points out that two persons can fully accept 
the same facts and still differ in their attitudes 
toward the facts. No other method of convincing 
the other person is available in such cases except 
the method of persuasion. Though often our atti- 
tudes are based on facts, neither facts nor scientific 
methods can be granted a definitive role in a 
“normative science” such as ethics. 

Another group of value theories may be sub- 
sumed under the naturalistic point of view in that 
statements about values are not considered to be 
different from empirical statements but assumed 
to be open to verification in the same manner as 
other scientific statements. According to this view, 
moral predicates can be restated in nonmoral 
predicates, such as that of "better" conduct as 
one of “greater complexity”; in general, restate- 
ment of moral judgments takes place in terms of 
consequences. Thus, the certainty of moral state- 
ments is seen as neither greater nor less than that 
of scientific statements. 

In criticizing the naturalistic point of view, an 
earlier representative of positivist ethics, Schlick, 
states: “A fundamental error lies at the basis of 
the whole attempt; it consists in seeking value 
distinctions in the objective facts themselves, 
without reference to the acts of preference and 
selection through which alone values come into
the world” (10, p. 104).




The logical positivist point of view concerning 
values has been criticized mainly on the ground 
that this position sees in value statements purely 
arbitrary expressions of emotions and preferences. 
Suffice it to say here that one may counter this 
argument by pointing out that although ethical 
norms must be differentiated from facts, facts play 
a part in all value statements. Furthermore, indi- 
viduals and cultures do, on the whole, not differ 
widely with respect to what are considered the 
ultimate ethical goals, so that the arbitrariness in 
preferences is more a matter of means. Even 
Jaensch would doubtless agree with us on many 
values, such as the desirability of health—a factor 
which, by the way, he stresses a great deal. 

Although we follow Stevenson to some extent 
in his view of the subjective nature of some of our 
ultimate value judgments or attitudes, we find 
his distinction between factual and value state- 
ments in many respects overdrawn. To begin 
with, statements and agreements about facts are 
not as clear-cut and uncontroversial as meets the 
eye, due largely to the preconceptions which enter 
our view of the facts from the very beginning. 
Even in the most “objective” sciences, the so- 
called theoretical structure, although guided by 
experience, is in the main a product of our imagina- 
tion and is connected only loosely, by so-called 
interpretative links, with the observational data, 
thus allowing for alternative explanations of the 
same facts. Furthermore, in many cases an alleged 
value statement turns out to be fully translatable 
into statements about facts. In any event, value 
judgments are at least partially based upon fact, 
a circumstance which Stevenson also recognizes. 

For all these reasons, the knowledge accumu- 
lated in the social sciences may help us to make 
a choice between alternative value systems. In 
renouncing any metaphysical or absolute position, 
we do not need to go to the other extreme of utter 
relativism. Although the social scientist, as a
scientist, cannot make the ultimate choice for 
mankind, his function is to throw as much light 
as possible on the implications involved in existing 
value systems, and to make explicit all the rami- 
fications inherent in the options. In our work on 
the authoritarian personality we tried to establish 
a network of empirical relationships between the 
holding of certain social attitudes, such as ethno- 
centrism, and a host of seemingly more neutral 
personality traits. Even though statements about 
personality traits almost always carry a certain 
evaluative reference as to the immediate or long- 
range adjustment of the individual or society, 
they are in themselves cognitive statements in that 
they refer to relations between empirical facts. 

It was our hope that our work would increase 
our theoretical understanding of both personality
and society, and that it would add to the factual
basis for such value decisions as to whether or not 
we should combat authoritarianism, be it one of 
the right or one of the left. But only the direct 
designation of authoritarianism as desirable or 
undesirable would constitute a value judgment in 
the logical sense of the word. Only the direct ex- 
pression of such preferences, or the distortion of 
fact under the indirect influence of such judgment, 
would go beyond, or interfere with, scientific procedure.
I have tried to show that we have steered
clear of either of these impasses. 


REFERENCES 


1. ADORNO, T. W., FRENKEL-BRUNSWIK, ELSE,
LEVINSON, D. J., & SANFORD, R. N. The authoritarian
personality. New York: Harper, 1950.

2. FRENKEL-BRUNSWIK, ELSE. Motivation and behavior.
Genet. psychol. Monogr., 1942, 26, 121-265.

3. FRENKEL-BRUNSWIK, ELSE. A study of prejudice
in children. Hum. Relat., 1948, 1, 295-306.

4. FRENKEL-BRUNSWIK, ELSE. Intolerance of ambiguity
as an emotional and cognitive personality
variable. J. Pers., 1949, 18, 108-143.

5. FRENKEL-BRUNSWIK, ELSE. Psychoanalysis and
the unity of science. Proc. Amer. Acad. Arts Sci.,
1954, 80, No. 4, 271-350.

6. GOUGH, H. Studies of social intolerance: II. A
personality scale for anti-Semitism. J. soc.
Psychol., 1951, 33, 247-255.

7. JAENSCH, E. R. Der Gegentypus. Leipzig: Barth,
1938.

8. MASLING, J. M. How neurotic is the authoritarian?
J. abnorm. soc. Psychol., 1954, 49, 316-318.

9. SANFORD, F. H. Authoritarianism and leadership:
A study of the follower's orientation to authority.
Philadelphia: Institute for Research in Human
Relations, 1950.

10. SCHLICK, M. Problems of ethics. (Trans. by D.
Rynin.) New York: Prentice-Hall, 1939.

11. STEVENSON, C. L. Ethics and language. New Haven:
Yale Univer. Press, 1945.


Received February 15, 1954. 


ADMINISTRATION OF THE F SCALE TO A SAMPLE OF GERMANS¹


THOMAS S. COHN
Wayne University

HENRY CARSCH
Yale University


The F scale has been administered to a wide
variety of groups with the implication that differences
in mean scores reflected differences in deep-lying,
potentially fascist personality trends established
before the age of five. This report makes
available data on the F scale obtained from a
sample living in Germany. The scale was translated
into German² and administered to 140 workers in
a cosmetics factory in 1952. The mean F score was
5.26, SD of .86. This score is higher than the mean
score for any group thus far reported in the literature
(p < .01). Thus, the mean F score for San
Quentin prisoners was 4.73, SD of .86; for working
class men, 4.19, SD of 1.18; for professional women,
3.43, SD of .86; etc. (1). If one makes the assumption
that these workers are “fascistic” personalities,
then the data appear to support the validity of the
F scale.


1 This study was carried out while the senior author
was a member of the staff and the junior author a
student at the University of California, Santa Barbara
College.

2 The authors would like to express their appreciation
to Dr. Rolf Linn of the University of California,
Santa Barbara College, for his aid in translating the
F scale into German.


In studies using American Ss, F scores are 
negatively correlated with educational level (2, 3). 
This relationship holds for our German sample. 
The mean F score of the 117 Ss that had attended 
Volkschule or Volkschule and Mittelschule was 
5.40, SD of .86, while the mean F score of the 
23 Ss that had attended Hochschule was 4.57, 
SD of .85. The difference between means is sig- 
nificant (p < .01). This finding indicates the 
possibility that the F-scale score is partly a result 
of educational forces operating after the age of 
five. 
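
The comparison between the two educational groups can be checked from the reported summary statistics alone. The following is a minimal sketch, not the authors' own computation: it assumes a Welch-type test on the two means and a normal approximation for the two-sided p value, and the function names are illustrative.

```python
import math

def welch_t(mean1, sd1, n1, mean2, sd2, n2):
    """Welch t statistic for two independent group means (summary data only)."""
    standard_error = math.sqrt(sd1 ** 2 / n1 + sd2 ** 2 / n2)
    return (mean1 - mean2) / standard_error

def two_sided_p_normal(z):
    """Two-sided p value under a normal approximation (an assumption here)."""
    return math.erfc(abs(z) / math.sqrt(2))

# Values reported in the text: Volkschule/Mittelschule group vs. Hochschule group.
t_education = welch_t(5.40, 0.86, 117, 4.57, 0.85, 23)
p_education = two_sided_p_normal(t_education)

print(round(t_education, 2), p_education < 0.01)
```

Under these assumptions the statistic comes out a little above 4, which agrees with the reported significance at the .01 level.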


REFERENCES 


1. ADORNO, T. W., FRENKEL-BRUNSWIK, ELSE, LEVINSON,
D. J., & SANFORD, R. N. The authoritarian
personality. New York: Harper, 1950.

2. COHN, T. S. Factors related to scores on the F scale.
Unpublished Ph.D. thesis, Univer. of Michigan,
1953.

3. LASKY, J. J. Relations of experimentally induced
changes in ethnocentric statements to underlying
personality variables. Unpublished Ph.D. thesis,
Univer. of Michigan, 1950.


Received March 6, 1953. 




NOTES ON THE WESLEY RIGIDITY SCALE: THE DEVELOPMENT OF A
SHORT FORM¹


SEYMOUR L. ZELEN AND EUGENE E. LEVITT
State University of Iowa 


In a recent article, Wesley (5) has reported the 
development of a paper-and-pencil scale for the 
measurement of rigidity. The scale consists of 50 
statements with which the subject indicates agree- 
ment or disagreement. A 41-item form of the scale 
was also developed and has been used in two researches
which cast serious doubt on the validity
of the scale. Katz (2) found that it was related to 
only one of 11 measures of rigidity, the E scale. 
Moldawsky (3) devised an ingenious experiment in 
which a measure of rigidity was obtained from 
spontaneous behavior. This measure was not 
related to scores on the Wesley scale. 

This paper is not concerned directly with the 
validity of the Wesley scale but rather with certain 
aspects of the scale as a scale. Examination of the 
scale reveals what appears to be considerable 
overlapping of items, for example, “I am a me- 
thodical person in whatever I do," and “People 
who go about their work methodically are almost 
always the most successful,” and finally, “It is the 
slow, steady worker who usually accomplishes the 
most in the end.” There are nine sets of such
potential identities, totaling 26 items. In addition, 
there are four sets, taking in 11 items, which seem 
to be absolute opposites, like “I prefer to stop and 
think before I act even on trifling matters," and 
“I have taken a good many courses on the spur of 
the moment."

There is no reason to believe that the addition
of an identical item or an item which is identically 
opposite enhances the efficacy of a scale. A test 
composed of many such items can hardly be 
considered a true scale. Whether or not the items 
are actually overlapping is best determined by 
empirical investigation. 

From Wesley's original group of items (4), the 
authors selected 12 that did not seem to overlap 
and which they consider to be satisfactory for a 
rigidity measure.²


1 This investigation was supported by Research
Grant MH-301 from the National Institute of Mental
Health of the National Institutes of Health, U. S.
Public Health Service. This grant supports the Preventive
Psychiatry Project which operates within the
Child Welfare Research Station of the University of
Iowa. The research was conducted directly under the
auspices of the Preventive Psychiatry Project.

2 The numbers are the same as in Wesley's original
form (4).


1. I am often the last one to give up trying to do a
thing.

2. There is usually only one best way to solve most
problems.

11. I dislike to change my plans in the midst of an 
undertaking. 

12. I never miss going to church. 

14. I would like a position which requires frequent 
changes from one kind of task to another. 

20. I do not enjoy having to adapt myself to new and 
unusual situations.³

28. My interests tend to change quickly. 

40. I am always on the lookout for different ways of 
doing something. 

42. I always finish tasks I start, even if they are not 
very important. 

44. When I have undertaken a task, I find it difficult 
to set it aside, even for a short time. 

60. I like to surprise my friends by unexpected actions. 

62. I find it difficult to change my way of doing some- 
thing even though it may not be successful. 


The test-retest reliability of this short scale 
was .74 for an N of 31 undergraduates. It corre- 
lated .73 with the 41-item Wesley scale for 123 
subjects. In two groups totaling 87 subjects, the 
average correlation between the short scale and 
the California E scale was .38 with a standard 
error of .11. 

These data on the 12-item form compare very 
favorably with the 41-item Wesley scale. Re- 
liability of the 41-item form is not reported any- 
where by Wesley, but the authors computed 
Kuder-Richardson reliabilities for Wesley's data 
(4) and for those of Katz (2) and Moldawsky (3). 
The coefficients are, respectively, .68 for 284 sub- 
jects, .59 for 72 subjects, and .67 for 453 subjects. 
According to Froelich (1), the Kuder-Richardson 
should not underestimate the split-half reliability 
by more than .06. We may assume, then, that split- 
half coefficients for the 41-item scale would not 
exceed .74. Since the split-half is commonly considered
to be a best estimate of the test-retest
reliability, we may say that the 12-item form is as 
reliable as the 41-item or the 50-item forms. 
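
The bound used in this argument is simple arithmetic, and a short sketch makes the step explicit. It assumes only the figures quoted in the text (the three Kuder-Richardson coefficients, the .06 margin attributed to Froelich, and the short form's test-retest value of .74); the variable names are illustrative.

```python
# Kuder-Richardson coefficients computed by the authors for the 41-item form.
kr_coefficients = {"Wesley": 0.68, "Katz": 0.59, "Moldawsky": 0.67}

# Margin cited in the text: KR understates split-half by no more than .06.
MAX_UNDERESTIMATE = 0.06

# Test-retest reliability reported for the 12-item short form.
SHORT_FORM_RETEST = 0.74

# Largest split-half value each data set could imply under that margin.
split_half_upper_bounds = {
    source: round(kr + MAX_UNDERESTIMATE, 2)
    for source, kr in kr_coefficients.items()
}
highest_bound = max(split_half_upper_bounds.values())

print(split_half_upper_bounds)
print(highest_bound <= SHORT_FORM_RETEST)
```

The highest bound comes from Wesley's own data; the other two data sets imply lower ceilings.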

Katz (2) reported a correlation of .26 between 
the 41-item scale and the California E scale. This 
correlation is significant at about the .03 level. 
The comparable correlation obtained with the 
short form is .38, which is significant well below 
the .01 level. Apart from Wesley's (5) results with
the Wisconsin Card Sorting Test (which are not
above criticism), the E scale is the sole potential
criterion to which the Wesley scale has been found
to be related. In this respect, too, the 12-item test
is at least as good as the longer form.

3 This item is phrased positively in the original form.
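
The significance levels quoted for the two E-scale correlations can be roughly reproduced. This is a hedged sketch, not the authors' procedure. It makes three assumptions: Katz's correlation rests on the same 72 subjects as the reliability figure, Fisher's z transformation with a normal approximation is adequate, and the short-form correlation can be tested by dividing it by its reported standard error. The large-sample formula 1/sqrt(N - 1) is included only to show that a standard error near .11 is plausible for 87 subjects.

```python
import math

def two_sided_p_normal(z):
    """Two-sided p value under a normal approximation."""
    return math.erfc(abs(z) / math.sqrt(2))

def fisher_z_test(r, n):
    """Test a correlation against zero via Fisher's z transformation."""
    z = math.atanh(r) * math.sqrt(n - 3)
    return z, two_sided_p_normal(z)

# 41-item form vs. E scale (Katz): r = .26; N = 72 is an assumption.
z_long, p_long = fisher_z_test(0.26, 72)

# Short form vs. E scale: average r = .38 with reported standard error .11.
z_short = 0.38 / 0.11
p_short = two_sided_p_normal(z_short)

# Large-sample standard error of r near zero for the 87 short-form subjects.
se_approx = 1.0 / math.sqrt(87 - 1)

print(round(p_long, 3), p_short < 0.01, round(se_approx, 2))
```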

The distribution of scores obtained with the 
short form is almost identical with those obtained 
with the long form. The coefficient of variation 
for the distribution of scores for the 123 subjects 
on the short form is 26 (a mean of 6.46 and a sigma
of 1.69). The V's derived from the data of Wesley,
Katz, and Moldawsky are respectively 24, 26, 
and 28. 
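
The coefficient of variation for the short form follows directly from the reported mean and sigma. As a brief illustration, assuming the conventional definition V = 100 × sigma / mean (the function name is illustrative):

```python
def coefficient_of_variation(mean, sigma):
    """Coefficient of variation, V = 100 * sigma / mean, as a percentage."""
    return 100.0 * sigma / mean

# Short-form figures reported in the text for the 123 subjects.
v_short_form = coefficient_of_variation(6.46, 1.69)

print(round(v_short_form))
```

The rounded value matches the figure given in the text and falls within the range of the three long-form values.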

It is quite true, of course, that the .73 relation- 
ship between the short and long forms is influenced 
by the presence of the short-form items within the
long form. This circumstance, however, is im- 
material for our purpose, which is merely to 
demonstrate that at the present stage of develop- 
ment, the short form may be substituted for the 
long form with savings of time and effort. The 
correlation between the two forms is as high as 
the reliability of either form, indicating that to all 
intents and purposes, the 12-item scale and the 41-item
scale are comparable forms of the same
measure.


REFERENCES 


1. FROELICH, G. J. A simple index of test reliability.
J. educ. Psychol., 1941, 32, 381-385.

2. KATZ, A. A study of the relationships among several
measures of rigidity. Unpublished doctor's dissertation,
Univer. of Iowa, 1952.

3. MOLDAWSKY, S. An empirical validation of a rigidity
scale against a criterion of rigidity in an interpersonal
situation. Unpublished doctor's dissertation,
Univer. of Iowa, 1951.

4. WESLEY, ELIZABETH. Perseverative behavior in a
concept formation task as a function of manifest
anxiety and rigidity and of punishment. Unpublished
doctor's dissertation, Univer. of Iowa,
1950.

5. WESLEY, ELIZABETH. Perseverative behavior in a
concept-formation task as a function of manifest
anxiety and rigidity. J. abnorm. soc. Psychol.,
1953, 48, 129-134.


Received February 4, 1954. Prior publication.


REVIEWS 


THE THERAPEUTIC COMMUNITY. A NEW TREATMENT
METHOD IN PSYCHIATRY. By Maxwell
Jones in collaboration with A. Baker, T.
Freeman, J. Merry, B. A. Pomryn, J.
Sandler, and Joy Tuxford. Foreword by
Goodwin Watson. New York: Basic Books,
1953. Pp. xx + 186. Price $3.50.

This monograph by a British social psychiatrist 
and his collaborators describes an attempt to de- 
velop a new method for the vocational and psycho- 
logical rehabilitation of those most difficult of all 
nonpsychotic psychiatric patients, the chronically 
unemployable neurotics, psychopaths, and charac- 
ter disorders. Sponsored by the Ministries of 
Health, Labour, and Pensions, an Industrial 
Neurosis Unit was established with a small staff 
of psychiatrists, psychologists, social workers, 
government “disablement resettlement” officers, 
occupational instructors, a research technician, 
and nurses. Patients with psychiatric disabilities 
who had particular employment problems were 
sent to this unit from around the country. The 
method of treatment that was developed is 
clearly presented in the text with reports on its 
psychiatric, psychological, vocational, and social 
aspects. Some interesting research designed to im- 
prove procedures, evaluate the effect of the method 
against certain controls, and analyze pertinent 
factors in the employability prognosis of the pa- 
tients is included, making for a well-rounded, 
thought-provoking contribution to the modern 
vocational rehabilitation literature. 

After several brief accounts of earlier units, the 
treatment method of this Industrial Neurosis Unit 
is described in detail. Shortly after receiving fairly 
intensive psychiatric and psychological evaluation, 
new patients are required to begin working regu- 
larly (four hours per day) at one of a number of 
semiskilled or unskilled jobs available in the hos- 
pital workshops or nearby industries. The aim is 
“to get them back to the habit of work.” This 
work is closely supervised by the staff and is 
extensively supplemented by morning group discus- 
sions, some individual psychotherapy, and con- 
siderable emphasis on and opportunity for social- 
ization, group communication, and hospital 
community responsibility. While the focus of con- 
cern is always vocational rehabilitation, the basic 
objective toward which the program is geared is 
that of providing for these asocial, embittered per- 
sonalities a rich and rewarding experience in group 
living, in belongingness, and in the satisfactions of 
community effort and responsibility. To accomplish
this objective, two major approaches were
involved: (a) an imaginative and varied use of
group psychotherapy for the patients and (b)
a marked reorientation of the attitudes and role
of the hospital staff. The closeness of living and 
working in this benevolent but fairly structured 
environment provided extensive material for the 
morning group psychotherapy sessions. Didactic 
lectures, opportunities for group catharsis, some 
quasi-analytic group methods, as well as role- 
playing, psychodrama, and free-expressive meth- 
ods (drawing, for example) were all employed. The 
integration of these varied techniques of modern 
group dynamics seems especially skillfully handled 
and is well exemplified throughout, particularly 
in the rather dramatic excerpts presented in the 
appendix. 

Perhaps even more unique in this study is the 
extensive concentration on developing within the 
staff an atmosphere and general orientation con- 
ducive to helping the patients on their road to 
socialization. The self-searching consideration of 
the various professional roles, the use of enlight- 
ened group dynamics techniques within the staff 
conferences are in many ways the most daring and 
refreshing feature of the entire project. In fact, 
the many practical problems of changing functions 
and staff attitudes which are clearly presented 
should prove especially provocative to readers 
from the various disciplines engaged in the voca- 
tional rehabilitation of the psychiatric patient. 
Translation of these British methods to the re- 
habilitation work in our own state and veterans 
hospitals, even with the considerable differences in 
the extent of centralized national control, should 
prove relatively feasible. 

Regarded as a progress report on the first stages 
of a long-term effort, this monograph is indeed 
valuable. True, it is short on theory, and the discussion
of group dynamics and the therapeutic
process is somewhat superficial. Nevertheless, the
manner in which ongoing research is integrated 
with the therapeutic process as an essential feature 
of the total program is gratifying, whatever the 
limitations of the actual data. The practical prob- 
lems raised and the nature of the material have a 
social realism that extends far beyond the limited 
scope of many clinical investigations. Thus, a
deeper note is struck when the caseworker doing 
the follow-up home visits comments on the sig- 
nificant lack of social cohesion she observed in 
many communities. Problems in marriage, em- 
ployment, and juvenile delinquency are increas- 
ingly being solved by resort to agencies and ex- 
ternal officials rather than by initial efforts within 
the community or even within the family structure 
itself. In the face of this broader weakness in the 
social scheme, the admittedly tentative results obtained
by these investigators in their effort to
integrate these chronically asocial patients as productive
members of the community are certainly
encouraging.
JEROME L. SINGER
Franklin D. Roosevelt VA Hospital, 
Montrose, New York 


THE NATURE OF NONDIRECTIVE GROUP PSYCHOTHERAPY.
AN EXPERIMENTAL INVESTIGATION.
By Leon Gorlow, Erasmus L. Hoch,
and Earl F. Telschow. Foreword by Nicholas
Hobbs. New York: Bureau of Publications,
Teachers College, Columbia University,
1952. Pp. viii + 143. Price $3.25.

This study was designed to investigate with 
quantitative precision various parameters of the 
process of nondirective group psychotherapy. Ac- 
knowledging the tentative, exploratory nature of 
the investigation, the authors outline a threefold 
objective: (a) analysis of the uniformity of group 
patterns of verbal behavior as a function of 
variables like time, differing membership, and 
leadership; (b) delineation of the therapeutic value 
of the group, per se; and (c) evaluation of the role 
of the group leader. To achieve these goals each 
of the investigators led a nondirective psycho- 
therapy group consisting of graduate students 
who expressed a desire for help. There were 17 
subjects divided among the three groups. They 
met twice weekly for approximately ten weeks. 
Prior to therapy, subjects were evaluated for ad- 
justment by means of the Rorschach, modified 
Hildreth Self-rating checklist, and an incomplete 
sentences test. Following treatment the last two 
tests and the investigators’ judgments were em- 
ployed to divide clients into most- and least-profited 
groups. 

After an intensive evaluation of the reliability of 
ratings based upon recorded protocols, a chapter 
by Hoch provides data relevant to the hypothesis 
that there exists a characteristic nondirective group 
psychotherapy process. Thus, protocol ratings for 
the three groups reveal that expression of positive 
emotion increases with the passage of time, while 
negative attitudes reach their peak midway in the 
therapy cycle and subsequently decline. A sub- 
process is suggested by the finding that while the 
most-profited clients show a decline in “poorest” 
client behavior (confusion, defensiveness, negative 
attitudes towards self and others), the least-profited 
subjects persist in such behavior throughout the 
treatment cycle. Gorlow's section on the clients’ 
activity as therapists reveals that therapeutic be- 
havior denoting permissiveness and acceptance 
increases, while evaluation and criticism decline. 
Subjects rated from their Rorschachs as less 
anxious or hostile are categorized by therapist 
behavior involving more positive and less negative 
categories than that shown by more anxious group
members. Finally, Telschow's section provides
data suggesting relative uniformity of leader be- 
havior with 98 per cent of leaders’ statements 
categorized as nondirective. Except for a trend 
suggesting that greatest client release is evoked by 
clarification of feeling, results in this section are 
largely negative. In all, 40 conclusions are derived 
from the data by the three investigators. 

This book is impressive by virtue of the ex- 
tensive analyses of a wealth of protocol material. 
Certainly the collaborators have demonstrated 
many ingenious methods for quantifying group 
therapy data and they have laid bare many param- 
eters of the treatment process that merit considera- 
tion. Nevertheless, even with the authors' fre- 
quently reiterated words of caution concerning the 
study's limitations, one cannot help but be dis- 
appointed with the book. The absence of any 
control groups raises a very serious question as 
to whether effects like the temporal sequences re- 
ported are peculiar to nondirective or even to any 
psychotherapeutic group. Can we be sure that a 
discussion group on mental health might not re- 
flect similar patterns over a period of 20 sessions? 
Similarly, the criteria for evaluation of the most- 
and least-profited groups are not made sufficiently 
explicit, and they appear to be inadequate. The 
self-ratings or sentence completion items may 
simply elicit the very same response that charac- 
terizes the self-attitude statements rated from the 
therapy protocols. This seems to imply somehow 
that people who begin to make positive statements 
about themselves and others after 20 sessions of 
group therapy have profited from the treatment. 
This is a rather naive or narrow view which could 
have been clarified by more explicit presentation 
of external criteria of personality change to avoid 
the possibility that what develops among some 
group members is merely a mutual self-admiration 
society. The small number of subjects imposes a 
serious limitation upon the statistical treatment. 
As a consequence many of the results are incon- 
clusive rather than clearly negative, and the in- 
vestigators several times are put in the position of 
accepting the null hypothesis with correlations of 
the order of .50 or p’s in the area of .10. The many 
trends suggested by the data deserve a more ade- 
quate test for conclusive acceptance or rejection of 
a number of important experimental hypotheses. 
Finally, although the book is clearly written, it is 
extremely abstract and dehumanized. The absence 
of actual protocols, frequent citations, or excerpts 
leaves one confused as to what actually goes on in 
the nondirective group psychotherapy session. 
This deficiency not only impairs the interest value 
of the book but denies the reader an opportunity to
evaluate for himself the relevance of various response
categories and of the authors’ descriptions
of the therapeutic process. In summary, then, this
book is successful in providing interesting hypotheses
and techniques for analyzing protocols.
The limitations of its experimental design, however,
prevent an evaluation of either the unique
characteristics or therapeutic value of nondirective
group psychotherapy.
JEROME L. SINGER 


Franklin D. Roosevelt VA Hospital, 
Montrose, New York 


THREE MEN: AN EXPERIMENT IN THE BIOGRAPHY
OF EMOTION. By Jean Evans. New York:
Alfred Knopf, 1954. Pp. xviii + 298. Price
$3.75.

The psychology teacher who would bring his 
students the raw materials of human living finds 
himself facing a singular dilemma. Scientific liter- 
ature is replete with case studies setting forth the 
salient facts about the lives of various people. In 
this journal alone, in the decade 1939-1949, no 
less than 25 such studies appeared. However, 
nearly all available case studies, when considered 
as teaching materials, are found to have the serious 
drawback of including the writers’ hypotheses and 
interpretations. This is no doubt desirable from 
the point of view of scientific communication. But 
what happens when you put such a document in 
the hands of a student? Usually he feels that there 
is little he can do with, or add to, the material. The 
author, he feels, knows more about the subject than 
he does, and has already set forth the most likely 
hypotheses concerning dynamic and causative 
factors in the subject's life and developed them 
more adequately than he, the student, could pos- 
sibly do. So he is often reduced to reading the study 
and wondering, in a frustrated sort of way, at the 
wisdom of the author; he feels also, perhaps, that 
he will never reach that point of wisdom in his 
own scientific development. 

What are needed here are the sheer, specific 
facts of a single life, unadorned by any interpreta- 
tion, explanation, or elaboration. True, there must 
be selection. No one tome would be large enough 
to contain all the facts of even the most ordinary 
life. But what the student needs is a selection of 
those facts that mirror the life of the subject, as 
he lived it, in his own world. Schools of law and 
of business administration have pioneered in 
adapting the “case method” of instruction to the 
social sciences, and in developing case materials 
free of interpretation and explanation. Donham 
at Harvard and Berrien at Colgate have been 
leaders in this effort. A further drawback, for 
which the authors can hardly be blamed, charac- 
terizes most of the case materials so far published. 
The scientific or professional psychologist is rarely,
if ever, an artist and craftsman in the field of
letters. His presentation of the facts of a life story
may achieve objectivity, but it seldom rises to that
graceful and inspiring verisimilitude that distinguishes
the work of a really good professional
writer.

Jean Evans is a professional journalist and
author whose work combines in a truly unusual 
way the sensitivity and felicity of expression of the 
artist with a passionate devotion to the ideal of 
what Allport has called “portraiture.” “Walk
through the slum section of any American city 
some evening. Pause at the poolrooms, the gyms, 
the dingy bars, the candy stores, and certain street 
corners where boys and young men gather. Any 
one of them might be a Johnny Rocco.” Thus be- 
gins the life story of “Johnny Rocco,” which first 
appeared in this journal in 1948. In 1950 the 
journal published Miss Evans’ “portrait” of 
“William Miller,” son of Danish immigrants to a 
small town in Pennsylvania. He had lost both 
parents by the time he was six months old. In the 
course of an unusual life, involving a variety of 
hardships and ranging over a wide geographical 
area, he lost and recovered his vision several times 
in ways and under circumstances that totally con- 
founded both his doctors and his friends. 

The unusual quality of Miss Evans’ writing 
lies in the happy blend she has achieved of the 
good qualities of both artist and scientist. Like 
the former, she immerses herself in the material 
more deeply and for a longer period than the im- 
patient scientist is generally willing to do. Long 
interviews with her subject alternate with patient 
study of all available documents and interviews 
with many different people who have known the 
subject. Unlike the scientist also, she seeks to 
avoid explanation, pointing to causes, and theories 
of personality. And, unlike the artist, she rigor- 
ously avoids the “creative” act of fashioning a 
character out of the many bits and pieces that 
may have come from as many different people. 
Instead she seeks to reflect into the mind’s eye 
and ear of the reader an accurate reproduction, 
a “portrait,” of what her subject actually looks 
like and how he sounds. She sets forth her picture, 
to a very considerable extent, in the very words the 
subject used in talking to her. 

Those of us who have used these cases in our 
teaching know how absorbing the students find 
them, how eagerly they become concerned with all 
kinds of questions that come to mind, how profuse 
and varied are the hypotheses they bring forth to 
explain, to predict, to understand. And these
products of the students’ working with the ma- 
terial have the cardinal virtue of being their own, 
for Miss Evans offers none. Because she has satu- 
rated herself with the material, and because she 
brings to bear all her art and skill as a writer, she 
simply makes the person live in the perceptions 
of the reader. 

Now, if we wish to inculcate the disciplines of
scientific caution and systematic verification, we
find this material well adapted to such purposes. 
Once the hypotheses have been brought forth and 
noted down, we can go back to the document and 
test them against the facts. We may even invite 
the student to interrupt his reading at some point 
and undertake the hazardous venture of attempting 
a prediction or two. For example, most students 
who used the original Rocco material, which takes 
him to February 1947, predicted a far less well 
adjusted and socially productive Johnny than the 
follow-up material in this book, which brings him 
up to September 1952, shows him to have become. 

The first two “portraits” in this book are of 
men at least a significant portion of whose lives 
was lived under conditions of real poverty. In the 
case of the third man, however, this was not the 
case. “Martin Beardson’s” father was a prosperous 
business man, his mother an “upper class” woman 
of “genteel tradition.” Her people, Martin says, 
“were the leading people in town.” “Except for 
brief periods, and then only in recent years, Martin 
has never known deprivation in terms of food, 
clothing, and comfortable shelter. By realistic 
standards he also enjoyed many luxuries.... Yet 
Martin has always had a sense of poverty no less 
poignant than if he had been a slum child, and a 
sense of grievance no less biting than if he had 
lived all his life on the periphery of want” (p. 195). 
The student will ask “Why?” and the answer, 
I dare say, is not to be found solely in the fact that 
Martin's father and mother separated when he 
was but two years old. 

An ever-present problem of scientific investiga- 
tions that comes to mind again while reading these 
lives is that of the effect of the operation of ob- 
servation on the very process observed. Miss Evans 
writes, in a footnote to “Miller,” “It is to be borne 
in mind that the censure against which Miller 
constantly pleaded existed solely in his mind. 
Although I was genuinely interested and sym- 
pathetic, my primary relationship to my subjects 
is that of ‘historian,’ and not of counselor; in so 
far as humanly possible, my attitude was one of 
friendly neutrality” (p. 144). Even so, clinical 
experience has clearly demonstrated that the role 
the questioner assigns himself and the role in which 
he is perceived by the one he questions are fre- 
quently, if not always, different. And I found my- 
self wondering, especially while reading the follow- 
up of Rocco, what effects, in defining his own roles 
and goals, the many hours of association with Miss 
Evans may have had on Johnny. That Miss Evans 
was not unaware of this possibility seems clearly 
indicated in the passage quoted above. 

One wonders also why it is that Miss Evans 
herself, and I imagine many of her readers as well, 
comes to think of two of these men by their first 
names and one of them by his last. It is Johnny 
and Martin, but never William or Bill, just Miller.

This is a book that will be read with pleasure 
and pondered with profit by many scientists and 
professionals in clinical, social, and dynamic psy- 
chology, in sociology, psychotherapy, social work, 
and a number of other related fields. It will be of 
wide and lasting usefulness in classes and seminars 
in any field of study where the focus is on the lives 
of real people as they are actually lived in this 
world of ours.

AUSTIN B. WOOD
Brooklyn College


THE ACHIEVEMENT MOTIVE. By David C. McClelland,
John W. Atkinson, Russell A. Clark,
& Edgar L. Lowell. New York: Appleton-
Century-Crofts, 1953. Pp. xxii + 384. Price
$6.00.

This book reports a swarm of little experiments,
no one of which is more than a pilot study, but 
all of which are focused on one human need. The 
host of miniature demonstrations creates a fas- 
cinating pattern, from which the reader, like the 
authors, may learn "not only a lot about the 
achievement motive but other areas of personality 
as well." At the same time, the reader keeps glimpsing
idea after idea for use in his own future research.
He finds provocative inductive treatment
of the nature of the achievement motive, its effects 
on behavior, especially school performance, and 
its origins in infancy and culture. All this “builds 
up the total picture out of many small experi- 
ments by a slow process of going from fact to 
hypothesis and back to fact again.” It makes a 
methodologically fascinating illustration of what 
can be done inductively to explore a new field when 
the explorers possess the proper wit. 

The experimental studies are preceded by a long 
theoretical chapter by McClelland on the nature 
of animal and human motives. In this chapter, 
as in the Introduction, one perceives that the 
author has come only a little way beyond the 
crude understanding of motivation provided by 
animal studies, in which the drives are always 
“primary” and arousal nearly always by depriva- 
tion. Even though at one point the writer ex- 
plicitly acknowledges that human secondary needs 
do not have to be in a state of deficit to be aroused 
and that an earlier study of the projection of need 
Food may not be the best preparation for the study 
of the projection of need Achievement, the im- 
portant differences between primary drives and 
secondary motives are never sufficiently explored. 
Even though McClelland acknowledges himself 
to be nearer to Hebb than to Hull, he starts from
the reductionist postulate that “a motive is the 
learned result of pairing cues with affect,” and 
so dooms himself to underestimate the sophistica- 
tion of his human experimental animals. One is 
a bit startled to read that the only difference be- 
tween his own motivation and that of a dog in heat 


478 


is that in him "behavior is controlled not so much 
by afferent stimulation but by anticipation." 
Drive-reduction postulates apparently do not lend 
themselves to anything like a realistic description 
of “the realm of value." It seems to this reviewer 
that both the theory propounded in McClelland's 
opening chapter and the measure that became the 
basis of all his experiments are therefore based on 
a dubious premise.

The consequence for experimental design was 
that “we accepted the notion from experimental 
studies of animals that motives could be experi- 
mentally aroused and their intensity controlled 
by manipulating arousal conditions. Our problem, 
then, boiled down to the attempt to arouse and 
control the intensity of a human motive and to 
measure its effect on imagination or fantasy... .” 
The projective hypothesis, derived from clinical 
psychology, was thus by-passed. However, the 
author only says, allusively, that “in this way we 
would not be forced to make any a priori assump- 
tions as to what characteristics of fantasy indicate 
the presence and intensity of a certain motive.” 
Rather, as McClelland has remarked elsewhere, 
the TAT signs of strong need Achievement could 
be identified “by a kind of content analysis" of 
stories told by subjects under control conditions 
and under conditions of reputedly strong labora- 
tory arousal. The anomalous result is a 300-page 
book about the use of a modified Thematic Ap- 
perception Test in which the projective hypothesis 
is never discussed! 

The new, laboratory-produced measure is called 
“need Achievement,” we are told, “after Murray!” 
It bears little resemblance to that need as Murray 
has defined it. Nor do the authors seem to be aware 
of the confusion they have created. In their study 
of the infantile roots of need Achievement, for 
example, the experimental design tacitly assumes 
that their need Achievement, defined by the effect 
of verbally stimulating experimental subjects a 
few minutes before testing, and Murray’s need 
Achievement, defined by evidence of an enduring 
orientation in the person, are identical. 

Since, quite rightly, the authors anticipate that 
"the problems uncovered in the course of this 
research have been so numerous and complex 
that . . . their solution will require the cooperative 
effort of many scientists," one anticipates chaos! 
Many who will want to, or already have begun to, 
follow up on McClelland's leads will be accustomed 
to the more traditional use of the TAT. Worse, 
many who are only now becoming familiar with 
the TAT may uncritically apply McClelland's 
nonprojective measure when making clinical use 
of the projective test. But if confusion needs con- 
founding, let us envisage what will happen when, 
as McClelland et al. anticipate, their “practical 
method of measuring one of the most important
human motives . . . can be applied to other motives
with equal success!” 

Let us hope that before we have two of every 
human need, occasion may be found by some ex- 
perimenter to discover the empirical results of 
measuring the two need Achievements side by side. 


CHARLES C. MCARTHUR
Department of Hygiene 
Harvard University 


WHO SHALL SURVIVE? FOUNDATIONS OF SOCI-
OMETRY, GROUP PSYCHOTHERAPY AND SO-
CIODRAMA. (Rev. Ed.) By J. L. Moreno.
Beacon, New York: Beacon House, 1953.
Price $10.00.

GROUP DYNAMICS: RESEARCH AND THEORY.
Edited by Dorwin Cartwright and Alvin 
Zander. Evanston, Ill.: Row, Peterson & Co., 
1953. Price $6.00. 

In the Foreword to his book, Personality: a Bio- 
social Approach to Origins and Structure, Gardner 
Murphy described the volume as somewhat like 
“an explorer’s kit, containing, to be sure, some 
standard tools, and also some maps. Some of the 
maps are sober, some eccentric, no doubt; but all 
are drawn in the belief that any map of a far 
country encourages more travel than an archi- 
tect’s finished representation of the doorway as 
it is here and now” (p. x). Imbued with this phi- 
losophy, the present reviewer welcomes the ap- 
pearance of the revised edition of Who Shall 
Survive? and of Group Dynamics. Each is an im- 
portant volume which has much to contribute to 
improving our understanding of the social nature 
of man and the nature of man’s social behavior. 
To those of us searching for a “world” view in the 
field of social psychology, the special projections 
of the cartography of these volumes must be 
properly identified before the maps can be re- 
drawn and integrated. Moreno offers us a com- 
plete theoretical globe. “Sociometry cuts through 
all social sciences as it deals with social phenomena 
at a deep level where they merge or more precisely 
before they ‘e’merge into ‘psychological,’ ‘socio- 
logical,’ ‘anthropological,’ or ‘economic’ phe- 
nomena” (p. 54). Cartwright and Zander, on the 
other hand, offer us half a dozen illustrations of 
the cartographer’s art without too clear an indica- 
tion of how they are interrelated, what the con- 
tiguous areas may be like, on what basis they were 
chosen—but each does depict a large and rich 
terrain, and the maps are beautiful.

Though these two books are very different in
many ways, it seems desirable to consider them
in a single review in order to contribute a counter,
however feeble, to the schismogenic forces cur-
rently operative within the field. Because each is
likely to appeal to somewhat different audiences, 
on temperamental grounds, it is well to reaffirm
the need for cognitive awareness of their related-
ness. Perhaps it would be well to note some of the
differences that are likely to evoke differential 
conative responses—in the hope that forewarned 
is forearmed. Moreno’s writing includes a large 
measure of personal testament; Cartwright and 
Zander’s work is a collection of 35 papers to which 
the editors have added six integrative theoretical 
“introductions”; only one of the reprinted papers 
has an editor as author. Cartwright and Zander
are very modest in their offerings. Moreno rejects 
the desirability of modesty. Cartwright and Zander 
conform to the sobriety of the “scientific” style 
of writing; Moreno is, in much of the book, spon- 
taneous, personal, anecdotal. The Moreno volume 
is much longer than the Cartwright and Zander 
book; it contains over 200 more pages, stands a 
full inch wider on the shelf (though of the same 
height and depth), and is correspondingly heavier. 
Let us turn now, however, to a consideration of 
each volume by itself. 

Dr. Moreno's new version of Who Shall Survive? 
retains the same central research material that was 
presented in the 1934 original. However, the con- 
text is very different. The development of the 
sociometric movement during the intervening 
20 years is reviewed in “Preludes,” which includes 
also additional material on the pre-1933 back- 
ground. It is this section of about 100 pages that 
departs most from traditional scientific writing. 
However, this is Moreno's statement of the history 
of “his” approach. The pages are given Roman 
numerals, and even if unusually long, are com- 
pletely in the character of prefaces to which we 
are more accustomed. There are in the Moreno 
personal statement such items as characterization 
of the book as a bible (“The Words of the Father,”
the old testament; “Who Shall Survive?” the new,
p. lxvii); an insistence on sole authorship of the
book, emphasizing that “An idea book like WHO
SHALL SURVIVE? cannot be conceived ‘in collabora-
tion’” (p. lxxxvii) (yet in the 1934 edition, Helen
H. Jennings is described as a collaborator); the 
description of an imaginary correspondence with 
H. A. Murray in which “he sends me letters which 
he never actually wrote and I send him letters, 
often in the middle of the night, which he receives 
promptly and without benefit of mail carriers" 
(p. xc); and an attack upon the integrity of some 
of the leaders of the current group dynamics move-
ment (p. cii ff.). However future historians may
accept or "correct" the version Moreno gives, 
these Preludes are important documentation for 
a history of the social science, and valuable for 
a study of the process of writing history. Moreno 
recognizes that there is far more controversy about 
his person than there is about his theories. He con- 
cludes this presentation with the statement: “I 
have tried to explain why an individual of my 
type was particularly suited to produce them. I 
hope that I succeeded some and that everyone,
friends and enemies, will enjoy them, because they
are autobiography ‘unadulterated.’ Now I fear 
only that many will read the Preludes and not the 
book" (p. cviii). Your reviewer shares these fears 
and has dwelt on this section of the book to make 
explicit what is likely to prove the major draw- 
back for many readers. Other readers, however, 
will appreciate the insights this kind of statement 
will give them into the nature of the originator of 
one of the most productive approaches currently 
available to students of social processes. 

The body of the book remains much as in the 
original. The first five parts, the sociometric 
system, evolution of groups, sociometry of a com- 
munity, construction and reconstruction of a com- 
munity, and sociometric planning of society, 
follow, as in the first edition, the report of original 
researches, with only minor changes in the dis- 
cussion, paragraphing, and headings. The final 
section, however, “Who Shall Survive?” is greatly
elaborated. Instead of the illustrative materials 
included in the 1934 version, we now have about 
100 pages of “The Sociometric System and Ad- 
vanced Sociometric Theory." In this section, in 
addition to summaries of further research data, 
about 30 pages are devoted to developing the 
statistics for sociometric measurement and re- 
search based on deviation of obtained from chance 
findings. Finally, starting on page 696, there are 20 
pages of "General Hypotheses and Recommenda- 
tions for Further Research." In some ways these 
hypotheses may be considered as introduction, 
in some ways as summary for the 1953 position 
of sociometry. 

It is interesting to note that the first edition 
contained only 41 items on a one-page bibliog- 
raphy, while the new volume claims over 1,300 
references in a 31-page bibliography. This single 
quantitative index may serve as a clue to the im- 
pact of Moreno's ideas during the 20-year period, 
and the difference between the original and the 
second edition of the book. 

The volume edited by Cartwright and Zander 
brings us to the first major effort to organize and 
make available the theory and findings, the 
methods and interpretations of those identified 
with the “group dynamics” movement. The editors 
felt that it would be premature to prepare a sys- 
tematic "text" inasmuch as there were still too 
many areas of thinking for which there was no 
empirical test yet, and too many significant studies 
that would not fit easily into a single theory. They 
have chosen to combine integrative interpreta- 
tion, in the form of introductions to each of the 
six sections of the book, with the reproduction of 
the original research papers that have contributed 
most significantly to their formulations. It is, 
therefore, both text and readings. The introduc- 
tions provide general perspective on the selected 
study areas, overviews and integrations of the
readings included, and questions for further re-
search.

The book is divided into six parts: approaches
to the study of groups, group cohesiveness, group
pressures and group standards, group goals and 
group locomotion, the structural properties of 
groups, and leadership. Each part stands by itself, 
with introduction and from three to ten readings.
Though the introductions establish the theoretical
orientation toward the group dynamics approach 
(and of this approach to various problems), there 
is no clear continuity from one part to another. 
Thus, part five, on the structural properties of 
groups, does not refer to leadership, and part six, 
on leadership, seems independent of the discussion 
of the structural properties of groups.

The 35 readings range in time of original ap-
pearance from one by French which appeared in
1941 through one specially prepared by Kahn 
and Katz for this volume to two still to appear in 
other vehicles (but already committed). Thirty 
may be identified as having appeared in the 1950's. 
Ten of the selections are from Human Relations, 
eight from the Journal of Abnormal and Social 
Psychology, six are selections from books, three 
from the American Sociological Review, and the 
remainder single borrowings from various other 
sources. 

The editors describe their criteria for the selec- 
tion of readings thus: “With few exceptions, we 
have included only reports that present empirical 
findings based upon quantitative research, and 
when several related articles were available, we 
attempted to choose the one which best sum- 
marized the total set of research. Some of the 
classics have, accordingly, been left out because 
more recent publications summarize them as well 
as provide additional findings” (p. xi). This state-
ment of intention is well borne out by the contents.
They well reflect the vitality of the research pro-
grams that may be identified with “group dy-
namics.”

The editors also suggest that “a reader who de- 
sires a brief overview of the field of group dynamics 
will discover that a consecutive reading of the 
introductory chapters is useful for this purpose” 
(p. xi). This is probably true, but the present re- 
viewer feels that within this framework more could 
have been done. The first part of the book is pri- 
marily theoretical and tries to describe some of the 
major varieties of approaches to the study of 
groups. It includes selections by Cattell, Bales, 
Stogdill, Scheidlinger, and Jennings. By no means 
would all of these be identified with what is usually
considered "group dynamics" research, nor are
they a complete review of the approaches cur- 
rently employed. On a sampling basis, they may 
serve some purpose in indicating the variety of
approaches being made to similar problems. Yet
in the Preface, the editors acknowledge that they
had to omit work that is more clearly within the
framework of the group dynamicists, for example,
the work of Herbert Thelen, William F. Whyte,
and others. To the extent that one is interested in
the variety of research approaches to these prob-
lems, some of the nonlaboratory approaches ought
to have been included. To the extent that one is 
really concerned with an overview of group dy- 
namics, some of the stimulating special emphases 
and new directions must be added to the reading list. 

Nevertheless, this book does serve a very useful 
function in assembling much of the major work 
in the major areas in which the study of group 
dynamics has progressed, and in providing the 
six general theoretical statements and integrations,
with the suggestions for further work. 

Developments within the field of social psy- 
chology have been rapid. Theory, research, and 
application have all developed apace. It is in-
creasingly difficult to maintain a general perspec-
tive and to "keep up" with the current output.
The two books here discussed illustrate the diffi-
culties. Each represents a “line” one may pursue
with great profit, yet the overlap in terminology,
research methods, reference groups, and training,
and even the channels of communication used, is
relatively slight. And there are other lines that are
nowhere represented. What of the work of Trigant
Burrow and the development of phyloanalysis?
What of the promising trends in group psycho-
therapy outside of the sociometric pattern? What
of the continued developments in attitude, opinion, 
and communication research? These are but a few 
of the major trends that are readily identified. 
The alert specialist must go farther. What new 
insights and points of departure does the trans- 
actional approach offer for the study of social be- 
havior? As each line of promise becomes more 
fully worked out, in the light of the reasonableness 
of the approach and the significance of the findings, 
it becomes increasingly difficult to establish the 
relationships between figure and ground in the 
perception of the general field and to guide such
organization rather than to let the stimulus and 
autochthonous factors prevail. 


EUGENE L. HARTLEY
The City College, New York 


NEW BOOKS RECEIVED 


ALLPORT, G. W. The nature of prejudice. Cam- 
bridge, Mass.: Addison-Wesley Publishing 
Co., 1954. Pp. xviii + 537. Price $5.50. 

BACH, G. R. Intensive group psychotherapy. New
York: Ronald, 1954. Pp. xi + 446. Price
$6.00.

BARRON, M. L. The juvenile in delinquent society.
New York: Knopf, 1954. Pp. xix + 349 +
vii. Price $5.00.

BAUCHARD, P. The child audience: a report on press,
film and radio for children. Paris: Unesco,
1952. Pp. 198. Price $2.00.

BERGLER, E., & KROGER, W. S. Kinsey’s myth of
female sexuality. New York: Grune & Stratton,
1954. Pp. viii + 200. Price $3.75.

BETTELHEIM, B. Symbolic wounds. Glencoe, Ill.:
Free Press, 1954. Pp. 286. Price $4.75.

BLACK, P. The mystique of modern monarchy. Lon-
don: C. A. Watts & Co., 1953. Pp. xii + 92.
Price 8s 6d.

BLACKHURST, J. H. Body-mind and creativity.
New York: Philosophical Library, 1954. Pp.
186. Price $3.00.

BLAKE, FLORENCE G. The child, his parents and 
the nurse. Philadelphia: Lippincott, 1954. 
Pp. xvii + 440. Price $5.00. 

Burrow, T. Science and man’s behavior. New York: 
Philosophical Library, 1953. Pp. xii + 564. 
Price $6.00. 

CHRISTIE, R., & JAHODA, MARIE. (Eds.) Studies
in the scope and method of “The authoritarian
personality.” Glencoe, Ill.: Free Press, 1954.
Pp. 279. Price $4.50.

Drezen, L. Neurosis, crime and progress. Mesnil, 
France: Typographie Firmin-Didot & Cie, 
1953. Pp. 88. No price supplied. 

Drezen, L. Shakespeare, Marc Antony and mass 
persuasion. Mesnil, France: Typographie 
Firmin-Didot & Cie, 1953. Pp. vii + 67. 
No price supplied. 

DURKHEIM, E. Sociology and philosophy. (Trans.
by D. F. Pocock.) Glencoe, Ill.: Free Press,
1953. Pp. xli + 97. Price $2.50.

ELLIS, A. The American sexual tragedy. New York:
Twayne Publishers, 1954. Pp. 288. Price
$4.50.

Evans, JEAN. Three men. An experiment in the 
biography of emotion. New York: Knopf, 1954. 
Pp. xviii + 297. Price $3.75. 

Eysenck, H. J. Uses and abuses of psychology. 
Baltimore: Penguin Books, 1953. Pp. 318. 
Price $ .65. 

FRANK, L. K. Feelings and emotions. New York:
Doubleday, 1954. Pp. vi + 38. Price $ .85.

FRENCH, T. M. The integration of behavior. Vol.
II: The integrative process in dreams. Chicago:
Univer. of Chicago Press, 1954. Pp. xi + 367.
Price $6.50.

FREUD, S. On aphasia. (Trans. by E. Stengel.)
New York: International Universities Press,
1953. Pp. xv + 105. Price $3.00.

FRIEDENBERG, E. Z., & ROTH, J. A. Self-perception
in the university: a study of successful and un-
successful graduate students. Chicago: Univer.
of Chicago Press, 1953. Pp. vii + 102. Price
$1.75.

GANNON, T. Psychology: the unity of human be-
havior. Boston: Ginn & Co., 1954. Pp. xii +
482. Price $4.75.

GANZ, MADELAINE. The psychology of Alfred Adler
and the development of the child. New York:
Humanities Press, 1953. Pp. xxiii + 203.
Price $4.50.

GARDNER, G. E. (Ed.) Case studies in childhood
emotional disabilities. Vol. I. New York:
Amer. Orthopsychiat. Ass., 1953. Pp. vii +
368. Price $5.00.

GEHMAN, R. Driven. New York: David McKay
Co., 1954. Pp. 307. Price $3.50.

GILLIN, J. (Ed.) For a science of social man. New
York: Macmillan, 1954. Pp. vii + 289. Price
$4.00.

GLADWIN, T., & SARASON, S. B. Truk: man in
paradise. New York: Wenner-Gren Founda-
tion for Anthropological Research, 1954. Pp.
655. No price supplied.

GOULDNER, A. W. Patterns of industrial bureau-
cracy. Glencoe, Ill.: Free Press, 1954. Pp. 282.
Price $4.50.

Gray, J. S. Psychology applied to human affairs. 
(2nd Ed.) New York: McGraw-Hill, 1954. 
Pp. vii + 581. Price $6.00. 

HARDING, D. W. Social psychology and individual
values. New York: Longmans, Green, 1953.
Pp. vii + 184. Price $2.40.

INGHAM, H. V., & LOVE, LEONORE R. The process
of psychotherapy. New York: McGraw-Hill,
1954. Pp. ix + 270. Price $5.00.

KAMIAT, A. H. The ethics of civilization. Washing-
ton: Public Affairs Press, 1954. Pp. vi + 80.
Price $2.00.

KARPMAN, B. The sexual offender and his offenses.
New York: Julian Press, 1954. Pp. xiii + 744.
Price $10.00.

LANDER, B. Towards an understanding of juvenile 
delinquency. New York: Columbia Univer. 
Press, 1954. Pp. xv + 143. Price $3.00. 

LIBO, L. M. Measuring group cohesiveness. Ann
Arbor: Univer. of Michigan Press, 1954.
Pp. ix + 111. Price $2.00.

LLOYD, FRANCES. Educating the sub-normal child.
New York: Philosophical Library, 1954. Pp.
vii + 148. Price $3.75.




.' State Coll id 1 
| $5.00. + 
dong H. G: The 


NT pom Yale Univer. Press, 

43. Hs ie qom 
Mian, Mar And lo; the star. New 
ww deh 1953. Pp. 116. Price 
cron, L. S. Mental health in the home. New
York: Bookman Associates, 1953. Pp. 243.
Price $3.50.

MOLONEY, J. C. Understanding the Japanese mind.
New York: Philosophical Library, 1954. Pp.
xviii + 252. Price $3.50.


Puy of y of Shakepeare 
, 1953. Pp. 


NUTTIN, J. Tâche, réussite et échec: théorie de la
conduite humaine. Antwerp, Belgium: N. V.
Standaard-Boekhandel, 1953. Pp. ix + 530.
No price supplied.

PERRY, R. B. Realms of value. Cambridge, Mass.:
Harvard Univer. Press, 1954. Pp. xii + 497.
Price $7.50.

ROSE, A. M. Theory and method in the social
sciences. Minneapolis: Univer. of Minnesota
Press, 1954. Pp. xii + 351. Price $5.00.

ROTHNEY, J. W. M. The high school student: a book
of cases. New York: Dryden Press, 1953. Pp.
xv + 271. Price $1.90.

SAENGER, G. The social psychology of prejudice. 
New York: Harper, 1953. Pp. xv + 304. 
Price $4.00. 

SAPPENFIELD, B. R. Personality dynamics: an
integrative psychology of adjustment. New
York: Knopf, 1954. Pp. xiv + 412 + xvi.
Price $5.50.

SCHILDER, P. Medical psychology. (Trans. and
ed. by D. Rapaport.) New York: International
Universities Press, 1953. Pp. 428. Price $7.50.

SHAPIRO, H. L. Race mixture. Paris: Unesco,
1954. Pp. 56. No price supplied.


c 


lality. New York: eal 

Pp. 248. Price $3.00. 

Know your doctor. Boston: Little,
Brown, 1953. Pp. 173. Price $3.00.

SPELLMAN, H. H. Successful management of matri-
monial cases. New York: Prentice-Hall.
Pp. xiv + 306. Price $5.65.

STOLZ, LOIS M., AND COLLABORATORS. Father rela-
tions of war-born children. Stanford: Stanford
Univer. Press, 1954. Pp. viii + 365. Price
$4.00, paper.

STOTT, D. H. Saving children from delinquency.
New York: Philosophical Library. Pp.
x + 266. Price $4.75.

TAYLOR, W. S. Dynamic and abnormal psychology.
New York: American Book Co., 1954. Pp.
xiv + 658. Price $5.50.

VEDDER, C. B. The juvenile offender. New York: 
Doubleday, 1954. Pp. xii + 510. Price $6.00. 

WEINBERG, S. K. Society and personality disorders.
New York: Prentice-Hall, 1952. Pp. viii +
536. Price $7.65.

WITKIN, H. A., et al. Personality through perception.
New York: Harper, 1954. Pp. xvi + 571.
Price $7.50.

WOLFFHEIM, NELLY. Psychology in the nursery
school. (Trans. by C. L. Hannam.) New York:
Philosophical Library, 1953. Pp. 144. Price
$3.75.

WRIGHT, A. F. (Ed.) Studies in Chinese thought.
Chicago: Univer. of Chicago Press, 1953.
Pp. xiii + 317. Price $4.00.

YOUNG, K. Isn't one wife enough? New York:
Henry Holt, 1954. Pp. xiv + 476. Price
$6.00.

Studi della facoltà medica senese. Siena, Italy: Atti
Accademia dei Fisiocritici, Siena, Sezione
Medico-Fisica, 1953. Pp. 516. No price sup-
plied.


PREDICTION OF FLYING SUCCESS BY CLINICAL
ANALYSIS OF TEST PROTOCOLS 1

W. H. HOLTZMAN
University of Texas

AND S. B. SELLS
USAF School of Aviation Medicine


In relation to the problem of developing
a battery of personality and motivational
measures to predict adjustment to military
and combat flying, the following questions
were considered: (a) Can a group of expert
clinical psychologists, given a description of the
problem and a definition of the criterion, pre-
dict this criterion on the basis of a battery of
personality tests? (b) If such predictions can
be made successfully, what are the effective
predictor signs which the clinicians use, and
how do they integrate these signs to obtain
successful predictions? The experiment re-
ported below was undertaken primarily for
the purpose of developing hypotheses for the
systematic, quantitative scoring of a group of
experimental psychiatric screening tests for
flying personnel.

The six tests in this study were part of one
experimental battery of group tests similar to
those used by psychologists in clinical prac-
tice. These tests are: (a) Background Infor-
mation, a biographical history inventory; (b)
Ink-Blot Test, a group adaptation of the
Rorschach test; (c) Feeling and Doing, a psy-
chosomatic inventory reporting both symp-
toms experienced and what the subject did
about them; (d) What Is He Saying?, a sen-
tence completion test designed for Air Force
use; (e) L-D Test, a group test booklet of the
Szondi test; and (f) Drawing Test, a group
version of the Draw-A-Person test. These tests
have been described in detail by Sells and
others in a series of Air Force publications
(1, 2, 4, 6, 7).

A major problem in adapting these clinical
testing procedures to selection use is that of
developing adequate formal, quantitative scor-
ing techniques that meet the requirements of
psychometric standards and at the same time
incorporate the interpretive schemes of the
clinical method. This problem was attacked in
a number of ways, for the most part by em-
pirical analyses in which empirical scoring
keys were developed by painstaking and sys-
tematic comparison of the responses of dis-
crete bipolar criterion groups and cross-vali-
dated on independent samples.2 The present
experiment was undertaken as one part of the
coordinated attack on this problem.


1 The research reported here was conducted under
terms of Contract No. AF 33(038)-13887 between the
U. S. Air Force and The University of Texas in co-
ordination with the Department of Clinical Psychology,
USAF School of Aviation Medicine, Randolph Field,
Texas. This research was discussed at a symposium en-
titled "Clinical Prediction of Flying Success," American
Psychological Association, Washington, D. C., Septem-
ber, 1952.




METHOD 


After careful consideration of the test and criterion 
data, the probable availability of skilled clinicians to 
evaluate the test protocols, and the objectives of the 
research program, an experimental design for the 
clinician study was developed. From a total of 1,504 
test subjects (Ss) for whom complete criterion data 
were available, 100 aviation cadets were selected for 
the present study. Fifty of them had made a highly 
successful adjustment to the stress of flight training, 
and 50 had been eliminated from the program because 
they developed overt personality disturbances and were 
unable to complete the training course. The nature of 
the overt personality disturbances varied, but in every 
case the disturbance was sufficiently severe to be called 
to the attention of the flight surgeon, psychiatrist, and 
psychologist. The criteria used for selection of the 100
cadets are:


Successful Group 


1. Passed the flight training course 
2. Pilot aptitude of Stanine 6 or higher 


2 Of the six tests used in this study, empirical studies 
have thus far developed valid scoring keys for three. 
Reports on the Rorschach (6) and Sentence Completion 
Test (7) have been published. Unpublished results on 
the Background Information inventory are reported in 
a general discussion of the psychiatric selection research 
program by Sells and Barry (5). Two studies of the 
Draw-A-Person test, one by Anastasi and Foley (1) 
and an unpublished study by Nurse and Barry, a series 
of analyses of the Szondi test by Ellis and Sells, and an
extensive study of the Feeling and Doing test by Barry
and Cobb have all yielded negative findings on cross-
validation of experimental keys.


* 
18th percentile as 
inst 
for role of Alr Force 
+ 
from the fight training program for 
frier [rend 
o 
disturbance œo- 
these criteria 
waccrsaful and SO 
random from the 
(uet. Code numbers were amigned to these 
sets of 20 cadets 
SAh cadets 
varying randomly about an er- 
the saccum of à given 
by use of the 
and sufficient 
provide each par- 
wet of tent 
peychetoginte, a 
Wed Wer IL, wae complied; k consisted. chiefy of 
 fetnqeped siheri lo clonal procetsrm as apy diet 
te tent interpretation snd perycheshagnesia As partial 
LAN s Clinical evaaa 
> a codes, a eal hononuium of HO wes 
ach oullabeorster for bis participation. The 19 
Sal who acrepted thee invitation are 
mE I4 Layman 
t Artes Besten sao Meh 
A Siy Iyn Daniel Mee 
po LA Patronii 
000 Oum Cater LL. A 
ys o Tim 
= -yn 7A Sason ehh 
naue Meyer Viae 
Kiegder. Los Wieder 
neat B ataa 
TABLE 1 
e Dess 
Nad Meu Me of Cuts 
Grm Chia Culets Wis Fasai 
- 3 » , 
1 3 » 5 
3 5 » n 
‘ 3 » u 
$ 1 » $ 
m » o L] 


population 

cadets and outlined the critical requirements for suc-
cess were included with the test materials. Detailed
instructions for making and recording judgments, a
brief summary of the over-all experimental design, and
a short description of each test were dowd.


Two forms of recording sheet were used, one for re-
cording judgments based on only one test at a time
and one for recording global evaluations based on
consideration of all or part of the available data for


j and a rating of "C" denoted sè 
whatsoever in the judgment. 
Table 1 summarizes the essential features of




As in the case of the global evaluations, an
analysis was made of the results obtained for
each separate test, using majority agreement
of individual clinicians as a basis for predic-
tion. The number of correct predictions did
not differ significantly from chance expectation
for any of the six tests. Clearly, there is little
relationship between actual adjustment and
predictions based upon an analysis of one test
at a time.
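
The majority-agreement rule mentioned above can be illustrated with a minimal sketch. The cadet codes, judgments, and outcomes below are hypothetical toy data, not the study's records, and the handling of ties (a tie counts as no prediction) is an assumption that the text does not specify.

```python
from collections import Counter

def majority_prediction(judgments):
    """Return the label most clinicians chose, or None when the top two labels tie."""
    ranked = Counter(judgments).most_common()
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return None  # assumption: a tie yields no prediction
    return ranked[0][0]

def count_correct(panel_judgments, outcomes):
    """Count cadets whose majority prediction matches the criterion outcome."""
    return sum(
        1
        for cadet, judgments in panel_judgments.items()
        if majority_prediction(judgments) == outcomes[cadet]
    )

# Hypothetical data: "S" = judged successful, "U" = judged unsuccessful.
panel_judgments = {
    "cadet_01": ["S", "S", "U"],
    "cadet_02": ["U", "U", "U"],
    "cadet_03": ["S", "U", "U"],
    "cadet_04": ["S", "U"],
}
outcomes = {"cadet_01": "S", "cadet_02": "S", "cadet_03": "U", "cadet_04": "U"}
n_correct = count_correct(panel_judgments, outcomes)
```

The count of correct majority predictions would then be compared with the number expected by chance for the group in question.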


" 
DISCUSSION


In the present study there is little doubt
that the clinical assessments of beginning avia-
tion cadets have no relationship to a criterion
of adjustment in the basic flight-training pro-
gram. The objective findings can hardly be
ignored, and it is of utmost importance to con-
sider the possible explanations of these results
in order to appreciate fully their significance.


1. Are the criterion data faulty? A major
factor in the success or failure of any predic- 
tion experiment is the validity of the criterion 
measure. This immediately raises the question 
“Validity for what?” The present study is con- 
cerned with predictions of the adjustment to 
the stresses of flight training of two groups of 
aviation cadets carefully selected to reflect 
highly effective and highly inadequate patterns 
of behavior. The clinician is called upon to
predict (in retrospect) the future behavior of 
each cadet on the basis of test performance 
obtained when he first entered training. In this 
sense, the evaluations called for are prognostic 
of later adjustment rather than diagnostic of 
present behavior. 

The likelihood that our criterion groups of 
successful and unsuccessful cadets are in error 
is small. Few, if any, experimental investiga- 
tions of adjustment to stress have been based 
upon criteria as objective and manifold as 
those used in the present study. Although it is 
true that some members of the successful group 
may later experience breakdown under high 
5), using only cues provided by Background Informa- 
tion. To aid him in improving his subsequent judg- 
ments he was given the criterion data for his original 
set of protocols (Group 1). C first made predictions for 
the 40 cadets without specific knowledge of his earlier 
successes and failures. Twenty-five of the 40 predictions 
later proved to be correct. Then C re-evaluated the 40 
test records after examination of the earlier data, and 
changed several predictions. Twenty-two of the 40 
predictions were correct. Since in both cases chance 
expectation was 20 correct, the null hypothesis could 
not be rejected. 
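
The arithmetic behind this footnote can be checked with an exact binomial tail. The sketch below is illustrative only: it assumes each of the 40 predictions is an independent trial with a chance success probability of .5 (consistent with the stated chance expectation of 20 correct), and the function name `binomial_upper_tail` is hypothetical. The footnote does not say which test the authors used.

```python
from math import comb


def binomial_upper_tail(successes: int, trials: int, p: float = 0.5) -> float:
    """Return P(X >= successes) for X ~ Binomial(trials, p)."""
    return sum(
        comb(trials, k) * p**k * (1 - p) ** (trials - k)
        for k in range(successes, trials + 1)
    )


TRIALS = 40
CHANCE_EXPECTATION = TRIALS * 0.5  # 20 correct, as stated in the footnote

# One-tailed probabilities of doing at least this well by guessing
# (assumption: independent predictions, p = .5 each).
p_first = binomial_upper_tail(25, TRIALS)   # 25 of 40 correct
p_second = binomial_upper_tail(22, TRIALS)  # 22 of 40 correct

print(f"chance expectation: {CHANCE_EXPECTATION:.0f} correct")
print(f"P(X >= 25) = {p_first:.3f}")
print(f"P(X >= 22) = {p_second:.3f}")
```

Under these assumptions neither tail probability falls below .05, which agrees with the footnote's statement that the null hypothesis could not be rejected.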


W. H. HOLTZMAN AND S. B. SELLS 


situational stress and may manifest behavior 
indicative of poor adjustment, the anxiety, 
functional disorders, and overt personality 
disturbance which played a prominent role in 
the failure of each cadet in the unsuccessful 
group are outright evidence of maladjustment 
and preclude consideration of these cadets in 
any other category. 

One explanation of the inability of the psy- 
chologists to predict the criterion may be the 
relatively restricted range of adjustment. 
Since each cadet entering flight training had 
successfully passed a number of hurdles prior 
to admission, the likelihood of an individual 
with severe neurotic or psychotic manifesta- 
tions being included in the test population is 
extremely small. This objection has been 
partially overcome by selecting a small ex- 
perimental sample of 100 cases from a much 
larger group in such a way as to reflect only 
the extremes. This was deliberately done to 
maximize the likelihood of obtaining significant 
predictions. 

2. Is it possible that test evaluations were 
valid with respect to adjustment at the time the 
test data were gathered, and that later adjustment 
has little relationship to assessment of initial 
personality status? Because of the exhaustive 
mental, physical, and educational require- 
ments for aviation cadets, an overtly neurotic 
or disturbed candidate is rare. However, the 
problem of prediction is to discover latent or 
subclinical characteristics which constitute 
predisposition to later manifestation of ex- 
cessive stress reaction when the actual stresses 
of military flying are experienced. Hence, if 
the clinical assessments were concerned with 
manifest indications of disturbance at the time 
the tests were administered, it might be ex- 
pected that they would be unrelated to later 
adjustment, unless a very disturbed applicant 
were found. It is possible that this task is un- 
fair to the clinicians before more detailed 
empirical research data are available on the 
signs indicative of predisposition. However, at 
the time the research was undertaken it was 
hoped that the insights of the clinicians might 
facilitate the prosecution of the empirical stud- 
ies. The results of the empirical studies which 
have been completed, such as those with the 
Rorschach, sentence completion, and other 
tests, confirm the conclusion that the items 
found most discriminating are frequently not 
those which were expected on a priori grounds. 




PREDICTING FLYING SUCCESS BY CLINICAL ANALYSIS 


3. Were the judges sufficiently familiar with 
the prediction problem to evaluate the test protocols 
in a proper context? The psychologists 
who participated in this study represented a 
wide variety of interest and background. Sev- 
eral, while recognized as highly skilled clini- 
cians, had had little or no contact with military 
populations. The majority, however, had 
served during the past war as military psy- 
chologists, and a few were thoroughly familiar 
with the problem, having either gone through 
a flight-training program themselves or worked 
as psychologists on similar problems for the 
Air Force. In addition, to ensure some famili- 
arity with the problem, an extensive discussion 
of the training situation and the types of stress 
present, together with several representative 
case reports of cadets who failed to adjust, was 
forwarded to each participant. The uniformly 
negative results strongly suggest that lack of 
familiarity with the training situation and the 
types of adjustment which characterized the 
criterion groups is not a deciding factor. 

4. Did the judges have sufficient previous ex- 
perience with the types of test materials to inter- 
pret them properly? Inexperience with the tests 
used in the Randolph Field battery may have 
contributed appreciably to the lack of positive 
findings. To make feasible their administration 
to large groups, each of the six tests had been 
adapted from more familiar individual testing 
procedures. The Ink-Blot Test was a mixture of 
the commonly accepted individual and group 
Rorschach tests with one or two minor in- 
novations. The L-D Test was a group form of 
the Szondi with only one trial rather than 
several spread over different days as is stand- 
ard practice when individually administered 
in a clinical setting. Though similar in struc- 
ture to other such tests, the SAM Sentence 
Completion Test (What Is He Saying?) was 
devised especially for use with student pilots. 
A large number of the test items, however, 
were identical to those used in earlier, well- 
known tests of this type. Except for the lack 
of an “inquiry,” the group form of the Draw- 
A-Person test was essentially similar to the 
standard individual procedure. Similarly, the 
general type of response given in Feeling and 
Doing and in Background Information was 
familiar to most clinicians, although the con- 
text in which the responses were made was 
different from the usual clinical setting. 

Several participating psychologists have expressed 
the belief that these modifications of 
standard procedures in order to make group 
administration feasible may have changed the 
meaning of test responses as commonly interpreted 
in clinical practice. Others have argued 
that the basic response patterns should remain 
relatively constant, although less information 
may be available in the group tests. Whatever 
the case may be, it is worth noting that many 
of the psychologists felt they were making 
meaningful interpretations of the data.

5. Is it possible that the personality evaluations 
were largely correct, but the criterion predictions 
failed due to inadequate integration of the various 
personality characteristics observed? The judgment 
process consists of at least two phases: 
(a) the description of the individual's personality, 
and (b) the weighing of assets and liabilities 
noted in the individual and the prediction 
of adjustment to the training program. 
Since none of the predictions exceeded chance 
expectations, it is impossible to tell from the 
predictions alone whether the personality de- 
scriptions are invalid or whether they have 
been used inappropriately in arriving at pre- 
dictions. In all likelihood both phases of judg- 
ing have suffered from faulty inferences. Be- 
cause at least three different psychologists 
evaluated each cadet, the variation in per- 
sonality descriptions from one clinician to the 
next in the kind and degree of personality 
descriptions written makes a thorough, sys- 
tematic analysis of this type impossible. Even 
when only those cases on which there is 
unanimous agreement as to predicted outcome 
are considered, there is little congruence of 
terminology, each clinician favoring concepts 
related to his own frame of reference. A major 
problem in further analysis of data of this 
kind is finding an adequate system for classify- 
ing descriptive terms to make comparisons 
among clinicians. 

In spite of such variability, there are under- 
lying consistencies among clinicians which 
cannot be ascribed to chance alone. The a 
priori probability that three judges will unani- 
mously agree in their predictions for a 
single cadet is .25. For the global approach the 
amount of agreement among the psychologists 
was significantly better than chance. There 
was a low but statistically significant agree- 
ment in predictions based upon Background 
Information, Feeling and Doing, and the 
Drawing Test, when considered individually. 




Less reliability was present in predictions 
based upon the Ink-Blot Test and the SAM 
Sentence Completion Test. No agreement 
among clinicians was observed for the L-D 
Test. Although none of the psychologists was 
able to predict beyond chance expectation the 
actual adjustment of the cadets, they did tend 
to use the test battery as a whole, and to some 
extent the individual tests, with some degree 
of consistency. 
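
The a priori figure of .25 for unanimous agreement among three judges can be reproduced by simple enumeration. This is a minimal sketch that assumes each judge independently predicts success or failure with equal probability; the paper states only the resulting value, and the variable names are illustrative.

```python
from fractions import Fraction
from itertools import product

OUTCOMES = ("success", "failure")
N_JUDGES = 3

# Every possible pattern of predictions by three judges for one cadet
# (assumption: the 2**3 = 8 patterns are equally likely).
all_patterns = list(product(OUTCOMES, repeat=N_JUDGES))

# A pattern is unanimous when all three judges give the same prediction.
unanimous = [pattern for pattern in all_patterns if len(set(pattern)) == 1]

p_unanimous = Fraction(len(unanimous), len(all_patterns))
print(f"{len(unanimous)} of {len(all_patterns)} patterns are unanimous: "
      f"p = {float(p_unanimous):.2f}")
```

Two of the eight patterns are unanimous (all "success" or all "failure"), which gives 2/8 = .25.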
To see what cues formed the basis for judgments, 
the evaluations of cadets upon whom 
there was unanimous agreement were examined. 
A rating of failure seemed to depend 
frequently upon the presence of signs of low 
motivation for flying, immaturity, strong 
dependency needs, anxiety, and sexual conflict. 
Ratings of success were less consistent and 
often based upon conflicting cues. In general, 
too few clinicians listed cues in sufficient numbers 
to permit any systematic analysis. Further 
development and standardization of terms for 
the description of personality must take place 
before any appreciable understanding of the 
process of judging will be possible. 

In view of the results of the present study, 
several questions immediately arise. What are 
the implications of these findings with respect 
to the problem of predicting individual adjustment 
to other life stress situations? Can these 
results be accepted as reliable evidence of the 
general invalidity of clinical assessment procedures 
as presently constituted for coping 
with prediction problems of this nature? How 
do the results of this study fit in with the general 
picture as it is gradually unfolding in 
somewhat similar studies such as the Michigan 
VA clinical trainee assessment program? 

In the opinion of the writers, the improvement 
of clinical judgments based on psychological 
tests and other clinical evidence will 
depend upon the completion of adequate quantitative, 
controlled empirical research relating 
standardized measurement instruments to specific 
criterion situations. Such situations include 
diagnostic groups, job criterion groups, 
and various other life situations. In this respect, 
the present results are closely related to 
those of the Michigan program. 


REFERENCES 


1. ANASTASI, ANNE, & FOLEY, J. P., JR. A research 
program on the psychiatric selection of flying 
personnel. V. The Human-Figure Drawing Test 
as an objective psychiatric screening aid for 
student pilots. USAF, Sch. Aviat. Med., Proj. 
Rep., 1952, No. 21-37-002, Rep. No. 5. 

2. BARRY, J. R., & RAYNOR, G. H. A research program 
on the psychiatric selection of flying personnel. 
Research on the Cornell Index. USAF, Sch. Aviat. 
Med., Proj. Rep., 1953, No. 21-0202-0007, Rep. 
No. 2. 

3. KELLY, E. L., & FISKE, D. W. The prediction of performance 
in clinical psychology. Ann Arbor: 
Univer. of Michigan Press, 1951. 

4. SELLS, S. B. A research program on the psychiatric 
selection of flying personnel: I. Methodological 
introduction and experimental design. USAF, 
Sch. Aviat. Med., Proj. Rep., 1951, No. 21-37-002, 
Rep. No. 1. 

5. SELLS, S. B., & BARRY, J. R. A research program to 
develop psychiatric selection of flying personnel. 
I. Theoretical approach and research design; II. 
Research progress. J. aviat. Med., 1953, 24, 29-41. 

6. SELLS, S. B., FRESE, F. J., JR., & LANCASTER, W. H. 
Research on the psychiatric selection of flying 
personnel: II. Progress report on development of 
SAM Group Ink-Blot Test. USAF, Sch. Aviat. 
Med., Proj. Rep., 1952, No. 21-37-002, Rep. No. 2. 

7. TRITES, D. K., HOLTZMAN, W. H., TEMPLETON, 
R. C., & SELLS, S. B. Psychiatric screening of 
flying personnel: research on the SAM Sentence 
Completion Test. USAF, Sch. Aviat. Med., Proj. 
Rep., 1953, No. 21-0202-0007, Rep. No. 3. 


Received June 1, 1953. 




THE INFLUENCE OF SET ON SEMANTIC GENERALIZATION¹ 


NATHANIEL H. EISEN 


New York University² 


Semantic generalization refers to the generalization 
to other semantically related 
words of a response originally conditioned 
to a single word. This is a form of 
learned, mediated, or secondary generalization, 
a concept which has become increasingly important 
to those who attempt to describe complex 
human behavior in terms derived from 
Hull's learning theory. For example, Cofer 
and Foley (1) have used secondary generalization 
to explain such phenomena as "meaningfulness," 
set, memory, reasoning, intelligence, 
and "unconscious anxiety." Dollard 
and Miller (3) place considerable emphasis on 
the concept in their discussion of the development 
of personality and the process of psychotherapy. 
They attribute the transfer of 
anxiety to new situations and the development 
of neurotic symptoms in part to secondary 
generalization; similarly, the development of 
the transference relationship during therapy 
is due in part to "response-mediated . . . mechanisms 
of generalization." They suggest that 
a major function of the therapist is to help the 
patient in identifying his generalized responses, 
and in ". . . attaching the same label or other 
cue-producing responses to different situations 
[which] should increase the amount of secondary 
stimulus generalization and [in this way] 
help the patient to respond adaptively to culturally 
emphasized similarities" (3, p. 323). 

¹ This article is based on a dissertation submitted in 
partial fulfillment of the requirements for the degree of 
Doctor of Philosophy at New York University, February, 
1952. The writer wishes to express his indebtedness 
to Professor Leland W. Crafts for his guidance and 
assistance throughout the course of this study. The 
writer also wishes to thank Dr. Bernard Locke, of the 
VA Regional Office, Brooklyn, New York, who provided 
the psychogalvanometer and the polygraph; Drs. 
Jacob Cohen and Lawrence Karlin, for advice and 
assistance with regard to statistical problems; and the 
many members of the teaching staffs of the Washington 
Square College and the School of Education of 
New York University, through whose efforts subjects 
were obtained. Above all, the writer wishes to express 
his gratitude to Benjamin S. Brody, who designed and 
built the shock apparatus and the various electrodes, 
who served as assistant during the collecting of the 
data, and who in many ways made this experiment 
possible. 

² Now at the University of Illinois. 



The experimental evidence for semantic 
generalization has been reviewed most recently 
by Razran (14). A few of the previously re- 
ported studies are directly related to the ex- 
periment reported here. Razran (11) found 
that the salivary response conditioned to one 
word showed greater generalization to words 
of similar meaning (synonyms) than to words 
of the same sound (homonyms or homo- 
phones). Riess (15), using the GSR as his CR, 
and a buzzer as his CS, found that adult Ss 
showed generalization to antonyms interme- 
diate between that to synonyms and homo- 
nyms, thus obtaining a gradient of generali- 
zation from synonyms to antonyms to 
homonyms. Wylie (17), also using the GSR, 
but shock as her US, obtained maximum gen- 
eralization to homonyms, with greater gener- 
alization to synonyms than to control words. 
Cofer and Foley (1) explain the discrepancy 
between Wylie's findings and those of Riess 
and Razran on the basis of the "intentions," 
"expectations," and "beliefs" held by Wylie's 
Ss as to the purpose of her experiment. 

The related concepts of set, attitude, ex- 
pectancy, and the like have frequently been 
invoked either explicitly or implicitly to ex- 
plain experimental findings in this area in a 
posteriori fashion (e.g., 1, 2, 15, 17). However, 
few studies have reported the effects of ex- 
perimental variations in Ss’ set on semantic 
generalization. Razran (13) found that (a) 
manipulation of "mental sets," induced by 
practice with lists of controlled associations, 
could increase or decrease the amount of gen- 
eralization to words in such semantic relation- 
ships to the CS as subordinate, supraordinate, 
and the like, according to the kind of practice 
involved; and (b) Ss with different political 
views differed in the directions to which they 
showed generalization to sentences reflecting 
varying political philosophies. 

HYPOTHESES 

Major hypothesis. The amount of generali- 
zation to a word having a given semantic re- 
lationship to the CS should be greater when a 
set has been induced for generalization to that 
relationship than it is to a word in the same 




relationship when a set has been induced for 
generalization to other relationships. 

This hypothesis may be deduced from the 
Cofer and Foley (1) elaboration of Hullian 
theory, and it is also consistent with Razran's 
theoretical position (14). It does not seem to 
have been stated and tested explicitly, but is 
implicit in some of the work of Razran (13) 
and others (e.g., 2). An explicit statement of a 
related hypothesis is made by Dollard and 
Miller, although in somewhat different lan- 
guage (3, p. 323). 

Minor hypothesis I. Generalization as de- 
scribed above should be significantly greater 
when an “implicit set” is induced toward a 
given semantic relationship by having S prac- 
tice in an apparently unrelated task than 
it is when a set is induced by means of ex- 
plicit instructions directed toward the same 
relationship. 

Dollard and Miller (3), in their discussion 
of therapy (pp. 281 ff.), mention three ways in 
which people may be induced to attach 
"labels" or other "cue-producing responses" to 
previously unlabeled situations. These are 
roughly analogous to three experimental 
methods for inducing a set: (a) The therapist 
may so arrange matters that the individual 
spontaneously finds the proper labels himself, 
which is similar to the condition existing in the 
experiments by Razran (11) and Riess (15) in 
which Ss were permitted to discover the rela- 
tionships of the generalization stimuli to the 
CS with no help from E. (b) He may strengthen 
some of the patient's random responses by 
means of “selective repetition" having an 
"automatic reward effect," which is analogous 
to the "implicit set" in the experiment re- 
ported here. (c) He may take an active role in 
teaching the patient by explicitly pointing out 
relationships; this is analogous to the explicit 
instructions used here for inducing a set. Ac- 
cording to Dollard and Miller, the first of these 
methods should be the most effective, because 
it offers the greatest amount of satisfaction or 
reinforcement to the patient or S, whereas the 
last method should be the least effective. 

Razran (13) has studied the effects on seman- 
tic generalization of a set induced by means of 
practice with lists of controlled associations, 
but we find no studies in the literature on 
generalization comparing the effects of differ- 
ent methods of inducing sets. 

Minor hypothesis II. Generalization as de- 




scribed above should be greater to words of 
high frequency value on the Thorndike-Lorge 
list (16) than to words of low frequency value. 

This hypothesis is derivable from Hullian 
theory, and was indirectly tested by Cofer, 
Foley, and their associates (e.g., 5) in experi- 
ments involving the recall of lists of words. It 
is also consistent with Razran’s work (e.g., 10). 
However, according to Lashley and Wade, 
". . . the practice required in learning [is] a 
wholly unreliable measure of the learning 
process . . . So long as the effective attribute is 
present, the reaction [i.e., generalization] is 
elicited as an all or none function of that attri- 
bute” (9, pp. 74, 81), presumably regardless of 
the frequency value of the stimuli. 


METHOD 


Subjects. This experiment required 64 Ss to meet 
two criteria for the establishment of a conditioned 
GSR. In order to obtain this number of Ss, 95 individ- 
uals were seen initially. These were students at various 
schools of New York University during the spring and 
summer sessions of 1951. About 50 per cent were volun- 
teers; the remaining were recruited from courses in 
general psychology in which service as the S of an 
experiment was required. The Ss were assigned at ran- 
dom to the various experimental conditions. 

Of the 64 Ss finally selected, 40 were males; ages 
ranged from 17 to 31, with a median age of 23. All had 
learned English as their mother tongue, or concurrent 
with it. Questioning at the end revealed that all re- 
mained naive as to the purpose of the experiment. 

Materials. The conditioned and the generalization 
stimuli are shown in Table 1. Associated with each CS 
is a set of four generalization stimuli of high frequency, 
consisting of a word in each of four semantic relation- 
ships (homonyms, synonyms, antonyms, and “neutral” 
or unrelated control words), and a similar set of low 
frequency words. 

For the preconditioning trial, the words to be used 
later as CS and as generalization stimuli for a given S 
were interspersed among ten neutral words selected at 
random. The order of presentation of this list of words 
during the preconditioning trial was the same for all Ss. 

For the conditioning procedure, 20 high frequency 
words were selected, each of these having a synonym, a 
homonym, and an antonym. This group of words, with 
repetitions of the CS interspersed among them, could 
be arranged alternately into three different sequences, 
thus providing three different “problems” for Ss to 
“solve” during the conditioning procedure (see below). 

For the experimental treatment in which Ss were 
given practice in an apparently unrelated task, E de- 
vised a P (for Practice) test, consisting of 28 words, 
each with its synonym, antonym, homonym, and associated 
neutral word arranged in columns in random order. 

Apparatus. The psychogalvanometer was manu- 
factured by the Lafayette Instrument Company. It 
contained a microampere galvanometer with an 
accuracy within 2 per cent. A vernier dial provided 
scale units of 10 ohms, and a variable shunt permitted 

SET AND SEMANTIC GENERALIZATION 
TABLE 1 
CONDITIONED AND GENERALIZATION STIMULI AND THEIR “G” COUNT FREQUENCY VALUE* 

CS          Frequency   Homonyms    Synonyms        Antonyms        Neutral 

right AA    High        write AA    correct A       left AA         prepare AA 
            Low         rite 10     precise 6       erroneous 4     tuition 2 

buy AA      High        by AA       purchase A      sell AA         country AA 
            Low         bye 1       procure 22      dispense 9      robust 4 

raise AA    High        rays A      lift AA         lower AA        promise AA 
            Low         raze 2      heighten 5      degrade 8       urban 4 

new AA      High        knew AA     modern AA       old AA          glass AA 
            Low         gnu —       unfamiliar 4    obsolete 3      puppet 6 


* See reference 16. 


the measurable control of the amplification of the GSR 
changes. The range was 0 to 10M ohms. Connected 
with the psychogalvanometer was a Lafayette Multi- 
channel Polygraph, model 603, ink recording, with a 
manual time marker. The GSR electrodes consisted of 
¥%-inch-square zinc plates fitted into $4-inch-diameter 
rubber cups containing soft sponge rubber impregnated 
with electrode paste. Both electrode contacts were on 
the palm of the same hand. The electrode paste con- 
sisted of powdered Bentonite, saturated CaCl₂ solution, 
and glycerine. It was slow drying, showing no change 
in conductivity in periods of an hour or more. 

The stimulus words were typewritten on a thin 
Cellophane film placed between two sheets of ted 
waxed paper (sold commercially as “Radio-Mat 
Slides”). Typing was done on a Vari-Type machine in 
lower case "Copper-Plate Gothic," which provided 
simple block letters of uniform height. An attempt was 
made to control for the length of the words by varying 
the spacing between the letters. The Cellophane sheets 
were then secured in 35-millimeter Esco Slide Binders. 
These slides were projected by means of a TDC Pro- 
jector, model D, 300 watts, fitted with a TDC Se- 
lectron Automatic Slide Projector and tray capable of 
carrying up to 30 slides. A medium gray screen was 
used, 5½ feet in front of S, with the words appearing 
about 2½ inches high in an 11-inch × 12-inch field. 

Shock electrodes were 4-inch-square zinc plates at- 
tached to an adjustable elastic arm band; these were 
about 1 inch apart. Contact was reinforced by means 
of the electrode paste. The shock apparatus drew cur- 
rent from a 120-volt, 60-cycle line, which, by means of 
a transformer, was changed to 480 volts at a maximum 
of 40 milliamperes. A resistor and a potentiometer in 
the circuit permitted variations in the current delivered 
to S, with a maximum of 4 milliamperes. 

The S sat at a table, with E, who operated the pro- 
jector, at his right. An assistant (A) operated the 
psychogalvanometer and the polygraph. During the 
conditioning procedure, when the polygraph recorder 
was not in use, A administered the shock and was able 
to signal E as to S's responses without the latter's 
knowledge. 

Experimental design. Simultaneous manipulation of 
three variables deriving from the three hypotheses 
called for a 4 X 2 X 2 factorial design, or 16 groups of 
Ss. Control of the effects of specific conditioning and 
generalization stimuli, and of the effects of varying 
Orders of presentation of the generalization stimuli, 


necessitated rotation of these factors within each group 
by means of a latin square. The rotation scheme re- 
quired a minimum of 4 Ss for each of the 16 groups, or 
64 Ss in all, with no replication. 
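
The count of 16 groups and 64 Ss follows directly from the design, as the sketch below illustrates. Several things here are assumptions for illustration: the factor labels are inferred from the three hypotheses (semantic relationship toward which the set is induced, method of inducing the set, and frequency value of the generalization stimuli), and the cyclic Latin square is only one of many possible 4 x 4 squares. The paper does not say which square was used.

```python
from itertools import product

# Assumed factor levels, inferred from the three hypotheses.
RELATIONSHIPS = ["synonym", "antonym", "homonym", "neutral"]   # 4 levels
SET_METHODS = ["implicit (P test)", "explicit instructions"]   # 2 levels
FREQUENCIES = ["high", "low"]                                  # 2 levels

# 4 x 2 x 2 factorial design: every combination of levels is one group.
groups = list(product(RELATIONSHIPS, SET_METHODS, FREQUENCIES))


def cyclic_latin_square(items):
    """Build an n x n Latin square by cyclically shifting the item list."""
    n = len(items)
    return [[items[(row + col) % n] for col in range(n)] for row in range(n)]


# Each row is one order of presentation of the four generalization stimuli;
# each relationship appears once in every serial position across the rows.
orders = cyclic_latin_square(RELATIONSHIPS)

# One S per row of the square within each group, with no replication.
subjects = [(group, order) for group in groups for order in orders]

print(f"{len(groups)} groups x {len(orders)} orders = {len(subjects)} Ss")
```

With one S assigned to each row of the square in each group, the minimum of 4 Ss per group and 64 Ss in all follows.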

Procedure. During the brief preliminary period, S’s 
introduction to the GSR apparatus and to shock was 
accompanied by appropriate but misleading remarks, 
and his “base resistance level” and response to shock 
were noted. Following this was a preconditioning trial 
exposure to the significant stimuli, which were inter- 
spersed among a group of neutral words. Here and 
throughout, each word was projected for 3 seconds, 
with a 7-second interstimulus interval. 

In line with suggestions made by Razran (e.g., 12), 
the instructions given S during the conditioning pro- 
cedure were designed to motivate him to attend to the 
stimuli, but to mislead him as to the purpose of the 
experiment. The CS were presented embedded in word 
sequences, and S was given the task of finding the 
basis for their arrangement. For example, in Sequence I 
the CS appeared between words beginning with the 
same letter (“...dear, right, die, close, wait, right, 
we..."). Some of the presentations of the CS were 
reinforced with the US, while A kept E informed as to 
whether S’s GSR approached the first criterion for 
conditioning (see below). When necessary, Sequences 
II and III were introduced, and cues for solving the 
sequences were given S, the aim being to prolong the 
presentation of the words until conditioning was estab- 
lished, and to terminate this part of the procedure as 
rapidly thereafter as possible. Later questioning re- 
vealed that this goal was achieved. 

Reinforcing shock, when administered, was of 
1-second duration, and the last one of the three seconds 
in which the CS appeared. The first 4 or 5 CS presenta- 
tions were always reinforced, but thereafter reinforce- 
ment was partial and random, depending upon the 
degree to which S approached the criterion. The first 
criterion of conditioning was met when (a) a point was 
reached at which two presentations of the unreinforced 
CS yielded measurable GSR's, and the preceding and 
following neutral words showed no response, or (b) 
with a few Ss for whom the neutral words never failed 
to elicit a GSR, two presentations of the unreinforced 
CS gave GSR’s at least twice as strong as that of re- 
sponses to adjacent neutral words. 
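
The two-part first criterion can be written as a small decision rule. The sketch below is hypothetical throughout: the `Trial` record, the `threshold` for a "measurable" GSR, the choice to count any two qualifying presentations (rather than two in succession), and the comparison against the stronger of the two adjacent neutral words are all assumptions, since the text does not specify them.

```python
from dataclasses import dataclass


@dataclass
class Trial:
    """One unreinforced CS presentation and its adjacent neutral words."""
    cs_gsr: float          # GSR to the unreinforced CS
    neutral_before: float  # GSR to the preceding neutral word
    neutral_after: float   # GSR to the following neutral word


def meets_rule_a(trial: Trial, threshold: float = 0.0) -> bool:
    """Rule (a): measurable GSR to the CS, none to the adjacent neutral words."""
    return (
        trial.cs_gsr > threshold
        and trial.neutral_before <= threshold
        and trial.neutral_after <= threshold
    )


def meets_rule_b(trial: Trial, threshold: float = 0.0) -> bool:
    """Rule (b): CS response at least twice the adjacent neutral responses."""
    strongest_neutral = max(trial.neutral_before, trial.neutral_after)
    return strongest_neutral > threshold and trial.cs_gsr >= 2 * strongest_neutral


def first_criterion_met(trials, always_responds_to_neutral: bool = False) -> bool:
    """True once two unreinforced CS presentations satisfy the applicable rule."""
    rule = meets_rule_b if always_responds_to_neutral else meets_rule_a
    return sum(rule(t) for t in trials) >= 2


typical_s = [Trial(3.0, 0.0, 0.0), Trial(2.0, 0.0, 0.0)]
reactive_s = [Trial(4.0, 2.0, 1.0), Trial(6.0, 3.0, 2.0)]
print(first_criterion_met(typical_s))
print(first_criterion_met(reactive_s, always_responds_to_neutral=True))
```

Rule (b) is applied only to the few Ss whose neutral words never failed to elicit a GSR, which is why the caller selects it explicitly.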

Half of the Ss were given the P test as a task os- 
tensibly unrelated either to the conditioning procedure 
or to the test of generalization to follow. Instructions 


494 


were designed to induce a set for synonyms, antonyms, 
homonyms, or neutral words. The other half of the Ss 
worked at a digit-symbol substitution test for 90 
seconds, the average time taken by Ss to complete 
the P test. 

The Ss who had taken the P test were tested for 
generalization, with instructions merely to look at the 
stimuli and to wait for questions about them. The re- 
maining Ss (those who had not taken the P test) were 
given explicit instructions to look for words in one of 
the significant semantic relationships of the CS during 
the exposure of the generalization series, but to wait 
for questions before identifying the sought-for word. 

After these instructions were given, an additional 
reinforced CS was presented, followed by the generaliza- 
tion stimuli in prearranged order and, finally, a repeti- 
tion of the CS without reinforcement. The second 
criterion for conditioning was met when S's GSR to 
the final unreinforced CS was larger than that to at 
least one of the four immediately preceding generaliza- 
tion stimuli. 
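
The second criterion reduces to a single comparison, sketched below. The function name and the use of plain floats for GSR magnitudes are illustrative assumptions; the rule itself (final unreinforced CS larger than at least one of the four preceding generalization stimuli) is taken from the text.

```python
def second_criterion_met(final_cs_gsr: float, generalization_gsrs: list) -> bool:
    """True if the final unreinforced CS response exceeds at least one of the
    four immediately preceding generalization-stimulus responses."""
    if len(generalization_gsrs) != 4:
        raise ValueError("expected GSRs to exactly four generalization stimuli")
    # "Larger than at least one" is the same as "larger than the smallest".
    return any(final_cs_gsr > gsr for gsr in generalization_gsrs)


print(second_criterion_met(2.5, [3.0, 1.0, 4.0, 2.8]))
print(second_criterion_met(0.5, [3.0, 1.0, 4.0, 2.8]))
```

This is a lenient rule: an S fails only when the response to the final CS is no larger than the response to any of the four generalization stimuli.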

The Ss were questioned as to their identification of 
the generalization stimuli and their expectations of re- 
inforcement. The purpose of the experiment was ex- 
plained, Ss’ questions were answered, and the need for 
secrecy was emphasized. The procedure varied in 
duration from 45 to 65 minutes. 


RESULTS 


The GSR was recorded as a rising curve on 
polygraph paper. The height of each curve was 
measured in 1/100-inch units and was then 
transformed mathematically into Logarithm 
Conductance Change units, as recommended 
by Haggard (6) and others. It was possible 
(4) to treat the postconditioning GSR's as a 
function of the conditioning procedure alone, 
ignoring the preconditioning-trial responses. 

1. Prior to conditioning, there was no sig- 
nificant difference between GSR's to the 
stimuli to be used as CS and the mean GSR's 
to the four words to be used as generalization 
stimuli. Following conditioning, however, the 
GSR's to the final unreinforced CS were sig- 
nificantly greater (t = 6.62, p < .01) than the 
mean of the GSR’s to the four generalization 
stimuli, from which we may infer that con- 
ditioning was successfully established. 

2. The effects of the attempts to control 
three variables (CS words used, generalization 
stimuli used according to their semantic rela- 
tionships, and the position of the generaliza- 
tion stimuli in their orders of presentation) 
were investigated by means of three analyses 
of variance.³ The only significant F value was 
that deriving from the interaction of the words 


³ The supplementary tables have been deposited 
with the ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress, Washington 
25, D. C. Order document number 4215 remitting in 




[Figure 1. Line graph: magnitude of the generalization GSR (ordinate) 
plotted against position in the generalization series (1 to 4), with 
separate curves for synonyms, homonyms, antonyms, and controls (neutral).] 

FIG. 1. MAGNITUDE OF THE GENERALIZATION GSR AS 
A FUNCTION OF THE POSITION OF THE GENERALIZATION 
STIMULUS IN THE GENERALIZATION SERIES 


used as generalization stimuli, according to 
their semantic relationships, and the position 
of these words in their orders of presentation 
(see Fig. 1). The magnitude of the response to 
synonyms decreased as a function of the posi- 
tion of the synonym in the order of presenta- 
tion; that is, the later the synonym appeared 
in the series of generalization stimuli, the 
smaller the response to it was apt to be. The 
other semantic relationships showed no clear- 
cut trends, and an analysis of the synonym 
trend indicated that it was not significant. The 
absence of significance, however, was due to 
the large within-positions value used as the 
error term. It should be noted that each of the 
16 entries of Figure 1 is the mean of 16 meas- 
ures, each measure having been obtained from 
one S in each of the 16 experimental groups. A 
more significant synonym trend might be 
found if within-position variance were reduced 
by decreasing the multiplicity of experimental 
treatments. 

3. An investigation of the relationship be- 
tween variations in experimental procedure 
and S's general response level to stimuli re- 
vealed that the groups subjected to different 


advance $1.25 for photoprints or $1.25 for 35 mm.
microfilm. Make checks payable to Chief, Photoduplication
Service, Library of Congress. (See Tables
1, 2; 3, 4; 5, 6.)


SET AND SEMANTIC GENERALIZATION 


treatments were not homogeneous with respect 
to response level. Whether this difference is 
attributable to the effects of the experimental 
treatments themselves or to some bias in the 
selection of Ss was not investigated. 

4. In the process of testing the various predictions
arising from the hypotheses, 36 t tests
were made. (Four were necessary to test the
major hypothesis, 8 each to test the two minor
hypotheses, and 16 to test the major hypothesis
within each of the 16 experimental groups.⁵)
The number of significant, or almost significant,
t's was four, just about the number one
might expect by chance alone. We therefore
cannot reject the null hypothesis that “set” as
varied in this experiment did not affect semantic
generalization. Alternative statistical treatments,
involving nonparametric statistical
techniques, yielded similar negative results.
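
The "chance alone" argument can be illustrated with a short calculation. The sketch below is only a rough check under stated assumptions: the text does not give the significance level behind "significant, or almost significant," so the .05 and .10 levels used here are illustrative, and the binomial figure treats the 36 tests as independent, which they were not (the same Ss contributed to several tests).

```python
from math import comb

# Breakdown of the t tests as given in the text: 4 + 8 + 8 + 16.
N_TESTS = 4 + 8 + 8 + 16


def expected_chance_hits(n_tests, alpha):
    """Expected number of 'significant' tests when every null hypothesis is true."""
    return n_tests * alpha


def prob_at_least(k, n_tests, alpha):
    """Binomial probability of k or more chance hits among n_tests tests.

    Assumes independent tests (an illustrative simplification).
    """
    return sum(
        comb(n_tests, i) * alpha ** i * (1 - alpha) ** (n_tests - i)
        for i in range(k, n_tests + 1)
    )


if __name__ == "__main__":
    for alpha in (0.05, 0.10):  # assumed levels, not stated in the text
        print(
            f"alpha={alpha:.2f}: expected hits={expected_chance_hits(N_TESTS, alpha):.1f}, "
            f"P(4 or more)={prob_at_least(4, N_TESTS, alpha):.3f}"
        )
```

Under either assumed level, four nominally significant results out of 36 is not an unusual outcome when no true effect is present.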


Discussion 

We may eliminate the following factors (4)
as possible bases for the failure to obtain positive
findings, either in the predicted direction
or in the form of a generalization gradient: (a)
lack of sensitivity of the apparatus to intra-individual
differences, (b) lack of acquaintance
of Ss with the stimuli, (c) inadequate conditioning,
and (d) inadequate statistical treatment
of the data. Two factors, however, may
explain the obtained results. These are: (a) the
pattern of reinforcement utilized, and (b) the
degree of discrimination training achieved during
conditioning.

1. It might be argued that owing to differ- 
ences in the pattern of reinforcements, Riess’s 
(15) Ss expected shock when exposed to the 
generalization words, whereas in this experi- 
ment the Ss did not expect shock at this time. 
Such an explanation seems to call upon some 
form of “expectancy theory,” but is probably 
translatable into S-R terminology (8). 

2. The conditioning procedure was such as to 
maximize discrimination between the CS and 
other words. If we assume that complete dis- 
crimination did not occur in the other studies 
(there is no evidence to contradict this assump- 
tion), then gradients of generalization are at- 
tributable to failure to observe differences 
among the CS and generalization stimuli. This 
seems to be consistent with the Lashley-Wade 


⁴ Refer to footnote 3 regarding supplementary
tables; see Table 7.

⁵ Refer to footnote 3 regarding supplementary
tables; see Tables 8, 9, 10.


(9) interpretation of generalization. However, 
the Lashley-Wade position is too inconsistent 
to serve as a basis for a thorough explanation 
of generalization. From their discussion it can 
be argued* that the very process of "restriction 
of associations," which is supposed to bring 
about generalization, tends to increase dis- 
crimination and so tends to decrease the “fail- 
ure to note distinguishing characteristics" of 
the stimuli, which is also supposed to increase 
generalization. 

Razran's theory does not account for the re- 
sults obtained here either. Razran (12, 14) 
attributes "true generalization" to automa- 
tized categorizing, or rating habits, of S, habits 
which increase in strength with "maturity, 
alertness . . . and other capacity-enhancing in- 
fluences,” and which are correlated, presum- 
ably, with the ability to discriminate among 
stimuli. Although the Ss here were able, for the 
most part, to identify the generalization stim- 
uli as related to—and therefore were able to 
categorize them as similar or dissimilar to—the 
CS, neither the hypothesized nor any other 
kind of generalization gradient appeared. Razran
introduces the concept of “pseudo-generalization”
to account for instances of generalization
due to lack of discrimination; but to
accept this as an explanatory concept would 
require the additional assumption that the 
failure by Ss of other studies to discriminate 
among stimuli (and thereby producing a gra- 
dient) was the result of fatigue or lack of at- 
tention (14). 

Hullian theory (7) takes into account the 
effects of discrimination training on primary 
generalization. Although the effects of such 
training on secondary generalization are men- 
tioned by Dollard and Miller (3), the mecha- 
nism by which such training operates is not 
made clear. According to Hull, the degree to 
which differential reinforcement “steepens”
the primary generalization gradient is an in- 
verse function of the proximity of the con- 
trasted stimulus to the CS along the stimulus 
dimension. By analogy, we should expect 
minimum effects on the secondary generaliza- 
tion gradient when the contrasted stimulus is 
an unrelated word (and maximum effects when 
the contrasted stimulus is a synonym). As an
alternative deduction from Hullian theory, it 
may be stated that secondary generalization 


* The writer is indebted to Dr. Lawrence Karlin for 
suggesting this criticism of the Lashley-Wade position. 




may occur along separate dimensions for each 
of the semantic relationships—a separate gra- 
dient for synonyms, antonyms, etc. In that 
case discrimination training with antonyms 
should have no effect on generalization to 
synonyms, and discrimination training with 
neutral words should have no effect on gen- 
eralization to other relationships. The con- 
clusion based on either of these two lines of 
reasoning is the same. Hull's theory does not 
predict that training with neutral words will 
result in complete obliteration of the generali- 
zation gradient to related words. 


CONCLUSIONS 


In their preface, Dollard and Miller (3) state 
that most of their ideas are “hypotheses, the 
basis for research, not proven principles. Fur- 
thermore, it has not been possible to formulate 
these hypotheses in as rigorous, systematic, 
and quantitative a form as will eventually be 
desirable." The results of this study are in 
agreement with this statement. Although it has 
been possible to derive hypotheses regarding 
semantic generalization from the work of 
Dollard and Miller and others who have been 
influenced by Hull's thinking, existing theories 
of secondary generalization cannot account for 
the results obtained here. 

The present findings suggest several new 
problems and areas of research. Among these 
are: (a) the effects of different degrees of dis- 
crimination training on the secondary generali- 
zation gradient; (b) the effects of discrimina- 
tion training with stimuli in different semantic 
relationships of the CS on the gradient of 
secondary generalization; (c) the effects of
different patterns of partial reinforcement on 
secondary (and perhaps on primary) generali- 
zation; and (d) the effects on generalization to 
the synonym—or any other semantic rela- 
tionship—of the position in the generalization 
series at which the crucial word appears. All of 
these problems are relevant to the use of the 
concept of semantic, secondary, mediated, or 
learned generalization as an explanation of 
what occurs in therapy or in the learning
experience of the child.


SUMMARY 


The purpose of this study was to compare
the influence on semantic generalization of 
variations in (a) the kinds of set given S, (b) 
the methods of inducing a set, and (c) the past 
experience of S with the generalization stimuli. 




The findings were essentially negative, gen- 
eralization occurring randomly and without 
regard to the relationship of the stimulus to the 
CS, or of the set given S. That is, neither the 
hypothesized results nor those reported in 
the literature appeared. Current theories of 
secondary generalization are inadequate to ex- 
plain these negative findings, but several new 
areas of research are suggested. 


REFERENCES 


1. Cofer, C. N., & Foley, J. P., Jr. Mediated
generalization and the interpretation of verbal
behavior: I. Prolegomena. Psychol. Rev., 1942,
49, 513-540.

2. Diven, K. Certain determinants in the conditioning
of anxiety reactions. J. Psychol., 1937, 3, 291-308.

3. Dollard, J., & Miller, N. E. Personality and
psychotherapy. New York: McGraw-Hill, 1950.

4. Eisen, N. H. The influence of set on semantic
generalization. Unpublished doctor's dissertation,
New York Univer., 1952.

5. Foley, J. P., Jr., & Cofer, C. N. Mediated
generalization and the interpretation of verbal
behavior: II. Experimental study of certain
homophone and synonym gradients. J. exp.
Psychol., 1943, 32, 168-175.

6. Haggard, E. A. On the application of analysis of
variance to GSR data: I. The selection of an appropriate
measure. J. exp. Psychol., 1949, 39,
378-392.

7. Hull, C. L. The problem of primary stimulus
generalization. Psychol. Rev., 1947, 54, 120-134.

8. Jenkins, W. O., & Stanley, J. C., Jr. Partial reinforcement:
a review and critique. Psychol.
Bull., 1950, 47, 193-234.

9. Lashley, K. S., & Wade, Marjorie. The Pavlovian
theory of generalization. Psychol. Rev.,
1946, 53, 72-87.

10. Razran, G. Salivating and thinking in different
languages. J. Psychol., 1936, 1, 145-151.

11. Razran, G. A quantitative study of meaning by a
conditioned salivary technique (semantic conditioning).
Science, 1939, 90, 89-90.

12. Razran, G. Attitudinal determinants of conditioning
and generalization of conditioning. J. exp.
Psychol., 1949, 39, 820-829.

13. Razran, G. Some psychological factors in the
generalization of salivary conditioning to verbal
stimuli. Amer. J. Psychol., 1949, 62, 247-256.

14. Razran, G. Stimulus generalization of conditioned
responses. Psychol. Bull., 1949, 46, 337-367.

15. Riess, B. F. Genetic changes in semantic conditioning.
J. exp. Psychol., 1946, 36, 143-152.

16. Thorndike, E. L., & Lorge, I. The teacher's word
book of 30,000 words. New York: Bureau of
Publications, Teachers Coll., Columbia Univer.,
1944.

17. Wylie, Ruth C. Generalization of semantic conditioning
of the galvanic skin response. Unpublished
master's thesis, Univer. of Pittsburgh,
1940.


Received June 1, 1953. 


CONDITIONING LEVEL IN THE BEHAVIOR DISORDERS¹


JANET A. TAYLOR
Northwestern University

AND

KENNETH W. SPENCE
State University of Iowa


IN MODERN psychology the clinical concept
of anxiety can be considered to be the
dominant and key explanatory principle
of the behavior disorders. Freud, who is given
major credit for originating these theories,
stated in 1921 that anxiety was the “fundamental
phenomenon and the central problem
of the neuroses” (5, p. 85). Although many
have disagreed with Freud's notions as to its
source and etiology, anxiety as an explanatory
concept has been widely accepted (e.g., 2, 3,
7, 19), the current tendency being to regard
the behavioral reactions of the neuroses either
as direct expressions of anxiety or as a defense
against this anxiety. The symptoms found in
the functional psychoses have also been similarly
analyzed in terms of their relationship to
anxiety, although usually not in the detail
accorded the neurotic disorders.

The development of the anxiety concept
rests primarily on observational evidence
gathered from clinical data. While many of the
postulated relationships of anxiety to overt
neurotic and psychotic behavior appear to have
a firm empirical basis, the concept has never
been systematically analyzed nor have adequate
experimental tests been made concerning
many of the speculations about it. It is the
purpose of this investigation to attempt to
verify experimentally some of the assumptions
that have been made.


USAGES OF THE ANXIETY CONCEPT


As the writers have pointed out in previous
studies (13, 14) the term “anxiety” appears to
be employed by the clinician in at least two
different ways: (a) as a term applied to a particular
set of overt behavior patterns, hereafter
referred to as “manifest anxiety,” and (b) a
hypothetical construct, hereafter called “internal
anxiety state,” that is assumed to be a
determinant of overt behavior.


¹ This investigation was supported by a research
grant from the National Institute of Mental Health, of
the National Institutes of Health, Public Health
Service. The writers wish to acknowledge the very
helpful cooperation of Dr. David Shakow, Chief of the
Department of Psychology, Illinois Neuropsychiatric
Institute, and of Dr. Samuel Liebman, Superintendent
of the North Shore Health Resort. Acknowledgement is
also made to George Faibish and John McAllister, who
served as research assistants for the project.

Manifest anxiety. The definition of the con- 
cept “manifest anxiety” appears to have been 
fairly clear and unambiguous. A group of overt 
reactions, directly observable or measurable, 
are identified as manifest anxiety. This behavior
consists of such verbal responses as reports
of chronic worry and tension, and such
nonverbal patterns as excessive activity of 
autonomically controlled effectors, disturb- 
ances in appetite and elimination, sleepless- 
ness, panic attacks, etc. These overt behavior 
patterns are also assumed to be the accom- 
paniment, or to represent the overt expression 
of, an internal emotional state of the person. 
While attempts have been made to obtain 
physiological indices of this state, the emo- 
tionality has usually been merely assumed by 
the clinician to be present, accompanying the 
more overt, directly observable symptoms 
mentioned above. 

Internal anxiety state. In contrast to this 
concept of manifest anxiety, which is identi- 
fied with specific observable behavior symp- 
toms, the concept of internal anxiety is a 
hypothetical construct that has been intro- 
duced in connection with attempts to account 
for certain other types of behavior patterns 
that occur in the neuroses. These behavior 
patterns have typically been referred to as 
defense reactions, which supposedly protect 
the individual against the internal state of 
anxiety. According to this conception the 
individual learns to respond to emotion-arous- 
ing stimulus conditions, not with overt emo- 
tional symptoms (manifest anxiety) but in- 
stead, with certain instrumental responses. It 
is assumed, further, that these newly learned 
responses serve in some manner to avoid the 
arousal of the internal anxiety state or to re- 
duce its intensity once it has been evoked. 
Thus the presence of these defense mechanisms 
is said to result in the minimization of the 
amount of functional internal anxiety. Since 
the internal anxiety state is merely inferred
from these defense reactions, and is not observed
or measured, this second usage of
anxiety can be seen to be a hypothetical
concept.
Although on a functional level the internal 
anxiety state is held in check by the presence 
of the defenses, it should be noted that the 
individuals exhibiting such symptoms are still 
considered to be potentially anxious. That is, 
if the defenses are somehow prevented from 
occurring, the stimulus conditions previously 
giving rise to the defense reactions would now 
arouse the internal anxiety. Furthermore, this 
anxiety would produce the type of overt be- 
havior that is described as manifest anxiety. 
This type of interpretation would seem to 
identify the internal emotional state assumed 
to be operating at the time of manifest anxiety 
with the internal anxiety state postulated to 
lie behind the appearance of defense reactions. 
In the typical textbook treatment, the rela- 
tionship between internal anxiety and overt 
symptomatology is described for each behavior 
disorder separately. It can be seen from these
discussions that the assumption has not been 
made by clinical theorists that defenses against 
internal anxiety operate in uniform fashion 
and with equal effectiveness in all diagnostic 
groups, or that the presence of defense reac- 
tions precludes the appearance of some mani- 
fest anxiety responses. In obsessive-compul- 
sive disorders, for example, manifest anxiety is 
frequently observed, disappearing only when 
the obsessive thought and/or compulsive act 
have occurred. In the classical hysterical syn- 
drome, on the other hand, manifest anxiety is 
almost completely absent (“la belle indifférence”).
Furthermore, the internal anxiety
postulated to lie behind the symptoms is kept 
at a minimal level. Thus, the hysterical de- 
fense system may be considered to be par- 
ticularly effective in checking internal anxiety 
and the obsessive-compulsive syndrome, in 
comparison, far less so. 
Within the neurotic disorders the symptoms 
of only one group, anxiety neurotic (anxiety 
state), have not been assumed to have a de- 
fensive function. In fact, this group is often 
characterized as one in which defense reactions 
have failed to develop. That is, individuals are 
classified in this category on the basis of the 
presence of a large number of manifest anxiety 
symptoms and the absence of any well-inte- 
grated defense system. Thus, while the anxiety 
neurotic has a few scattered defense reactions,
he stands in distinct contrast to individuals in
other diagnostic groups, being relatively unprotected
against his anxieties.

As we have seen, anxiety plays a key ex- 
planatory role in the neuroses. The overt symp- 
toms in these disorders, if not direct expressions 
of internal anxiety, are defense reactions 
against it. Thus, the major differences among 
subgroups are in the particular protective 
mechanism that is used to reduce the anxiety.

Anxiety has been assigned a similar role in 
the functional psychoses: many symptoms in 
these disorders are assumed to have defensive 
properties with respect to anxiety. However, 
in contrast to the neuroses, the anxiety con- 
cept as an explanatory principle occupies a 
relatively minor position; anxiety and reactions 
to it are considered as being only one segment 
of a multifaceted phenomenon. Furthermore, 
the theories concerning the dynamics of these 
disorders are insufficiently detailed to permit 
direct comparison of neurotic and psychotic 
groups with respect to amount of internal 
anxiety. For example, clinical theory would 
permit us to hypothesize that the anxiety 
neurotic is more likely to be anxious at any 
moment of time than the obsessive or the 
hysteric, but would not provide a basis for 
similar judgments among psychotic groups or
between psychotic and neurotic Ss. 


AN EXPERIMENTAL TEST 


A number of studies have appeared in the 
literature that have shown a relationship be- 
tween level of classical conditioning and mani- 
fest anxiety variously defined, in both normal 
Ss (1, 6, 11, 12, 13, 14) and psychiatric patients 
(9, 17, 18). The interpretation of these results 
offered by the writers (12, 14) has been that 
degree of manifest anxiety is related to the 
intensity of an internal anxiety (emotional) 
state and that this, in turn, reflects the level 
of drive (D) of a subject. Thus, according to
an analysis in S-R terms, Ss exhibiting much 
manifest anxiety should, by virtue of higher D, 
exhibit a higher level of conditioning performance
than those with lower manifest anxiety.
The assumption that manifest anxiety
reflects drive level has also been supported by
the results of a number of investigations involving
learning tasks other than classical
conditioning (4, 8, 16).

Attention was directed to the fact that clinicians
have tended to postulate that the internal
anxiety that produces manifest anxiety symptoms
is the same state as that responsible for
evoking defensive reactions. If we follow this 
assumption, we see that the established relation 
between manifest anxiety and conditioning 
provides a basis for testing the amount of 
functional internal anxiety that is present in 
patients who do not exhibit manifest anxiety. 
That is, differences among subgroups of pa- 
tients in level of conditioning could be inter- 
preted as reflecting differences in degree of 
functional internal anxiety. 

In an earlier study by the writers (13) in- 
volving comparisons between normal, neurotic, 
and psychotic Ss, conditioning scores were used 
in this manner. Clinical speculations led to the 
prediction that psychotic and neurotic Ss 
would, in general, exhibit a higher condition- 
ing level than normals due to the greater degree 
of manifest and internal anxiety in these in- 
dividuals. The results of this investigation 
indicated that the conditioning performance of 
the psychotic Ss was significantly higher than 
that of both normals and neurotics. The neurotic
group was superior to the normal Ss but
the difference was not statistically significant. 

The purpose of the present experiment was 
to test further implications of clinical theory. 
Specifically the prediction was made that Ss 
with the diagnostic label “anxiety state” 
should show higher conditioning scores than 
Ss in other neurotic groups in which defense 
behavior is said to predominate. 


METHOD 


Subjects. The Ss were psychiatric patients obtained 
from the inpatient and outpatient services of the Illinois 
Neuropsychiatric Institute and from the North Shore 
Health Resort. Patients were used as Ss in the experi- 
ment as they became available without any prior 
selection or screening as to diagnosis. Some screening 
was involved, however, in that the psychiatrists re- 
ferred only those patients whom they judged to be 
capable of cooperating in the experimental situation 
and who would not be excessively disturbed by the
experience.

Psychiatric diagnoses for the Ss were drawn from 
the individual files maintained by each institution. In 
addition to the diagnosis, other types of information 
were obtained for each patient. These included items 
of personal data (age, occupation, onset of illness, etc.) 
drawn from case histories and a psychiatric check list 
that was completed for each patient by the psychiatrist 
in charge of the case. The check list consisted of state- 
ments describing common behavior patterns found in 
the functional behavior disorders, including a detailed 


TABLE 1

DISTRIBUTION OF DIAGNOSES IN THE NEUROTIC AND PSYCHOTIC GROUPS

            Neurotics                            Psychotics
Diagnosis                       N      Diagnosis                    N

Anxiety state                  15      Schizophrenia
Anxiety hysteria                5        Simple                     4
Obsessive-compulsive            3        Paranoid                  12
Depressive reaction            10        Hebephrenic                3
Hysteria (misc.)                5        Mixed type                 2
Hypochondriasis                 2      Involutional psychosis       3
Psychoneurosis mixed           17      Miscellaneous                8
Alcoholism                      7
Drug addiction                  4
Behavior & character
  disorders                     6

Total                          74      Total                       42


list of overt anxiety symptoms. The psychiatrist was 
asked to indicate absence or severity of each symptom 
if present, and further, to assign to each patient an 
over-all rating as to the degree of overt anxiety. In 
addition, an individual form of a manifest anxiety 
scale developed by one of the writers (15) was admin- 
istered to each patient, usually several days after the 
conditioning session. 

One hundred and forty-five patients participated in 
the conditioning situation. The records of 29 of these 
were excluded from consideration in the results of the 
investigation for the following reasons: 6 for equipment 
failure, 3 due to inability or unwillingness to cooperate 
in the conditioning situation, 2 because of unavail- 
ability of any diagnosis, and 18 because they did not 
meet the latency criteria set up to exclude voluntary 
responders. Of the 116 remaining Ss, 74 were diagnosed
as neurotic and 42 as psychotic.² A more detailed breakdown
of the composition of the neurotic and psychotic
groups by diagnostic category is shown in Table 1. 
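
The subject accounting in this paragraph can be checked with a few lines of arithmetic. The sketch below simply restates the counts given in the text; the dictionary keys are shorthand labels chosen here for illustration.

```python
# Counts as stated in the text; key names are illustrative shorthand.
TESTED = 145

exclusions = {
    "equipment failure": 6,
    "unable or unwilling to cooperate": 3,
    "no diagnosis available": 2,
    "failed latency criteria": 18,
}

groups = {"neurotic": 74, "psychotic": 42}

excluded = sum(exclusions.values())   # total records dropped
retained = TESTED - excluded          # Ss whose records were analyzed

if __name__ == "__main__":
    print(f"excluded={excluded}, retained={retained}, "
          f"group total={sum(groups.values())}")
```

The exclusion reasons, the retained total, and the two diagnostic groups agree with one another.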

Apparatus and method of recording. The equipment 
for recording the eyelid closure and presenting the CS 
and UCS has previously been reported (13) and will 
not be described here. Seated in a straight-backed chair 
in a semi-darkened room containing the conditioning 
apparatus, S was instructed to blink upon receiving a 
verbal ready signal and then to fixate on a 6-cm. circular
milk glass window or disc, placed at a distance of 6 ft.
in front of him. The brightness of the disc between
trials was approximately 1 apparent foot-candle. The
CS consisted in an increase in brightness of the disc
to approximately 24 apparent foot-candles. The duration
of the CS on each trial was 520 msec. Following the
onset of the CS by 470 msec., the UCS, a puff of air, 
was administered to the right eye. The air puff was 
produced by the fall of a 30-mm. column of mercury, 
magnetically released, in a manometer. Intertrial intervals
of 15, 20, and 25 sec., averaging 20 sec. and fixed
according to a prearranged schedule, were used.


² Of this total, the data from 34 neurotic and 21
psychotic Ss were presented in an earlier study (13)
by the writers. Additional Ss were tested and the entire
group utilized in the analyses reported in the present
investigation.

Studies in the Iowa laboratory have shown that 
some Ss attempt to avoid the air puff by voluntarily 
closing their eye to the CS, keeping it closed until after 
the occurrence of the UCS. In order to eliminate such 
voluntary responders, objective latency criteria utilized 
in previous studies (10, 13) were employed. Briefly, the 
records of all Ss who gave more than 50 per cent CR's
with latencies less than 300 msec. were excluded from 
the sample. 
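
The exclusion rule just described can be expressed as a small function. This is a minimal sketch, not the scoring procedure actually used: the function and parameter names are hypothetical, and it assumes that "more than 50 per cent" refers to the proportion of an S's CRs (rather than of all trials) with latencies under 300 msec.

```python
def is_voluntary_responder(cr_latencies_msec, cutoff_msec=300.0, max_fraction=0.5):
    """Return True if an S's record would be excluded under the latency criterion.

    cr_latencies_msec: latencies (msec after CS onset) of each CR the S gave.
    An S is flagged when more than max_fraction of those CRs have
    latencies below cutoff_msec. An S with no CRs is never flagged.
    """
    if not cr_latencies_msec:
        return False
    short = sum(1 for latency in cr_latencies_msec if latency < cutoff_msec)
    return short / len(cr_latencies_msec) > max_fraction


if __name__ == "__main__":
    print(is_voluntary_responder([250, 260, 400]))  # two of three CRs are short
    print(is_voluntary_responder([250, 400]))       # exactly half are short
```

Because the rule is stated as "more than 50 per cent," an S with exactly half of the CRs below the cutoff is retained in this sketch.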

Conditioning procedure. Each S first received three 
presentations of the light alone in order to check for 
any initial conditioned response tendencies to the 
light. A single puff of air, unaccompanied by the light, 
was then administered and spontaneous blinking re- 
corded for 40 sec. Following these preliminary trials, 
all Ss were given 80 paired presentations of the light 
and air puff.


RESULTS AND DISCUSSION 

Comparison of neurotic groups. In comparing 
the conditioning scores of anxiety neurotics 
with those in other neurotic groups a question 
arose as to whether or not certain diagnostic 
categories should be included in the analysis, 
namely: behavior disorders, alcoholism, drug 
addiction, and mixed psychoneurosis. The latter 
diagnosis is typically a “wastebasket” category 
into which individuals failing to correspond to 
any of the other neurotic patterns are placed. 
Since many of these Ss may be expected to 
exhibit a number of overt anxiety symptoms 
in addition to other reactions, opposing this 
group to anxiety neurotics might not be con- 
sidered a fair test of the clinical theory. The 
other three groups (alcoholics, behavior dis- 
orders, and drug addicts) likewise raised a 
problem since the relationship of their behav- 
ior to anxiety has not been clearly specified. 


[Figure 1: y-axis: mean per cent CR; x-axis: block of ten trials.]

FIG. 1. ACQUISITION CURVES SHOWING THE MEAN
NUMBER OF CR'S FOR THE NEUROTIC GROUPS IN
SUCCESSIVE BLOCKS OF TEN TRIALS




Accordingly, two analyses were made between
the conditioning scores of anxiety neurotics
and those of other groups, one including all
other neurotics and the other with the four
questionable categories omitted.

Curves of acquisition of the CR for anxiety 
neurotics and the other groups are shown in 
Fig. 1. Although the curve for anxiety neurotics 
tends to be above that for all other neurotics, 
as would be predicted by clinical theory, there 
was some overlapping. The mean conditioning 
scores for anxiety neurotics were also slightly 
higher than for all other neurotics, as is shown 
in Table 2. The difference between means, 
however, was not statistically significant. 

When the scores for the individuals in the 
four categories previously mentioned were ex- 
cluded from the nonanxiety category, the 
conditioning scores for this group were de- 
creased. Although these Ss ("other neurotics") 
exhibited a lower conditioning performance on 
every block of trials than did anxiety neurotics, 
the t ratio comparing the mean conditioning
of this reduced group with anxiety neurotics 
was only 1.22, a value falling far short of the 
level needed to reject the null hypothesis. 
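
The reported ratio can be approximately reproduced from the figures in Table 2. The sketch below rests on two assumptions not stated in the text: that the last column of Table 2 is the standard error of each mean, and that t was formed as the difference between means divided by the square root of the sum of the squared standard errors. The function name is illustrative.

```python
from math import sqrt


def t_ratio(mean_a, se_a, mean_b, se_b):
    """Difference between two independent means divided by the
    standard error of that difference (assumed formula)."""
    return (mean_a - mean_b) / sqrt(se_a ** 2 + se_b ** 2)


# Table 2: anxiety neurotics vs. other neurotics minus four categories.
t_reduced = t_ratio(28.67, 5.98, 20.50, 2.90)

# Table 2: anxiety neurotics vs. all other neurotics.
t_all = t_ratio(28.67, 5.98, 26.18, 3.32)

if __name__ == "__main__":
    print(f"reduced comparison: t = {t_reduced:.2f}")
    print(f"all other neurotics: t = {t_all:.2f}")
```

Under these assumptions the reduced comparison comes out within about 0.01 of the reported value of 1.22, which suggests the assumed formula is at least close to the one used.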

In attempting to explain the failure to find 
significant differences between the neurotic 
groups a check was made on the fundamental 
assumption that the presence of the defense 
reactions precludes, or at least reduces, the 
appearance of manifest anxiety symptoms. On 
a questionnaire filled out for each patient by 
his psychiatrist the patient was classed into 
one of three groups according to the over-all 
amount of manifest anxiety exhibited: absent 
or mild, medium, or severe. (All judgments 
were made in terms of a patient population.) 
A chi-square test of independence indicated 
that there was no significant difference between 
the number of anxiety neurotics and all other 
neurotics assigned to each of these categories. 
When the four questionable groups were re- 
moved, the anxiety ratings for the latter group 
decreased slightly but the value of chi square 
again failed to reach statistical significance. 
A comparison of means on the manifest anx- 
iety scale, described earlier, also failed to show 
differences, the mean of the anxiety neurotics 
being 30.00 and of all other neurotics 30.96. 
Since the groups were similar in these measures 
of overt anxiety, it is not surprising that their 
conditioning performance also did not differ
significantly.


The results of the present investigation, 
then, do not conform to the predictions devel- 
oped from the clinical hypotheses concerning 
the significance of certain neurotic symptoms. 
Not only was there failure to support the 
inference that these symptoms reduce internal 
anxiety, as shown by the conditioning data, but 
also the analyses of the estimates of manifest 
anxiety indicated that the groups did not differ 
in this respect. In this group of patients, at
least, defensive neurotic symptoms seemed to
be merely added to manifest anxiety reactions 
and did not replace them. 

A number of qualifications could be added 
to the defense theory which would leave it 
essentially intact and still explain the present 
findings. Individual differences might be ex- 
pected in the degree to which a defense system 
was perfected. For example, patients whose 
symptoms have appeared only recently might 
still be in the process of developing them and 
hence be more anxious than persons who had a 
long history of illness; some individuals might 
be attempting to give up their neurotic symp- 
toms and others might be content with them. 
Failure to take such variables into account 
might obscure the relationship of well-devel- 
oped defenses to anxiety. Or it might be 
claimed that in terms of the number and effi- 
ciency of the defenses, neurotic disorders lie 
along a continuum in such a fashion that 
opposing one extreme group with all others, as 
was done in the present study, will reveal no 
differences in either manifest or internal anx- 
iety unless extremely sensitive measuring in- 
struments are used. Whatever the explanation, 
a reasonable conclusion to be drawn from the 
results of the present study is that the clinical 
hypotheses describing the significance of neu- 
rotic behavior are, at best, oversimplified and 
need elaboration. Further research, testing 
such speculations as those offered above, is 
certainly indicated. 


TABLE 2

MEAN PERCENTAGE OF CONDITIONED RESPONSES
EXHIBITED BY NEUROTIC GROUPS
IN 80 CONDITIONING TRIALS

Group                            N      Mean     σM

Anxiety neurotics               15     28.67    5.98
All other neurotics             59     26.18    3.32
Other neurotics minus
  four categories               25     20.50    2.90


[Figure 2: acquisition curves for the psychotic and neurotic groups; horizontal axis, block of ten trials (1 to 8).]

FIG. 2. ACQUISITION CURVES SHOWING THE MEAN
NUMBER OF CR'S FOR THE NEUROTIC AND PSYCHOTIC
GROUPS IN SUCCESSIVE BLOCKS OF TEN TRIALS


Psychotics vs. neurotics. The Ss in the present 
investigation include 34 neurotic and 21 psy- 
chotic individuals whose conditioning per- 
formance was reported by the writers in 
another study (13), plus an additional num- 
ber of Ss tested at a later time. In the original 
study the psychotic Ss, when compared to 
neurotics, exhibited a significantly superior 
conditioning performance. In order to deter- 
mine whether this difference would still be 
found with a larger sample from each popula- 
tion, conditioning scores for the groups were 
again compared. The conditioning curves 
shown in Fig. 2 indicate that the psychotic 
group continued to exhibit a consistently 
higher performance throughout the course of 
the conditioning trials. However, as may be 
seen from Table 3, the results of a t test indicated
that the difference between means for
the total conditioning scores was significant 
only between the .05 and .10 level of confi- 
dence, somewhat lower than was found in the 
earlier study.
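
The comparison summarized in Table 3 can be approximated from the tabled values alone. The following is a minimal sketch, not the authors' computation: it assumes that the σM column gives the standard error of each group mean, that the two groups are independent, and that the standard error of the difference is formed without pooling. The function name and the approximate two-tailed critical values (about 1.66 at the .10 level and 1.98 at the .05 level for roughly 100 df) are supplied here for illustration.

```python
import math


def t_from_standard_errors(mean_a, se_a, mean_b, se_b):
    """Return (t, se_diff) for two independent means.

    Assumes se_a and se_b are standard errors of the means and that the
    standard error of the difference is sqrt(se_a**2 + se_b**2).
    """
    se_diff = math.sqrt(se_a ** 2 + se_b ** 2)
    return (mean_a - mean_b) / se_diff, se_diff


# Values as read from Table 3 (mean percentage of CRs, standard error).
neurotics = {"n": 74, "mean": 26.69, "se": 2.98}
psychotics = {"n": 42, "mean": 35.89, "se": 4.11}

t_value, se_diff = t_from_standard_errors(
    psychotics["mean"], psychotics["se"],
    neurotics["mean"], neurotics["se"],
)

# Approximate two-tailed critical values of t for roughly 100 df
# (illustrative constants, not taken from the article).
CRIT_10 = 1.66
CRIT_05 = 1.98

print(f"difference between means = {psychotics['mean'] - neurotics['mean']:.2f}")
print(f"standard error of difference = {se_diff:.2f}")
print(f"t = {t_value:.2f}")
print(f"between .05 and .10 levels: {CRIT_10 < t_value < CRIT_05}")
```

Under these assumptions the statistic falls between the two critical values, which agrees with the statement that the difference was significant only between the .05 and .10 levels.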

The two groups were also compared with 
respect to the data from the psychiatric ques- 
tionnaire and the manifest anxiety scale. For 


TABLE 3
MEAN PERCENTAGE OF CONDITIONED RESPONSES ON
80 CONDITIONING TRIALS


Group         N     Mean    σM      t
Neurotics     74    26.69   2.98
                                    [illegible]*
Psychotics    42    35.89   4.11


* Significant between the .05 and .10 level of confidence for
104 df.




the former, a chi-square test of independence 
indicated no difference in the proportion of 
neurotic and psychotic Ss in each of the three 
overt anxiety categories. A comparison of the 
means on the manifest anxiety scale also re- 
vealed no significant difference between groups. 
Thus, it would appear that if the superiority 
of the psychotic group in conditioning were to 
be interpreted in terms of anxiety, the differ- 
ences would be in the amount of functioning 
internal, rather than manifest, anxiety. Ac- 
cording to this interpretation, psychotic 
symptoms mask or partially prevent the ap- 
pearance of overt anxiety responses without a 
corresponding reduction in the amount of 
internal anxiety. 

A further possibility that should be men- 
tioned is that the greater conditioning per- 
formance of psychotics may be due to factors 
other than anxiety. While previous studies 
have demonstrated a relationship between 
conditioning and manifest anxiety, variously 
defined, the correlation coefficients that have 
been reported between these two variables 
(6, 13) indicate that a relatively small amount 
of the variability among Ss can be accounted 
for in terms of anxiety. Thus, it is logically 
possible that the superiority of psychotics over 
neurotics, and, as was found in an earlier 
study (13) over normals, is wholly or partially 
due to as yet unidentified factors that separate 
psychotic individuals from other groups. 


SUMMARY 


The present study was designed to investi- 
gate differences in conditioning performance 
between anxiety neurotics (anxiety state) and 
neurotics in other diagnostic categories, and 
between neurotics and psychotics. Seventy- 
four neurotic Ss (including 15 anxiety neu- 
rotics) and 42 psychotic Ss were run in an eye- 
lid conditioning situation under the same ex- 
perimental conditions. 

The conditioning performance of the anxiety 
neurotics and other neurotics did not differ 
significantly. Further, no difference between 
the two groups was found in the amount of 
overt anxiety exhibited, as measured by a test 
of anxiety and a psychiatric questionnaire 
filled out for each patient. The implications of 
these results for clinical hypotheses concerning 
the function of certain neurotic symptoms
were discussed.

When the psychotic Ss were compared to the
total neurotic group, their conditioning per- 
formance was found to be superior between 
the .05 and .10 level of confidence. Speculations 
concerning this difference were offered. 


REFERENCES 


1. Bitterman, M. E., & Holtzman, W. H. Conditioning
and extinction of the galvanic skin response
as a function of anxiety. J. abnorm. soc.
Psychol., 1952, 47, 615-623.

2. Brown, J. F. Psychodynamics of abnormal behavior.
New York: McGraw-Hill, 1940.

3. Cameron, N. The psychology of behavior disorders:
a biosocial interpretation. Boston: Houghton
Mifflin, 1947.

4. Farber, I. E., & Spence, K. W. Complex learning
and conditioning as a function of anxiety. J. exp.
Psychol., 1953, 45, 120-124.

5. Freud, S. The problem of anxiety. New York:
Norton, 1936.

6. Hilgard, E. R., Jones, L. V., & Kaplan, S. J.
Conditioned discrimination as related to anxiety.
J. exp. Psychol., 1951, 42, 94-100.

7. May, R. The meaning of anxiety. New York:
Ronald, 1949.

8. Montague, E. K. The role of anxiety in serial rote
learning. J. exp. Psychol., 1953, 45, 91-96.

9. Schiff, E., Dougan, C., & Welch, L. The conditioned
PGR and the EEG as indicators of
anxiety. J. abnorm. soc. Psychol., 1949, 44, 549-552.

10. Spence, K. W. Learning and performance in eyelid
conditioning as a function of the intensity of the
UCS. J. exp. Psychol., 1953, 45, 57-63.

11. Spence, K. W., & Farber, I. E. Conditioning and
extinction as a function of anxiety. J. exp. Psychol.,
1953, 46, 116-119.

12. Spence, K. W., & Taylor, Janet A. Anxiety and
strength of the UCS as determiners of the amount
of eyelid conditioning. J. exp. Psychol., 1951, 42,
183-188.

13. Spence, K. W., & Taylor, Janet A. The relation
of conditioned response strength to anxiety in
normal, neurotic and psychotic subjects. J. exp.
Psychol., 1953, 45, 265-272.

14. Taylor, Janet A. The relationship of anxiety to
the conditioned eyelid response. J. exp. Psychol.,
1951, 41, 81-89.

15. Taylor, Janet A. A personality scale of manifest
anxiety. J. abnorm. soc. Psychol., 1953, 48, 285-290.

16. Taylor, Janet A., & Spence, K. W. The relationship
of anxiety level to performance in serial
learning. J. exp. Psychol., 1952, 44, 61-64.

17. Welch, L., & Kubis, J. Conditioned PGR (psychogalvanic
response) in states of pathological
anxiety. J. nerv. ment. Dis., 1947, 106, 372-381.

18. Welch, L., & Kubis, J. The effect of anxiety on
the conditioning rate and stability of the PGR.
J. Psychol., 1947, 23, 83-91.

19. White, R. W. The abnormal personality. New York:
Ronald, 1948.
Received June 5, 1953. 




THE ROLE OF EXPECTANCY IN THE PERFORMANCE
OF POSTHYPNOTIC BEHAVIOR¹


SEYMOUR FISHER 
Army Medical Service Graduate School 


DESPITE a wealth of anecdotal material
and case reports, there have been few
experimental investigations of the
performance of posthypnotic behavior. Aside 
from an important contribution by Erickson 
and Erickson (3) and several studies dealing 
with the duration of posthypnotic suggestions 
summarized in Weitzenhoffer's paper (8), the 
major variables remain essentially unexplored. 
The purpose of the present communication 
is to report the results and implications of a 
simple, but relatively well-controlled, experi- 
ment on posthypnotic suggestion. The primary 
variable under investigation was the nature of 
the setting in which the suggestion was carried 
out, and an attempt was made to manipulate 
the subject's (S's) expectancies after the termination
of the formal hypnotic state. Thus,
the general hypothesis being tested is that 
successful performance of a posthypnotic act 
is not only a function of the specific sugges- 
tions given under hypnosis, but is also con- 
tingent upon the S's subsequent expectations. 


METHOD 
Subjects 


A total of 13 Ss was employed in this study. All were 
capable of attaining either a medium or deep hypnotic 
state, and all had previously shown that they could re- 
spond to simple posthypnotic suggestions. Twelve Ss 
were males who had earlier participated in a separate 
hypnotic experiment; the thirteenth S was a high school 
girl who had had only a few hours of hypnotic experi- 
ence prior to this experiment. 


Procedure 


While under hypnosis each S was given the following 
posthypnotic suggestion, which was generally repeated 
two or three times: 

“Now listen carefully. After I awaken you, you'll 
notice a very interesting thing occurring. Every time 
you hear the word ‘psychology’ you’ll reach up and 
scratch your right ear. You will be wide awake, and 
aware of everything that’s going on, but each time the 
word ‘psychology’ is mentioned, you’ll feel compelled 
to scratch your right ear. Do you understand that 
clearly?” 


1 Based on one portion of a dissertation submitted to 
the faculty in partial fulfillment of the requirements for 
the Ph.D. degree at the University of North Carolina, 
1952. 


During the administration of the suggestion, S was
alone in the room with the hypnotist (hereafter referred
to as E1). At the conclusion of the suggestion, S was
awakened and the experiment proceeded as follows:

Period 1. One or more experimenters (associates of 
the hypnotist who were collaborating in the study) 
entered the room and were introduced to S. A general 
discussion ensued which included S, E1, and whoever
else was present. Each of the various Es managed to 
weave into the conversation the word “psychology” a 
number of times (a minimum of five), and also at- 
tempted, by means of indirect questioning, to get S 
himself to say the word. While the discussion was car- 
ried on in a friendly, informal, nonexperimental at- 
mosphere, it was apparent at all times that the Es 
were taking note of S's ear-scratching behavior. 

Period 2. After the discussion had gone on about ten 
minutes, an attempt was made to alter inconspicuously 
the nature of the situation. The other Es temporarily
dropped out of the conversation, and E1 began to chat
with S about the posthypnotic suggestion which had 
been given him under hypnosis. The ancillary Es soon 
joined in this discussion, and during the next five or 
ten minutes every effort was made to create and 
sustain the impression that the “experiment” was com- 
pleted—i.e., that the Es had satisfactorily witnessed 
the effects of the posthypnotic suggestion, and were 
now no longer interested in ear-scratching behavior per 
se. Throughout this period, however, the stimulus word 
"psychology" was again mentioned by all the Es on 
numerous occasions. 

In order to clarify the technique used during Period
2, the following is a truncated account of the conversation
with one S:

E1: “Have you noticed anything unusual during the
past few minutes, Jim?”

S: “Why, yes, I've been scratching my ear every
once in a while.”

E1: “Uh-huh, as you may have guessed, you were responding
to a posthypnotic suggestion I gave you
earlier. Did you notice when you were scratching your
ear?”

S: “It seems like it was to a word or something.”

E1: “That’s right. I had given you the suggestion
that every time you heard the word ‘psychology’ you
would scratch your ear. How did you feel the first time
you heard the word ‘psychology’?”

S: “Well, I just found my hand reaching up.”

E1: “Yes, posthypnotic suggestions work that way.
All that’s necessary is to tell you while you’re in hypnosis
that you'll scratch your ear whenever you hear
the word ‘psychology,’ and sure enough you find yourself
doing it.”

The conversation then continued along the same
lines with the other Es joining in.

Period 3. As an additional control, E1 now endeavored
to restructure S’s expectancies by intimating
that the experiment was still in progress. The hypnotist 


TABLE 1
PRESENCE (+) OR ABSENCE (−) OF A POSTHYPNOTIC
RESPONSE OF EAR SCRATCHING TO HYPNOTIST'S
PRESENTATION OF STIMULUS WORD


Subject    Period 1    Period 2    Period 3
Co            +           −        [illegible]
Pa            +           −        [illegible]
Ho            +           −        [illegible]
Ro            +           −        [illegible]
He            +           *        [illegible]
Mo            +           −        [illegible]
Ea            +           −        [illegible]
Bl            +           +        [illegible]
Bu            +           +        [illegible]
Wa            +           −        [illegible]
Ed            +           −        [illegible]
Me            +           †        [illegible]
Ha            +           −        [illegible]


* This S failed to respond the first three times the stimulus
word was presented, and then suddenly began to respond.

† This S held her hand against her ear throughout the second
period.




informed S that he was going to rehypnotize him and 
remove the suggestion: 

“All right, Jim, in a few moments I'm going to 
hypnotize you once again and remove the suggestion I 
gave you. I'm sure you wouldn't want to go around
scratching your ear indefinitely. Before I put you back 
in hypnosis, though, I was just wondering . .. do you 
remember the name of the last psychology book you 
bought?" 

The other Es in the room then presented the stimulus 
word a few times (both in sentence form and as a single, 
isolated word), and finally S was directly requested to 
say the word himself. Regardless of whether or not he 
responded to the stimulus, S was subsequently re- 
hypnotized and the suggestion was permanently re- 
moved. 

At a later date, ranging from one day to a week, 
those Ss who failed to respond during Period 2 were 
interviewed individually by E1 alone. In this interview
interest was focused upon S’s attempts to account for 
the differences in behavior between Periods 1 and 2. 


RESULTS 


Table 1 records the presence (+) or ab- 
sence (—) of the ear-scratching behavior in 
response to the stimulus word “psychology” as 
presented by E1 during the three periods of
the experiment. As was readily expected, all
thirteen Ss responded positively during Period
1.² Under the altered conditions of Period 2,
however, nine Ss completely stopped responding;
two Ss (Bu and Bl) continued to respond


2 Although most Ss were completely consistent in 
either responding or not responding during any given 
period, two Ss during Period 1 missed the cue on single 
occasions. 




in the same manner as they did in Period 1;
and one S (Me) confused the issue considerably
by keeping her hand against her ear throughout
the second period, making it difficult to
determine whether or not she was responding.
One S (He) failed to respond in Period 2 the
first three times the stimulus word was mentioned
by E1, but then abruptly resumed the
response and continued for the remainder of
the period.

Table 1 further indicates that, in Period 3, 
seven of the nine Ss who ceased responding in 
Period 2 reverted to the ear-scratching be- 
havior in the absence of any intervening formal 
hypnosis. 
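
The response pattern described above can be tallied directly. The sketch below is illustrative only: the category labels are supplied here, and the identity of the nine Ss who stopped responding is inferred from the text by excluding the four Ss named individually (Bu, Bl, Me, He).

```python
from collections import Counter

# Subject codes as listed in Table 1.
SUBJECTS = ["Co", "Pa", "Ho", "Ro", "He", "Mo", "Ea",
            "Bl", "Bu", "Wa", "Ed", "Me", "Ha"]

# Period 2 outcomes for the four Ss described individually in the text.
# The labels are hypothetical shorthand, not the article's terms.
NAMED = {
    "Bu": "continued",
    "Bl": "continued",
    "Me": "indeterminate",
    "He": "resumed after three misses",
}

# Every remaining S is treated as having stopped responding (an inference).
period2 = {s: NAMED.get(s, "stopped") for s in SUBJECTS}
period2_counts = Counter(period2.values())

stopped = period2_counts["stopped"]
REVERTED_IN_PERIOD3 = 7  # reported count; the individual Ss are not identified

proportion_stopped = stopped / len(SUBJECTS)
proportion_reverted = REVERTED_IN_PERIOD3 / stopped

for label, count in period2_counts.items():
    print(f"{label}: {count}")
print(f"stopped in Period 2: {stopped} of {len(SUBJECTS)} ({proportion_stopped:.2f})")
print(f"reverted in Period 3: {REVERTED_IN_PERIOD3} of {stopped} ({proportion_reverted:.2f})")
```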

Of particular interest are the data pertaining 
to the variability in response consistency. That 
the instructions were interpreted individualis- 
tically by different Ss is clearly seen in an 
examination of Table 2. While all the Ss with- 
out exception responded in Period 1 when the 
stimulus word originated from E1, four Ss did
not respond when they themselves uttered the 
word; two of these four, moreover, failed to 
respond when any of the ancillary Es pre- 
sented the cue. Table 2 shows that five Ss 
could not be induced to say the word “psy- 
chology,” and it is noteworthy that all Ss 
manifested a decided reluctance to say the 
word. 

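TABLE 2
PRESENCE (+) OR ABSENCE (−) OF A POSTHYPNOTIC
RESPONSE OF EAR SCRATCHING RELATIVE TO THE
SOURCE OF STIMULATION DURING PERIOD 1


                  Source of Stimulation
Subject    E1    Other Experimenters    Self*
Co         +        [illegible]         [illegible]
Pa         +        [illegible]         [illegible]
Ho         +        [illegible]         [illegible]
Ro         +        [illegible]         [illegible]
He         +        [illegible]         [illegible]
Mo         +        [illegible]         [illegible]
Ea         +        [illegible]         [illegible]
Bl         +        [illegible]         [illegible]
Bu         +        [illegible]         [illegible]
Wa         +        [illegible]         [illegible]
Ed         +        [illegible]         [illegible]
Me         +        [illegible]         [illegible]
Ha         +        [illegible]         [illegible]


* A question mark (?) indicates that S could not be induced to
say the stimulus word.


One of the most striking findings was obtained
in the postexperimental interview when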
the Ss were confronted with the fact that they 
had stopped scratching their ear during Period 
2. Of the nine Ss interviewed, two (Co and Ro) 
offered elaborate "rationalizations": one said 
he noticed his ear itching severely each time 
the word was mentioned, but could not scratch 
it because his hands were in his pockets (which 
was untrue); the other claimed he was unable 
to respond because he had a book in his hands 
at the time (equally untrue). The remaining 
seven Ss insisted they did not hear the word 
mentioned during Period 2! 


DISCUSSION


Nature of posthypnotic behavior. A compari- 
son of the behavior in the three experimental 
periods strongly suggests that the performance 
of a posthypnotic suggestion is a function of 
S's understanding of the hypnotist’s expecta- 
tions—not only when the suggestion is given 
during the formal hypnotic state, but also in 
the process of enacting the suggestion. In 
Period 1 Ss responded each time the signal 
was perceived, and it may be safely assumed 
that this behavior would have continued if the 
hypnotist had not restructured the situation 
in Period 2. Although S was allegedly “wide 
awake," obviously some aspect of the hyp- 
notist’s behavior affected S’s posthypnotic 
behavior. To the extent that the hypnotist was 
successful in manipulating the experimental 
periods, it seems reasonable to assume that 
Ss who failed to respond in Period 2 were led 
to believe that the ear-scratching behavior was 
no longer expected. While the actual suggestion 
stipulated nothing about the termination of 
the behavior, and specifically directed S to respond
every time the word was mentioned, the
majority of the Ss were, so to speak, “caught 
off guard.” When the hypnotist implied at the 
beginning of Period 3, however, that the sug- 
gestion was supposed to be still operative, Ss 
tended to revert to the posthypnotic act.³


3 Further support for the validity of this interpretation
is found in a hypnotic interview held with one S
(He) at the conclusion of the experiment. When asked, 
under hypnosis, to account for the fact that in Period 
2 he failed to respond the first three times the word 
was presented but then abruptly began to respond 
again, after extensive questioning he stated that he 
suddenly realized that “the experiment wasn't over 
yet." One might hypothesize that had it been possible 
to structure the situation even more precisely during 
Period 2, all 13 Ss might have been “caught off guard.” 




The fact that a posthypnotic suggestion is 
carried out only in a specific context has never 
been sufficiently recognized in the literature. 
Indeed, it has been generally implied that, how- 
ever unexpectedly a posthypnotic signal may 
arise, the good S finds himself responding 
(1, 4, 6). 

A number of studies indicate that when an 
S is given a posthypnotic suggestion without 
a terminal cue (i.e., indicating when S should 
stop responding), the behavior tends eventu- 
ally to cease as a function of time (e.g., 5, 7). 
That this behavior, however, can be “turned off 
and on” by controlling S’s expectations in the 
subsequent “waking” state seems to be a hith- 
erto unrecognized phenomenon which refocuses 
the entire problem of posthypnotic suggestion. 
In effect, the question may be posed: Is S in
some form of hypnotic state during the total 
time that he is prepared to respond post- 
hypnotically, even when the behavior extends 
over a period of days without any intervening 
formal hypnosis? The answer to this question 
is certainly not a simple one, but will be 
touched on very briefly here. 

Erickson and Erickson (3) have elaborated 
on Moll’s earlier observation (6) that S fre- 
quently appears to pass into a spontaneous 
hypnotic state immediately preceding the sug- 
gested posthypnotic act; this spontaneous state 
terminates as soon as the behavior is completed,
and S resumes his “normal” activity.
The specific manifestations of the state are 
thoroughly described by the Ericksons, who 
also point out that this new state of hypnosis 
can be most clearly demonstrated by pro- 
pitiously interrupting S in the process of per- 
forming the posthypnotic act: S will now heed 
the hypnotist's suggestions once again, al- 
though both before and after the response it is 
claimed that S is no more suggestible to the 
hypnotist's requests than is a nonhypnotic S. 

In the present study the manifestations of 
the spontaneous, self-limited state were clearly 
observed,⁴ but only on each occasion when S
began to reach up to scratch his ear. That is, 
during the greater part of the interaction be- 
tween S and Es, there were no indications of 


4 Amnesia for the ear-scratching behavior, however, 
did not occur even in those Ss who were ordinarily 
amnesic for hypnotic events. It is probable that the 
discussion during Period 2, in which specific reference 
was made to the ear-scratching, contributed to the lack 
of subsequent amnesia. 




any hypnotic-like state; whenever the stimulus 
word was effective, however, the new hypnotic 
state became momentarily evident until S 
finished scratching his ear. Now if it is assumed 
that Ss were “wide awake and normal" in be- 
tween the ear-scratching responses, then it is 
difficult to see why the behavior changed in 
Periods 2 and 3. When the hypnotist restruc- 
tured the setting at the beginning of Period 2, 
there were no indications of a hypnotic state 
at that moment; and certainly when Ss re- 
sumed the ear-scratching behavior in Period 
3, no formal hypnosis was instigated. 

The results of this study suggest, rather, 
the view that during the entire time S remained 
in the setting after being told to awaken, a 
certain degree of rapport existed between him 
and the hypnotist. This rapport was quite 
specific, it appears, in the sense that S con- 
tinued to be concerned about the hypnotist’s 
expectations regarding the posthypnotic be- 
havior previously suggested. In no other way 
did he appear hypersuggestible. 

This concern over the hypnotist’s expecta- 
tions is further seen in the denial behavior ob- 
served during the postexperimental interview 
with the hypnotist. Although the tendency for 
hypnotic Ss to rationalize a posthypnotic sug- 
gestion is commonplace, it appears that a 
hypnotic S is equally prone to rationalize his 
lack of posthypnotic behavior—particularly if 
he believes he has failed to live up to the hypnotist’s
expectations.⁵ Moreover, when one
considers the fact that these denials and ra- 
tionalizations were obtained from some Ss as 
long as one week after the last formal hypnotic 
session, it becomes apparent that the line of 
demarcation between termination of hypnosis 
and "normal waking behavior" is a hazy one 
indeed. Some form of rapport seems to persist 
between S and the hypnotist since the defense 
mechanisms manifested by Ss reflect a degree 
of ego involvement which would not be ex- 
pected in the absence of a strong (hypnotic?) 
relationship. One rather obvious question 
comes to mind: Does S ever completely resume 
his prehypnotic, “normal” state, where he is 
no longer affected by the hypnotist? A promising
area of research might deal with the type
and duration of the subject-hypnotist relationship
after conclusion of formal hypnotic
sessions.


5 By implication, it follows that introspective reports
of Ss who have been used in hypnotic experimentation
may be of dubious validity and should be
interpreted with extreme caution.


Interpretation as a function of expectancy. In
a paper by Erickson (2) it is claimed that the
hypnotic S interprets the hypnotist’s suggestions
in a concrete, literal manner. The present
data are clearly not in line with her contention,
since Ss do not attach any literal meaning to
the phrase “every time the word ‘psychology’
is mentioned.” Instead, it would seem
that whatever interpretation is given to the
explicit suggestion depends upon what S believes
the hypnotist intended. It will be recalled
that when the suggestion was presented during
hypnosis, Ss did not know that additional
people were to be later introduced. In Period
1, after S is awakened, it is almost as though
he has to decide “does he (the hypnotist) expect
me to respond to these people also?” The
fact that eleven out of the thirteen Ss did respond
to the ancillary Es may possibly be
explained by noting that these Es were presented
to Ss by the hypnotist and might thus
be considered as “associates.” The prediction
could be made that if the hypnotist ignored,
or in some way appeared antagonistic toward,
the other Es, fewer Ss would respond to them
as a source of stimulation.

Similarly, it seems doubtful whether it occurred
to Ss during the formal hypnosis (when
they were given the posthypnotic suggestion)
that they themselves might be asked to present
the stimulus word. The peculiar difficulty in
getting Ss to say the word “psychology” during
Period 1 cannot be readily accounted for
by the present data, but is consistent with the
possibility that Ss were so uncertain as to the
hypnotist's expectations on this score that they
were reluctant to have to commit themselves
one way or the other. This phenomenon is an
interesting one, and warrants further research.

SUMMARY AND CONCLUSIONS 


The primary purpose of this study was to 
investigate the extent to which the effective 
enactment of a posthypnotic suggestion is dependent
upon the nature of the setting in
which the suggestion is carried out. On the
basis of data obtained from thirteen hypnotic
Ss, the results were as follows: (a) The continued
performance of a posthypnotic suggestion
is a function of S's belief that the hypnotist
expects the behavior to occur; i.e., if S is
“caught off guard,” the posthypnotic behavior
tends to disappear. However, additional manipulation
of the situation by the hypnotist
can reinstate the behavior in the absence of 
any further formal hypnotic suggestion. (b) 
The hypnotic S does not usually tend to in- 
terpret the hypnotist's suggestions in a con- 
crete, literal manner, but rather, the eventual 
meaning appears to be determined by S’s 
inferences as to the hypnotist's expectations.
(c) An incidental finding of some interest in- 
dicates that Ss who are “tricked” into a lapse 
of carrying out a posthypnotic suggestion seem 
to find it necessary to “save face,” either by a 
complete denial of their “negligence” or by a 
more elaborate form of rationalization. The 
tendency for hypnotic Ss to rationalize their 
posthypnotic behavior is well-established in 
the literature; it appears, however, that hyp- 
notic Ss are equally capable of rationalizing 
their lack of posthypnotic behavior. 

The results suggest that when an S is carrying
out a repetitive posthypnotic suggestion
over an extended period of time, a certain
degree of rapport continues to exist between
him and the hypnotist. This rapport is evidenced
by the S’s unceasing concern over the
hypnotist’s expectations in regard to the pre- 
viously suggested posthypnotic behavior. 


REFERENCES 

1. Bernheim, H. Suggestive therapeutics. New York:
London Book Co., 1947.

2. Erickson, Elizabeth M. Critical comments on
Hibler’s presentation of his work on negative after-
images of hypnotically induced hallucinated colors.
J. exp. Psychol., 1941, 29, 164-170.

3. Erickson, M. H., & Erickson, Elizabeth M.
Concerning the nature and character of posthypnotic
behavior. J. gen. Psychol., 1941, 24,
95-133.

4. Estabrooks, G. H. Hypnotism. New York: Dutton,
1948.

5. Kellogg, E. R. Duration of the effects of posthypnotic
suggestion. J. exp. Psychol., 1929, 12,
502-514.

6. Moll, A. Hypnotism. New York: Scribner & Welford,
1890.

7. Patten, E. F. The duration of posthypnotic suggestions.
J. abnorm. soc. Psychol., 1930, 25, 319-

8. Weitzenhoffer, A. M. A note on the persistence
of hypnotic suggestion. J. abnorm. soc. Psychol.,
1950, 45, 160-162.


Received June 8, 1953. 


SELF-PERCEPTION, PERCEPTUAL DEFENSE, AND ADJUSTMENT¹


BERNARD CHODORKOFF 
University of Wisconsin²


IN THE personality theories of Rogers (12,
13) and Snygg and Combs (2, 14), defensiveness
is described as primarily a perceptual
phenomenon which follows as a consequence
of threat to the individual's self. Defense, in
essence, is the prevention of accurate perceptions
of what is threatening from reaching
awareness. As a result, aspects of the environment
and of the person himself may be denied
to awareness or may be misperceived. It is
in this way that the individual insures the stability
of his self. Furthermore, the adequacy
of the individual's personal adjustment is
considered to be inversely related to the degree
to which experiences are denied awareness.

The present research was designed to study
the implication that the degree of defensiveness
and its perceptual consequences will be
the same, regardless of whether you select for
examination the individual’s perception of his
environment or his perception of himself, and
that both will be related to the adequacy of
the individual’s personal adjustment.

The hypotheses which were set up to be
tested may be stated as follows:

1. The greater the agreement between the
individual’s self-description and an objective
description of him, the less perceptual defense
he will show.

2. The greater the agreement between the
individual’s self-description and an objective
description of him, the more adequate will be
his personal adjustment.

3. The more adequate the personal adjustment
of the individual, the less perceptual
defense he will show.


1 This article is based on a thesis submitted in
partial fulfillment of the requirements for the degree of
Doctor of Philosophy at the University of Wisconsin.
The author expresses his gratitude to Dr. E. E. Baughman
for his supervision and assistance, and to the
other members of his research committee, Drs. D. A.
Grant and J. C. Gilchrist, for their suggestions. The
author also expresses his appreciation to the judges who
participated in evaluating the projective material: Drs.
S. Epstein and H. A. Page and Miss E. Whitehouse.

2 Now at the Veterans Administration Hospital,
Dearborn, Michigan.

3 Perceptual defense here refers to the differential
recognition of threatening and nonthreatening words.



METHOD 


The methodology employed is a blend of clinical 
and experimental techniques. This approach made it 
possible to obtain relevant information about several 
currently important areas of research; these include 
perception and personality theory, perceptual defense, 
the role of perception in adjustment, and the relationship
between self-perception and adjustment.

Instruments. Defensiveness in perception of the 
self was studied by means of a Q sort of self-descriptive 
statements. Disagreement between the individual's 
Q-sort self-description and the Q-sort description of 
him based on projective material represented defensive- 
ness in this situation. 

Defensiveness in terms of perception of the environ- 
ment was studied by means of the tachistoscopic 
presentation of neutral words and threatening words 
for which the threat is personally relevant. In this 
situation defensiveness manifested itself as greater 
difficulty in recognizing the threatening words. 

An adjustment rating scale and the Munroe Inspec- 
tion Rorschach Check List (9) were used to evaluate 
the projective material. Two measures of adjustment 
were derived from these. 

Subjects. The subjects (Ss) were 30 male under- 
graduates who were taking introductory psychology 
at the University of Wisconsin. Their ages ranged from 
18 to 29 years; mean age was 21.4 years with a standard 
deviation of 1.92 years. None of the Ss had previously 
taken any of the tests used here, and none had ever
been or was currently involved in psychotherapy or 
counseling. 

Procedure. Three sessions were spent with each S.
The first two sessions were scheduled as close together
as possible, and the third was held exactly one week
following the second session.

During the first session each S filled out a bio- 
graphical inventory, and the Rorschach was admin- 
istered. The biographical inventory consisted of 45 
multiple-choice and incomplete-sentence-type items. 
The items were adapted from an inventory described
by Kelly and Fiske (5). They provided background
information about each S.

The second session included the administration of a 
word association test* and the Thematic Apperception 
Test. The word association test consisted of 100 words, 


* The following have been deposited with the American
Documentation Institute: (a) the word association
test, Document No. 4163; (b) the Q sort, Document
No. 4162; (c) the adjustment rating scale, Document
No. 4161. Order by document number from the ADI
Auxiliary Publications Project, Photoduplication
Service, Library of Congress, Washington 25, D. C.,
remitting in advance for each one $1.25 for 35 mm.
microfilm or $1.25 for 6 by 8 in. photocopies. Make
checks payable to Chief, Photoduplication Service,
Library of Congress.


SELF-PERCEPTION, PERCEPTUAL DEFENSE, AND ADJUSTMENT 


50 emotional and 50 neutral. These were five-letter 
words selected on the basis of (a) equivalent frequency 
of usage as determined by the Thorndike-Lorge (16) 
word counts and (b) agreement between four judges 
as to the classification of each word. The S's reaction 
time to each word was measured by means of a voice- 
key type of reaction time apparatus. 

The Rorschach test and the TAT were administered 
and treated according to the procedures outlined by 
Klopfer and Kelley (6) and Murray (10). 

The perceptual defense procedure was administered 
one week after the word association test. On the basis 
of the reaction times obtained on the word association 
test, two groups of ten words each were selected for 
each S to be presented as the stimuli for the perceptual 
defense procedure. One group consisted of the emo- 
tional words that had the highest reaction times; the 
other group consisted of the neutral words with the 
lowest reaction times. In no case was a word selected 
if S gave indication of not knowing its meaning. In this 
way it was possible to select for each S those words 
which represented personally relevant threatening and 
neutral stimuli.
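
The selection rule just described can be sketched in code. This is a minimal illustration and not the procedure as actually run; the function name `select_stimuli`, the dictionary layout, and the sample words and reaction times are all assumptions made for the example.

```python
def select_stimuli(reaction_times, categories, unknown=(), n=10):
    """Pick the n slowest emotional and n fastest neutral words for one S.

    reaction_times: word -> association reaction time (hypothetical units)
    categories:     word -> "emotional" or "neutral" (the judges' classification)
    unknown:        words S indicated he did not know; these are never selected
    """
    known = [w for w in reaction_times if w not in unknown]
    emotional = [w for w in known if categories[w] == "emotional"]
    neutral = [w for w in known if categories[w] == "neutral"]
    # Threatening set: emotional words with the highest reaction times.
    threatening = sorted(emotional, key=lambda w: reaction_times[w], reverse=True)[:n]
    # Neutral set: neutral words with the lowest reaction times.
    neutral_set = sorted(neutral, key=lambda w: reaction_times[w])[:n]
    return threatening, neutral_set


# Hypothetical data for a single S, with n reduced to 2 for brevity.
rts = {"alpha": 2.1, "bravo": 0.9, "delta": 3.0,
       "eagle": 0.7, "fable": 1.2, "gamma": 2.8}
cats = {"alpha": "emotional", "delta": "emotional", "gamma": "emotional",
        "bravo": "neutral", "eagle": "neutral", "fable": "neutral"}
threat_words, neutral_words = select_stimuli(rts, cats, unknown={"delta"}, n=2)
```

In the study itself n was 10 for each set, drawn from the 50 emotional and 50 neutral words of the association test.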

Three practice words were presented tachistoscop- 
ically to S, and these were followed by the 20 selected 
words presented in random order. Each word was 
presented in the tachistoscope beginning at the .01-sec. 
exposure level, and the exposure time was increased 
consecutively by .01 second until S was able to give 
two consecutive, correct reports of the stimulus word. 
The exposure time at which he correctly reported the 
word for the second time was used as the recognition 
score for that word. 
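
The scoring rule for a single word can be illustrated as follows. This is a sketch under stated assumptions: the function name `recognition_threshold` and the list-of-reports input are hypothetical, and trial k is assumed to be shown at k times .01 sec., as the ascending series in the text implies.

```python
def recognition_threshold(reports, target, step=0.01):
    """Return the exposure time (sec.) of the second of two consecutive
    correct reports, or None if that criterion is never reached.

    reports: S's report on each successive exposure, in order; trial 1 is
             shown at `step` sec., trial 2 at 2 * step sec., and so on.
    """
    streak = 0
    for trial, report in enumerate(reports, start=1):
        # A wrong report resets the run of consecutive correct reports.
        streak = streak + 1 if report == target else 0
        if streak == 2:
            return round(trial * step, 2)
    return None


# Hypothetical series: wrong, wrong, correct, correct.
score = recognition_threshold(["", "horse", "house", "house"], "house")
```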

Immediately following the perceptual defense pro- 
cedure S was given the Q-sort cards with instructions 
to sort them so that they described himself. This Q sort 
consisted of 125 short self-descriptive statements,⁵ e.g.,
“I am impulsive”; “I usually ask for help if I can get

it.”

It was E's purpose to have the Q sort contain items 
about which information could most readily be ob- 
tained from the Rorschach and TAT, and which would 
describe most people to some degree. An attempt was 
made to have the final sample follow the principle of 
randomization (15); that is, the items should interact 
in S's judgment about them, but each item should not 
be contingent upon any other one. This final Q sort 
was developed with the cooperation of four clinically 
trained persons. 

The following Q-sort distribution was employed: 


Frequency:     3   5   7  10  13  16  17  16  13  10   7   5   3
Scale value:  12  11  10   9   8   7   6   5   4   3   2   1   0
N = 125
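
As an arithmetic check on the forced distribution above, the following short sketch confirms that the 13 frequencies sum to 125 and computes the mean and variance that every completed sort must share. The variable names are illustrative only.

```python
# Forced Q-sort distribution as reported in the text.
frequencies = [3, 5, 7, 10, 13, 16, 17, 16, 13, 10, 7, 5, 3]
scale_values = list(range(12, -1, -1))  # 12, 11, ..., 0

n_items = sum(frequencies)
mean_value = sum(f * v for f, v in zip(frequencies, scale_values)) / n_items
variance = sum(
    f * (v - mean_value) ** 2 for f, v in zip(frequencies, scale_values)
) / n_items
```

Because every sort has the same mean and variance, the product-moment correlation between two sorts depends only on the sum of squared differences between paired item placements: r = 1 − Σd² / (2Nσ²), which for this distribution is 1 − Σd² / 2012.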


Three judges and E participated in the task of 
evaluating the projective and biographical material ob- 
tained from Ss. The judges were selected on the basis of 
their clinical experience and ability to deal with projective
data. Each judge worked on ten cases; these
were assigned in consecutive order as the material
became available. The E, as the fourth judge, teamed
up with each of the other judges in evaluating the
material and, therefore, participated in evaluating all
Ss.


5 See footnote 4.




At the end of the second and before the third session, 
one of the judges was given the following material: the 
biographical inventory, the Rorschach protocols with 
the scores summarized, the word association test data, 
and the TAT protocols. 

The judge and E worked up their independent 
interpretations of the data, and then met to discuss 
them. After the two reached some agreement about 
their evaluation of the material, they sorted the Q-sort 
items to describe S as they saw him.⁶

The judge and E independently rated S on an adjustment
rating scale⁷ which consisted of 11 subscales.
A final rating for S on each of the subscales was then
agreed upon. The content of each scale was derived 
primarily from Maslow and Mittleman (8, ch. IV). A 
descriptive paragraph was provided for the extreme 
ratings of each scale so that these might serve as 
anchoring points for the judges. 

After the judges completed their work, E applied 
the Munroe Inspection Rorschach Check List to Ss' 
Rorschach protocols. 

Measures and statistics. Three basic scores were 
derived from the data. The first was the accuracy of 
self-description (ASD) score, which was obtained in the 
following way: 

The judges’ Q-sort description of S was considered 
the objective description of him. The S’s Q-sort self- 
description was correlated with the judges’ Q-sort 
description. The product-moment r obtained was transformed
to Fisher’s z, and this score was used as the
ASD score. The higher the ASD score, the greater the 
agreement between the objective description of S and 
S’s self-description. 
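
The ASD computation can be sketched as a product-moment correlation followed by Fisher's r-to-z transformation (the inverse hyperbolic tangent). The function names and the five-item example are assumptions for illustration; the actual sorts contained 125 items.

```python
import math


def pearson_r(x, y):
    """Product-moment correlation between two equal-length score lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def asd_score(self_sort, judge_sort):
    """Fisher z of the correlation between S's sort and the judges' sort."""
    return math.atanh(pearson_r(self_sort, judge_sort))


# Hypothetical scale values assigned to the same five items by S and by the judges.
self_sort = [0, 1, 2, 3, 4]
judge_sort = [0, 2, 1, 3, 4]
asd = asd_score(self_sort, judge_sort)  # r = .90 for these values
```

Note that `math.atanh` is undefined at r = 1 or r = −1, so two identical sorts would need special handling in any real use.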

The second basic score was the perceptual defense
(PD) score. Here two scores were derived. One was the
difference between the mean recognition threshold for
neutral words and the mean recognition threshold for
threatening words (Mt − Mn); this was called the
PDm score. The other was the difference between the
standard deviation of the distribution of recognition
thresholds for the neutral words and the standard
deviation of the distribution of recognition thresholds
for threatening words (σt − σn); this was called the
PDs score.
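
A minimal sketch of the two PD scores follows, using the direction given by the printed symbols (threatening minus neutral). The function name `pd_scores` is hypothetical, and the use of the sample standard deviation (n − 1 divisor) is an assumption, since the text does not say which form was used.

```python
from statistics import mean, stdev


def pd_scores(threat_thresholds, neutral_thresholds):
    """Return (PDm, PDs) for one S.

    PDm = Mt - Mn: difference between mean recognition thresholds.
    PDs = st - sn: difference between threshold standard deviations.
    """
    pdm = mean(threat_thresholds) - mean(neutral_thresholds)
    pds = stdev(threat_thresholds) - stdev(neutral_thresholds)
    return pdm, pds


# Hypothetical thresholds in hundredths of a second (three words per set for brevity).
pdm, pds = pd_scores([10, 12, 14], [5, 6, 7])
```

A positive PDm here means the threatening words required longer exposures than the neutral words; a negative PDm corresponds to faster recognition of the threatening words.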

The third score was the adjustment (A) score. Here
again, two measures were derived, one based on the
Munroe check list and the other derived from the
judges’ ratings of adjustment. The check-list adjustment
(Ac) score was the number of checks S received.

6 This procedure was suggested by Dr. Julius Seeman,
Counseling Center, University of Chicago.

7 See footnote 4.

* In the present study it was hoped that the possibility
of the occurrence of response suppression in the
perceptual defense situation, and intentionally produced
inaccurate self-description in the Q-sorting procedure,
would be minimized by using only male Ss and
by conducting the experimental procedures in an
atmosphere which would not inhibit Ss. It is doubtful
that these did occur to any significant degree, especially
since the sequence of the procedures would also tend to
lessen the possibility of inhibition. However, if these
reactions did occur, then these should also be considered
as defensive reactions.




The rating scale adjustment (Ar) score was derived 
by converting each S's rating on each scale into a 
standard score based on the distribution of the 30
ratings for that scale. For each S, the Ar score was the 
sum of his standard scores on the 11 subscales. Poor 
adjustment was indicated by a high check-list score, 
or by a low rating on the adjustment rating scale. 
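
The derivation of the Ar score can be sketched as follows. The function name `ar_scores` and the small ratings matrix are hypothetical, and the population standard deviation is an assumption; the text specifies only that each rating was converted to a standard score within its own scale and that the standard scores were summed.

```python
from statistics import mean, pstdev


def ar_scores(ratings):
    """ratings[s][k] is the agreed rating of subject s on subscale k.

    Each rating is converted to a standard score within its subscale,
    and each subject's standard scores are summed across subscales.
    """
    n_scales = len(ratings[0])
    totals = [0.0] * len(ratings)
    for k in range(n_scales):
        column = [row[k] for row in ratings]
        m, sd = mean(column), pstdev(column)  # sd must be nonzero
        for s, value in enumerate(column):
            totals[s] += (value - m) / sd
    return totals


# Hypothetical ratings: three subjects on two subscales (the study had 30 and 11).
example_ar = ar_scores([[1, 2], [2, 4], [3, 6]])
```

Standardizing within each scale gives every subscale equal weight in the sum, whatever the spread of its raw ratings.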


RESULTS 


Table 1 presents the correlations that were 
obtained between the basic scores. It will be 
noticed in this table that in addition to pro- 
duct-moment correlations, biserial and mul- 
tiple correlations were also employed. Pro- 
duct-moment correlations were not used with 
the Ar scores because these scores were based 
on values obtained from a rating scale on 
which the equality of units was questionable. 
The group of 30 Ss was divided into good and 
poor adjustment groups by using the median 
as the dividing point. Biserial r was then 
appropriate (3). Multiple correlations were 
used because of the possibility that a combination
of the two measures of adjustment would
give a better estimate of S's adjustment level
than would either measure alone.
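
For readers unfamiliar with these coefficients, the sketch below shows the standard textbook formulas for a biserial r from a dichotomized variable and for a multiple R from two predictors. It is offered only as an illustration: the paper does not report its computing formulas, and the function names and example values are assumptions.

```python
import math
from statistics import NormalDist, mean, pstdev


def biserial_r(scores, in_upper_group):
    """Biserial r between a continuous score and an artificial dichotomy.

    r_b = (M_upper - M_lower) / s * (p * q / y), where p and q are the group
    proportions and y is the normal ordinate at the point dividing p from q.
    """
    upper = [x for x, g in zip(scores, in_upper_group) if g]
    lower = [x for x, g in zip(scores, in_upper_group) if not g]
    p = len(upper) / len(scores)
    q = 1 - p
    y = NormalDist().pdf(NormalDist().inv_cdf(p))
    return (mean(upper) - mean(lower)) / pstdev(scores) * (p * q / y)


def multiple_r(r_y1, r_y2, r_12):
    """Multiple R of a criterion on two predictors, from the three pairwise r's."""
    r_squared = (r_y1 ** 2 + r_y2 ** 2 - 2 * r_y1 * r_y2 * r_12) / (1 - r_12 ** 2)
    return math.sqrt(r_squared)


# Hypothetical data: four scores split at the median into lower and upper halves.
rb_example = biserial_r([1, 2, 3, 4], [False, False, True, True])
big_r_example = multiple_r(0.5, 0.5, 0.0)
```

With a median split p = q = .5, so the factor pq/y is constant (about .627) and the coefficient is driven by the standardized difference between the two group means.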

The findings are listed below as they relate 
to their respective hypotheses: 

Hypothesis 1. Both the PDm and PDs scores 
correlated negatively and significantly with 
the ASD scores. These findings support the 
first hypothesis. The more accurately the 
individual described himself, the less percep- 
tual defense he showed. 

TABLE 1
SUMMARY OF CORRELATIONS

Hypothesis  Basic Scores Correlated   Type of Correlation   Value
1           ASD with PDm              r                     [illegible]
            ASD with PDs              r                     −.44*
2           Ac with ASD               r                     −.34
            Ar with ASD               rb                    [illegible]
            Ar and Ac with ASD        R                     .13**
3           Ac with PDm               r                     .29
            Ac with PDs               r                     .37*
            Ar with PDm               rb                    [illegible]
            Ar with PDs               rb                    −.68**
            Ar and Ac with PDm        R                     [illegible]
            Ar and Ac with PDs        R                     [illegible]

Note. Product-moment correlation has been indicated by r,
biserial correlation by rb, and multiple correlation by R.

* Significant at the .05 level of confidence.

** Significant at the .01 level of confidence.


BERNARD CHODORKOFF


Hypothesis 2. The Ar scores correlated positively
and significantly with the ASD scores.
Although the Ac scores did not correlate significantly
with the ASD scores, the correlation
nevertheless approached significance in
the predicted direction. A combination of the
Ar and the Ac scores correlated positively and
significantly with the ASD scores. In general
these results support the second hypothesis.
The better the adjustment of the individual,
the more accurate was his self-description.

Hypothesis 3. The Ar scores correlated negatively
and significantly with the PDm scores.
The correlation between the Ac and PDm
scores, although not significant, was in the
predicted direction. A combination of the Ar
and the Ac scores correlated negatively and
significantly with the PDm scores.

The Ar scores correlated negatively and
significantly with the PDs scores. The Ac
scores correlated positively and significantly
with the PDs scores. A combination of the Ar
and the Ac scores correlated negatively and
significantly with the PDs scores. These results
confirm the third hypothesis. The more
adequate the individual's personal adjustment,
the less perceptual defense he showed.

In addition to the correlations listed in
Table 1, the two measures of adjustment were 
correlated. A biserial r of —.56 was obtained 


obtained criterion measures. The judges'
Q-sort descriptions of Ss were based on the
same projective material from which both
measures of adjustment were derived. It would
have been advantageous to have obtained
an independent evaluation of adjustment,
but unfortunately this was not feasible. Consequently,
the possibility that the correlations
of the Ar and Ac scores with the ASD scores
are somewhat inflated must be considered.
The correlations involving the PD scores are
not open to the same criticism, however.


DISCUSSION

Adjustment and the handling of perceived
threat. An interesting finding became evident
when the individual scores were perused. Of
the 30 Ss, 11 recognized the threatening words
more quickly than the neutral words. These
Ss appear to be the sensitizers spoken about
in other perceptual defense studies (1). Of
these 11 Ss, 10 belong to the group of 15 Ss




who have the highest ASD scores. Also, of 
these 11 sensitizers, 8 are included in the group 
of 15 Ss who have the highest Ar scores. When 
these results are considered with the major 
findings of this study, the following formula- 
tion is suggested: 

The better adjusted individual attempts to 
obtain mastery over threatening situations by 
getting to know, as quickly as possible, what 
it is that is threatening. He may be alerted to 
the possibility of threat and may be influenced 
by a set which leads him to try to differentiate 
and symbolize aspects of himself and his en- 
vironment which are potentially threatening. 
The less adequately adjusted individual, in 
contrast, may be thought of as influenced by 
a set to keep threat inadequately differentiated
and inadequately symbolized.

Furthermore, it can be argued that adequate 
behavior depends upon adequate perception. 
If an individual cannot differentiate and sym- 
bolize aspects of himself and his environment, 
his behavior in turn will not be adequate. He 
will also be unable to resolve threatening situa- 
tions when they confront him, for if he does 
not recognize threat, how can he be expected 
to deal with it effectively? 

It may be that for the well-adjusted person 
threat is a relatively momentary phenomenon. 
He is only threatened initially; then he extends 
himself in order to handle the threat. As a 
result, the threat is minimized. For the mal- 
adjusted person, threat persists because he 
cannot deal with it directly. Instead he tries 
to remove himself from it by denial or misper- 
ception. 

Perceptual defense an individual, not a 
general, reaction. The concept of perceptual 
defense, an unconscious mechanism of resis- 
tance to recognition of threatening material, 
has frequently been used to explain the higher 
recognition thresholds for taboo words than 
for neutral words (1, 7). Whether or not such 
a mechanism is necessary has been questioned 
by some workers in the field (4, 11). 

In the present study, no difference was 
found between the mean recognition thresholds 
for threatening and nonthreatening words. 
The t test was used to evaluate the difference
between the means (Mt = 15.11, σt = 7.62;
Mn = 14.35, σn = 8.91), and a value of t
= .75 was obtained. This is obviously not
significant and indicates that perceptual defense
did not occur for the group as a whole.




The matter cannot be dismissed at that, how- 
ever, for meaningful differences have been 
found in terms of individual reactions. The 
importance of these differences has been in- 
dicated by the major findings of this study. 
These findings suggest therefore that percep- 
tual defense is not a general phenomenon but 
may be a process which some individuals are 
more apt to manifest than others. 

Importance of using individually selected, 
personally relevant stimuli. Consideration must 
be given to the possibility that there are 
different kinds of threat. Taboo words like 
whore and bitch may present threat in the 
sense of social prohibition which is uniformly 
imposed on most people; these words, therefore,
can generally be expected to produce
elevated recognition thresholds. Other words
like mammy and haven, if threatening to the 
individual, may represent more basic psy- 
chologic threat, and may be more intimately 
related to the person’s personality trends. 
The handling of such threatening material is 
much more of an individual matter.


SUMMARY 


The present research was designed as an 
attempt to verify the following hypotheses: 

1. The greater the agreement between the 
individual's self-description and an objective 
description of him, the less perceptual defense 
he will show. 

2. The greater the agreement between the 
individual's self-description and an objective 
description of him, the more adequate will 
be his personal adjustment.

3. The more adequate the personal adjust- 
ment of the individual, the less perceptual 
defense he will show. 

The results in general supported all hypoth- 
eses. The following can be concluded: 

1. In a group of Ss who show varying degrees 
of adjustment and defensiveness, one finds 
that the more inaccurate and faulty the in- 
dividual's perception of his environment, the 
more inaccurate and faulty is his perception 
of himself; and the more inaccurate and faulty 
the individual's perceptions of himself and 
his environment, the more inadequate is his 

nal adjustment. j j 
xpo com defense is meaningful when it 
is considered as an individual phenomenon 
rather than a general or universal one. Im- 
portant differences exist between persons and 




apparently can be demonstrated if personally 
relevant, individually selected stimuli are 
used. 


REFERENCES 


1. Bruner, J. S., & Postman, L. Emotional selectivity
in perception and reaction. J. Pers., 1947, 16, 69-77.

2. Combs, A. W. A phenomenological approach to
adjustment theory. J. abnorm. soc. Psychol.,
1949, 44, 29-35.

3. Cronbach, L. J. Statistical methods applied to
Rorschach scores: a review. Psychol. Bull., 1949,
46, 393-429.

4. Howes, D. H., & Solomon, R. L. A note on McGinnies’
“Emotionality and perceptual defense.”
Psychol. Rev., 1950, 57, 229-234.

5. Kelly, E. L., & Fiske, D. W. The prediction of
performance in clinical psychology. Ann Arbor:
Univer. of Michigan Press, 1952.

6. Klopfer, B., & Kelley, D. M. The Rorschach
technique. Yonkers, N. Y.: World Book Co., 1946.

7. McGinnies, E. Emotionality and perceptual
defense. Psychol. Rev., 1949, 56, 244-251.

8. Maslow, A. H., & Mittleman, B. Principles of
abnormal psychology. New York: Harper, 1941.

9. Munroe, Ruth L. Prediction of adjustment and
academic performance of college students by a
modification of the Rorschach method. Appl.
Psychol. Monogr., 1945, No. 7.

10. Murray, H. A. Thematic Apperception Test.
Cambridge: Harvard Univer. Press, 1943.

11. Postman, L., Bronson, W. C., & Gropper, G. L.
Is there a mechanism of perceptual defense? J.
abnorm. soc. Psychol., 1953, 48, 215-223.

12. Rogers, C. R. The organization of personality.
Amer. Psychologist, 1947, 2, 358-368.

13. Rogers, C. R. Client-centered therapy. New York:
Houghton Mifflin, 1951.

14. Snygg, D., & Combs, A. W. Individual behavior: a
new frame of reference for psychology. New York:
Harper, 1949.

15. Stephenson, W. Some observations on Q technique.
Psychol. Bull., 1952, 49, 483-498.

16. Thorndike, E. L., & Lorge, I. The teacher's word
book of 30,000 words. New York: Bureau of
Publications, Teachers Coll., Columbia Univer.,
1944.

Received June 25, 1953.


AN EXPERIMENTAL INVESTIGATION OF THE EFFECT OF UNSTABLE
INTERPERSONAL RELATIONS IN A GROUP¹


LEON FESTINGER
Laboratory for Research in Social Relations, University of Minnesota
AND HERMAN A. HUTTE
Nederlands Instituut voor Praeventieve Geneeskunde, Leiden, Netherlands


IN ORDER to work together, live together,
or even to maintain the barest degree of
social contact with one another, it is
necessary for persons to share ideas, to communicate
attitudes and feelings and information.
While much communication is intended
as such, some of the communication which
occurs among persons may be termed “indirect”
or “inadvertent” communication. This
occurs when information is conveyed to
others, not by actually talking about it, but
by behaving in such a way that someone else
can draw inferences.

There are some kinds of things about
which communication is almost never direct
but is typically inadvertent. Among adult
persons of the same sex, for example, it is
quite rare to have direct communication
about how much they like each other. This
type of direct communication about social
preferences does go on among children, but
among adults it would generally be embarrassing.
People do, however, behave differently
toward those they like and toward those
they dislike and in this manner achieve communication
about these feelings. Certainly,
among people who interact with one another,
there is knowledge of how others feel about
them even though there may never have been
any direct communication on the subject.

1 The collaboration on this study was made possible
by a grant from the Rockefeller Foundation. Initial
planning of the study was carried out jointly by the two
authors in Ann Arbor, Michigan. After preliminary
experimentation was done, the final design was agreed
upon during conferences in Leiden, Holland. The actual
experimental work in Holland was done at the Nederlands
Instituut voor Praeventieve Geneeskunde with
funds provided by the Nederlandse Organisatie voor
Zuiver Wetenschappelijk Onderzoek. In the United
States it was done at and supported by the Laboratory
for Research in Social Relations of the University of
Minnesota. The authors wish to express their appreciation
and indebtedness to Aart Hazewinkel and Els v. d.
Schoot, who helped conduct the experiment in Holland,
and to Jay Jackson and Robert Hall, who helped in the
United States.

How much knowledge persons in a group
have concerning how others feel about them
is determined by two factors, assuming a 
complete absence of any direct communication 
on the subject. One factor is the amount of 
difference between behavior toward those who 
are liked and toward those who are disliked. 
The other factor is how sensitive others are to 
these differences in behavior. In a group that 
behaves very formally and strictly according 
to some rules of etiquette and politeness, for 
example, we would expect the differences in 
behavior toward those liked and disliked to be 
minimal. We would not, consequently, expect 
to find very accurate knowledge in such a 
group concerning likes and dislikes. Also, of 
course, no matter how large the differences in 
behavior directed toward different persons 
might be, we would expect little accurate 
knowledge if, for one reason or another, 
people paid no attention to, or were not sensi- 
tive about, these cues. 

This latter factor of sensitivity is maxi- 
mally important in inadvertent communica- 
tion and minimally important in direct 
communication. In direct communication the 
person must only be listening in order to have 
the communication impinge upon him. In 
inadvertent communication, however, this 
factor of sensitivity may make a great dif- 
ference. 

The present experiment attempts to study 
the effect of the stability of the interpersonal 
relations within the group on the accuracy of 
perception of social preference. It may be
expected that persons whose relations with 
others in a group are stable may be different 
from those whose relations are unstable with 
respect to both sensitivity and the cues they 
make available concerning their feelings for 
others.

It is possible that those whose relations in a 
group are stable, because of their greater
security, would be less cautious in their 
behavior and, consequently, provide more 
cues from which others could infer their likes 






and dislikes. One might also expect that those 
whose relations in the group are unstable 
would be more sensitive to the cues available 
because they would be more concerned with 
finding out how others feel toward them. The 
experiment was consequently designed so that 
half the people in each group were put in a 
situation where they felt secure and stable 
about their interpersonal relations in the 
group while the other half were made to feel 
unstable about these relations. After they had 
had a period of time in which to interact 
with one another, measurements were made 
from which one can calculate how accurate
their knowledge was of how others felt about 
them. From observation of the interaction 
among them one can calculate whether there 
was any difference between Ss in the two 
conditions in behavior toward others. 

The experiment was conducted simultane- 
ously in two different countries, Holland and 
the United States. The same number of 
groups was run and the identical procedures 
were used (within the limits of translation of 
language between English and Dutch) in 
both countries. The comparison of the results 
in the two countries can provide us with some 
evidence concerning how independent the 
findings are of cultural differences, at least 
between these two countries. 


METHOD 


Subjects. In each of the two countries data were 
collected on 40 experimental groups, each group con- 
sisting of six subjects (Ss) of the same sex. Twenty 
male and 20 female groups were run in Minneapolis; 28 
male and 12 female groups were run in Holland. All Ss 
were undergraduate students at the University of 
Minnesota, Leiden, or Amsterdam. The Ss scheduled 
for any particular group were always strangers to one 
another. For the purposes of this experiment stranger 
was defined as not knowing the other person’s name. 
Occasionally there were persons in the same group who 
knew each other by sight but did not know each other’s 
name. In Holland there were occasional deviations from 
this rule. Persons who knew each other’s name were 
sometimes in the same group if the experimenter (E) 
felt sure that their acquaintance was only superficial. 

Procedure. The Ss, upon their arrival, were seated in 
a U-shaped arrangement. In the United States the 
observers (Os) were located behind a one-way observa- 
tion mirror which faced the open end of the U. In 
Holland one O was sometimes behind a one-way observation
mirror and sometimes in the room with Ss.
The other Os were always in the room with Ss. In this
way it was possible for Os to see all the faces of the
persons in the group. A letter from A to F was placed 
in front of each S for identification purposes. 




The Ss were asked to introduce themselves to each 
other by telling their names, what part of the country
they came from, and what they were studying. This was 
the only information any of the Ss had about the others 
at the start of the experiment. 

The E then verbally gave the following instructions:

This afternoon you will have a discussion 
among the six of you in which you will make 
up a plot for a radio play. Before we start doing 
that it is, however, necessary for the purposes 
of this experiment for us to know how you feel 
about each other. After the experiment I will 
explain to you why we do this. We would like
you to write down on a slip of paper which will 
be passed out to you in a minute how well you 
like each of the other persons in this group. On 
this slip we would like you to write down the 
letters of each of the other persons in your 
group alongside that statement that best ex- 
presses your feelings toward each other. 

The following slip was then passed out: 


My Letter is ____
____ likes me exceptionally well
____ likes me quite well
____ likes me moderately
____ likes me a little
____ rather indifferent
____ dislikes me a little
____ dislikes me definitely


As soon as the slips had been passed out, E continued:
Some of you may be a bit doubtful about 
these “first impressions" of people. I can, how- 
ever, assure you that we have found in the 
past, in many studies we have done, that first 
impressions of people are extremely im- 
portant and surprisingly valid. For example, 
it frequently happens that spontaneous lasting 
friendships are formed which correspond 
exactly to such first impressions. Also, spon- 
taneous dislikes sometimes occur between 
people who met for the first time which prove 
to be quite stable. We would like you, then, to 
take your time in filling out these slips so that 
you can do it seriously. 

The last instruction was given in an attempt to
make Ss take these initial ratings seriously. Since the
adequacy of the experimental manipulation depended
to a great extent on how important Ss felt these initial
ratings to be, this last instruction was put quite forcefully.
After Ss had finished filling out the slips, they
were also asked to encircle the letters of the two persons 
in the group whom they liked best. The slips were 
collected and E continued: 

We would now like you to estimate how
each of the others rated you a moment ago. For
that purpose a slip of paper will now be passed
out to you. Will you put the letter of each
other person in the group alongside the
statement which describes the way you think
that person feels toward you.

After this second slip, similar in form to the first slip,
had been filled out and collected, E went on:

We will now give you some information
about how the two that you like best in the


SOME EFFECTS OF UNSTABLE INTERPERSONAL RELATIONS IN A GROUP


group feel about each other. It is possible to 
give you this information because while you 
were filling out the second slips, Mr. ____
has been tabulating the first slips. We will now 
pass out to each of you a slip which will tell 
you how those two which you said you liked 
best feel about each other.

The slips containing this information were then distributed.
The actual information given to Ss at this
point was fictitious and constitutes the experimental 
manipulation. We will describe this in detail in the 
next section. The E then said: 

We are now ready for you to start your 
discussion about the radio play. We will pass 
out to you a plot which is unfinished and we 
would like you to discuss among yourselves 
how you think it should end. You are com- 
pletely free to decide upon any solution of the 
problems in the plot. Keep in mind a short 
scene lasting about ten minutes. I will now pass 
out copies of the unfinished plot to each of you. 
Please read it carefully and let me know when 
you are ready to start the discussion. 

The following plot was then passed out: 

Henry, the son of a physician, has a friend, 
Jim, who is under the care of Henry's father. 
Henry knows that Jim is incurably ill. Both 
are in love with a girl, Ellen. Jim doesn't know 
what kind of disease he has; neither does Ellen 
know he is incurably ill. One night Henry calls 
on Ellen just after he had decided to give up 
his studies and accept a job in California. He
intends to ask her that night to marry him and 
go with him to California. Henry knows that 
for many years Ellen has wanted to go to live 
there. Before he gets a chance, however, to tell 
her, Ellen announces her engagement to Jim. 
What is Henry going to say and do and what 
will happen? 

After all six Ss had finished reading the plot, E 
started the discussion, telling Ss that he himself would
sit at the side of the room and would not participate. 
The discussion went on for 20 minutes during which 
time Os recorded certain aspects of the communication 
process. These observations will be described in detail 
later. When 20 minutes had gone by from the time 
the discussion started, E said: 

We now have to interrupt your discussion 
for a moment because we want you to repeat 
what you did at the beginning, that is, to in- 
dicate again on the slips of paper to be passed 
out to you how you feel about the others in the 
group. 

Slips identical in form to the first slips were then 
passed out. After they were filled out by Ss and col- 
lected, E continued: 

And we also would like you to again 
estimate how the others in the group rated you. 
You see, one of the main purposes of the 
experiment is to see how likes and dislikes are 
formed and how well people know how others 
feel about them. So please give this your full 
attention and think it over well before you 
write down your estimates. 

After these slips had been distributed, filled out, and 




collected, Ss were given a questionnaire to answer. We 
will describe the exact questions asked as we deal with 
them in discussing the results. The Ss were then told 
that the experiment was over and were given a full 
explanation of what had been done and why. 

Experimental manipulation of stability of interpersonal 
relations. When Ss were given information as to how 
the two persons they liked best² felt about each other,
this information was systematically fabricated to 
produce conditions of stability or instability of inter- 
personal relations. In each group, three Ss were told 
that their two top choices liked each other (one rating 
of 6 and one of 7), while the other three were told that 
their top choices disliked each other (one rating of 1 
and one of 2). The former information was intended to 
make Ss feel that their relations in the group were 
comfortable in that those persons whom they liked best 
were also compatible with one another. This condition, 
which we will call the compatible condition, should 
have the effect of stabilizing their preferences for others 
and should make them feel relatively secure about their 
personal relations. The condition in which Ss were told
that their top choices disliked each other was intended 
to have the opposite effect. Here it was felt that the 
knowledge of the incompatibility between the two 
people they liked best would make them feel uncom- 
fortable about their relations with others in the group. 
This condition, which we will call the incompatible 
condition, should make Ss relatively uncertain about 
their interpersonal relations in the group and should 
make their preferences for others unstable. Such rela- 
tions among three persons have also been discussed by 
Heider (1) in a manner similar to the present discussion.
We will have more to say later concerning the relevance 
of some of our findings to Heider’s theory. 

The Ss were put at random in either the compatible
or the incompatible condition. Prearranged sequences 
were followed which systematically varied condition 
by position around the table. The Ss were in one or 
another condition depending upon the particular posi- 
tion sequence predesignated for that group. 

Observation of the group discussion. During the
20-minute discussion, observations were made of who
talked, for how long, and to whom. In both countries
the recording of the observations was done on a moving 
tape so that the duration of any communication could 
be computed afterwards. The Os had to indicate which 


observed by three Os. The records of one of the
observers were used for the analysis. His record was modified
if both other Os disagreed with him. The reliability of 
observation of who spoke to whom was .84 computed 
by product-moment correlation. For the data collected 
in Minneapolis, two Os were present for only 10 of the 
40 groups. On 66 per cent of all items recorded the two 
Os agreed both as to who was talking and to whom the
communication was directed. Most of the disagreements
involved one O's saying that the remark was
directed to some specific person while the other recorded
it as having been directed toward the group. The data
used in analysis are those observations made by that O
who observed all of the groups.

² For the sake of ease of expression, we will refer to
the two persons who are most liked as the two top
choices.

Questionnaire data. Data collected on slips which Ss 
filled out during the experiment and also data collected 
on the postexperimental questionnaire can both be con- 
sidered questionnaire data. There follows a summary of 
the data thus collected: 

1. Ratings of how well Ss like the other members of 
the group both before and after the discussion. 

2. Estimates of how well each of the others like 
them both before and after the discussion. 

3. Estimates of how much Ss’ top choices like each 
other after the discussion. 

4. Perceptions of how much Ss talked to others and 
others talked to them. 


RESULTS 


Adequacy of the experimental manipulation. 
It will be recalled that the experimental 
manipulation consisted of an attempt to have 
some persons think that the two members of 
their group whom they liked best liked each 
other (compatible condition), while others 
were given the impression that their two top 
choices disliked each other (incompatible con- 
dition). In order to demonstrate how ade- 
quately this was accomplished, it is not 
sufficient to show that Ss believed what they 
were told. The course of the experiment, in- 
cluding the discussion, might present other 
evidence to them. The adequacy of the ma- 
nipulation can only be judged in terms of 
whether, at the conclusion of the experiment, 
they still retained the impression which they 
were given. 

In order to gather these data, Ss were
asked in the postexperiment questionnaire to
state once more which two members of the
group they had liked best at first. They were
then asked, “What is your best judgment of
how those two feel about each other now?”
They were given an opportunity to check one
of four possible answers on a scale from (1)
they dislike each other to (4) they like each
other very much. Table 1 presents the results
of the answers to this question.


TABLE 1

JUDGMENTS OF HOW WELL THE TWO INITIALLY BEST
LIKED PERSONS LIKE EACH OTHER AFTER THE
EXPERIMENT

                   Minneapolis              Leiden
Condition          N      Mean Rating*      N      Mean Rating*
Compatible         120    3.4               120    3.3
Incompatible       118†   2.1               120    1.9

* Scale from 1 (they dislike each other) to 4 (they like each
other very much).
† Two Ss did not answer the question.

It is clear that in both countries the manipulation
was successful. The Ss in the compatible
condition judged that their top choices
like each other much more than did Ss in the
incompatible condition. In every one of the
80 groups the average for Ss in the compatible
condition was higher than for Ss in the
incompatible condition. We may then conclude
that the difference is highly significant. It is
also interesting to note that the mean values
for each of the two conditions are very much
alike for both countries.
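
The text does not name the test behind this conclusion. As a minimal sketch, assuming a simple sign test in which either condition is equally likely to show the higher group average under the null hypothesis, the chance that all 80 groups fall in the predicted direction can be computed directly. The function name is illustrative only.

```python
from fractions import Fraction


def prob_all_same_direction(n_groups: int) -> Fraction:
    """One-sided sign-test probability that every one of n_groups
    differences has the predicted sign, when each sign has
    probability 1/2 under the null hypothesis."""
    return Fraction(1, 2) ** n_groups


# 80 groups (40 in Minneapolis, 40 in Leiden), all in the predicted direction.
p_value = prob_all_same_direction(80)
print(float(p_value))
```

On this assumption the probability is (1/2)^80, far below any conventional significance level, which agrees with the conclusion drawn in the text.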

Let us now turn our consideration to the 
indirect effects which it was hoped the manip- 
ulation would have. Our hypothesis was that 
Ss in the incompatible condition would be 
less stable in their liking of others than Ss in 
the compatible condition. Because they had 
been given the impression that their top 
choices liked each other, Ss in the compatible 
condition would find themselves in a stable 
interpersonal situation, and we would not 
expect many of them to change their prefer- 
ences during the experiment. On the other 
hand, Ss in the incompatible condition who 
had the impression that their top choices 
disliked each other, would feel uncomfortable 
about the interpersonal situation, and we 
would expect many of them to change their 
feelings about at least one of their two
best-liked persons during the experiment. Of
course, there are many other factors aside
from this one that could induce people to
change their preferences, and so, if the above
hypothesis is correct, we can only hope to
demonstrate a difference between the conditions.
Table 2 presents the data on the number
of persons in each condition who changed
at least one of their top two choices.

It can be seen that those in the incompatible
condition do indeed change at least one of
their two top choices much more frequently
than do those in the compatible condition.
Again, the data from both countries are very
similar although there is a slightly larger
difference in the Leiden data than in the
Minneapolis data. When we evaluate the
significance of the differences on the basis of
consistency from group to group, we find the
differences to be highly significant. The number
of groups in which more incompatibles than
compatibles changed at least one of their two
top choices is 25 and 26 for Minneapolis and
Leiden respectively. The number of groups in
which the direction of difference was reversed
was 7 and 4 for Minneapolis and Leiden
respectively. The same number changed in the
other 18 groups. Tested by chi square (ignoring
the equal groups) the difference is significant
beyond the .01 level of confidence in each
country.


LEON FESTINGER AND HERMAN A. HUTTE

SOME EFFECTS OF UNSTABLE INTERPERSONAL RELATIONS IN A GROUP


TABLE 2

NUMBER OF PERSONS CHANGING TWO TOP CHOICES
DURING EXPERIMENT

                   Number Changing at Least      Number Not
                   One of Their Two Top          Changing
                   Choices
Condition          Mpls.      Leiden             Mpls.      Leiden
Compatible         66         53                 54         67
Incompatible       96         94                 22         26
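
The arithmetic behind the Table 2 comparison and the group-consistency test can be sketched as follows. This is a hedged illustration: it assumes the chi-square test compares the two group counts against an even split with one degree of freedom and no continuity correction, which the text does not specify. The function names are illustrative only.

```python
import math


def pct_changing(changed: int, unchanged: int) -> float:
    """Percentage of persons who changed at least one top choice."""
    return 100.0 * changed / (changed + unchanged)


def chi_square_even_split(n_predicted: int, n_reversed: int):
    """Chi square (1 df, no continuity correction) testing two counts
    against an expected 50:50 split; tied groups are left out."""
    expected = (n_predicted + n_reversed) / 2
    chi2 = ((n_predicted - expected) ** 2
            + (n_reversed - expected) ** 2) / expected
    # Upper-tail probability of chi square with 1 df.
    p = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p


# Percent changing, from Table 2.
mpls_gap = pct_changing(96, 22) - pct_changing(66, 54)
leiden_gap = pct_changing(94, 26) - pct_changing(53, 67)

# Groups in the predicted direction vs. reversed, from the text.
chi2_mpls, p_mpls = chi_square_even_split(25, 7)
chi2_leiden, p_leiden = chi_square_even_split(26, 4)
print(round(mpls_gap, 1), round(leiden_gap, 1))
print(round(chi2_mpls, 3), round(chi2_leiden, 3))
```

Under these assumptions the gap between conditions is somewhat larger in Leiden than in Minneapolis, and both chi-square values exceed 6.63, the 1-df critical value at the .01 level.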

The data do corroborate the hypothesis, and 
we may conclude that in both countries, one 
of the effects of the manipulation was to 
produce a tendency for the incompatibles to 
change at least one of their two top choices 
during the experiment. This result is also 
corroboration of Heider’s (2, 3) theory con- 
cerning such relationships. 

If it is true that many Ss in the incompatible 
condition are changing their two top choices 
because of the perception that those two dis- 
like each other, we would expect them to 
perceive greater liking between those they 
finally choose as the ones they like best. 
Among the Ss in the compatible condition, 
however, changes which occurred would have 
been for any of a variety of other reasons. We 
would then expect no such change in perception 
of how the two whom they liked best after the 
experiment like each other.

In the questionnaire at the conclusion of the 
experiment Ss were asked to state which two 
in the group they now liked best and also to 
state their best judgment of how these two
now felt about each other. These data are 
presented in Table 3. 

For those in the compatible condition, the
changes in average perceived liking between
their two top choices are negligible. For Ss 
in the incompatible condition the shifts in 
their two top choices are significantly toward 
persons who are perceived as liking each other 
better. Looking at consistency from group to 
group we find that in 32 of the Minneapolis 
groups and in 33 of the Leiden groups the 
change in perceived compatibility from the 
initial top choices to the final top choices was 
greater in the incompatible than in the com- 
patible condition. In only four groups (all in 
Minneapolis) was the direction reversed. We 
thus have confirmation that, in the incompatible
condition, the experimentally produced
belief that top choices did not like each other 
was a major factor in producing changes in 
preferences. 

The relation between social preference and 
frequency of communication. During the course 
of the 20-minute discussion, observations were 
made of who spoke, for how long, and to whom 
the communications were directed. We may 
look at these data to determine whether the 
direction of communication was affected by 
the preferences which persons had for the 
others in the group. To the extent that there 
are differences in how often members spoke 
to those they liked, to those they disliked, or 
to those they were indifferent to, we may say 
that cues were present during the discussion 
from which these preferences could be judged.
The data on the average length of each 
communication do not show any clear trends. 
Our presentation will consequently deal with 
the frequency of communication, that is, how 
often communications were addressed to 
certain others in the group. 

TABLE 3

PERCEIVED PREFERENCES BETWEEN TWO TOP CHOICES
BY THOSE WHO CHANGE AT LEAST ONE OF THE TOP
CHOICES

                   Minneapolis                 Leiden
Condition          Initial       Final         Initial       Final
Compatible         3.1           3.2           3.1           2.9
Incompatible       [illegible]   [illegible]   [illegible]   2.8


In examining the relation between preferences
for others and number of communications
addressed to them, it is possible to look
at the data in relation to initial preferences
(ratings made at the beginning of the discussion)
or in relation to the final preferences
(ratings made after the discussion). In general,
the same trends are reflected in both ways of
examining the data. Since our primary
objective is to determine whether or not cues
were present from which members of the group
could judge how others felt about them, and
since the accuracy of this judgment is in terms
of the preferences stated after the discussion,
it makes more sense to look at the relation
between frequency of communication and final
preferences. We shall, therefore, confine our
discussion here to this relationship. Table 4
presents the relevant data.

There is clearly a relationship between the
rating given to a person (how much one likes
him) and the number of communications
addressed to that person. But this relationship
is not monotonic. For both the compatible and
incompatible conditions, both in Minneapolis
and in Leiden, the same type of relationship is
obtained. The fewest communications
are addressed to those rated 3 (indifferent).
The frequency of communication to
those rated 1 or 2 (dislike) is considerably
greater, and it also increases on the other side
of the continuum. More and more communications
are addressed to people the better they
are liked. The differences between the frequency
of communication to those rated 3
and to those rated 6 or 7 are significant at
the .05 level or better for each condition in each
country. The differences between the number
of communications addressed to those rated 3
and to those rated 1 or 2 are also significant
beyond the .05 level for each condition in each
country. We may conclude that the amount of
communication was an available cue which
could be used by the members of the group to
judge how others felt about them and that
these cues were the same, roughly, in both
countries. It is likely that the communications
addressed to those rated 1 or 2 were qualitatively
different from those addressed to people
rated 6 or 7 but, unfortunately, no data were
collected which are relevant to this point.


TABLE 4

MEAN NUMBER OF COMMUNICATIONS ADDRESSED TO
OTHERS ACCORDING TO FINAL RATINGS OF
PREFERENCE

Condition            Mean Number of Communications
                     Addressed to Those Rated
Compatible (M)       …
Compatible (L)       …
Incompatible (M)     …
Incompatible (L)     …

We may now raise the question as to whether
or not the communication pattern, and hence
the available cues, were different between the
compatible and incompatible conditions. We
may note in Table 4 that for both countries
the frequency of communication addressed to
those rated 1 or 2 is higher for the compatible
condition than for the incompatible condition.
The figures are about equal between the two
conditions for those rated 3 (indifferent). In
the incompatible condition Ss communicated
more to those rated 5, 6, or 7 than in the
compatible condition. In other words, in the
incompatible condition Ss communicated
relatively more to those they like and relatively
less to those they dislike. While the
consistency of the differences between the two
countries lends weight to them, the differences
within each country between the compatibles
and incompatibles are not highly significant.
For the Minneapolis data the difference
between the conditions is significant (p < .01)
for communications addressed to those rated
4 or 5. For the Leiden data the difference
between conditions in frequency of communication
to those rated 1 or 2 is significant
(p < .05). None of the other differences
approaches an adequate level of significance.
Mainly because of the startling consistency
between the two countries, we may conclude
that there is some difference in the nature of
the cues made available by those in different
conditions.

We can make only tentative suggestions as
to why this difference exists between
conditions. It is possible that Ss in the compatible
condition, feeling more secure in their
personal relationships, are more able to
directly express their likes and dislikes toward
others. Thus they are able to react to those
they dislike. The Ss in the incompatible
condition, on the other hand, may feel less
able to express their dislikes directly and so,
whenever possible, prefer to withdraw from
contact with those they dislike and maintain
more contact with those they like. If this
interpretation of the difference between
conditions is correct, we would expect to find
that Ss in the compatible condition show their
dislike when talking to those they rate 1 or 2.
Unfortunately we do not have these data, and
so the interpretation must remain highly
tentative.

In comparing the data from Minneapolis 
with the data from Leiden one is struck by the 
consistently higher numbers of individual 
communications in the Leiden data. This is a 
function of the proportion of the communica- 
tions that Os designated as directed toward the 
group rather than toward specific individuals. 
In Minneapolis, 58 per cent of the communica- 
tions were recorded as directed toward specific 
individuals while in Leiden the comparable 
figure was 75 per cent. This was probably a 
difference in observer practice rather than a 
real difference in behavior of the groups. If we 
correct the Minneapolis or the Leiden data for 
the difference in proportion of communications 
judged to be addressed to specific individuals, 
the difference in absolute frequency disap- 
pears. 
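
The correction is not spelled out in the text. A minimal sketch, assuming a simple proportional rescaling by the two recording rates (58 and 75 per cent), is given below. The input frequency is a hypothetical value chosen for illustration and is not taken from the tables.

```python
# Share of communications recorded as directed to specific individuals.
MPLS_SHARE = 0.58
LEIDEN_SHARE = 0.75


def to_leiden_basis(mpls_mean: float) -> float:
    """Rescale a Minneapolis mean frequency to the Leiden recording rate,
    assuming the two sets of observers differed only in how often they
    assigned a remark to a specific individual rather than to the group."""
    return mpls_mean * LEIDEN_SHARE / MPLS_SHARE


hypothetical_mpls_mean = 2.0  # illustrative value only
print(round(to_leiden_basis(hypothetical_mpls_mean), 2))
```

On this assumption every Minneapolis frequency is multiplied by the same factor, about 1.29, so the pattern across rating categories is unchanged and only the absolute level shifts.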

Perception of the pattern of communication. 
In the preceding section we have demonstrated 
that cues were available from the frequency of 
communication on the basis of which it would 
be possible for an individual S to judge how 
others in the group felt toward him. Before 
these cues could be used, of course, it would be 
necessary to perceive them. We will now 
examine data relevant to the question of 
whether or not these cues were perceived. 

On the questionnaire after the experiment 
Ss were asked to indicate on a scale from 
0 (not at all) to 4 (very much) how much they 
had talked to each of the others in the group 
and also how much each of the others had 
talked to them. We may then examine these 
data to determine if Ss were aware of the 
pattern of communication that existed in the 
group. 

Table 5 presents the data on the relationship 
between Os' records of the discussion and the 
ratings on the questionnaire. 

The data in Table 5 show clearly that Ss
were able to report the pattern of communication
fairly accurately, both as to whom they
talked to and who talked to them. Again the
data are similar between the two conditions
and between the two countries. It is important
to note that, considering the data in a somewhat
different light, Table 5 not only shows
that Ss perceived the pattern of communication,
but also it corroborates the validity of
the observation of direction of communication.


TABLE 5

ACCURACY OF PERCEPTION OF THE COMMUNICATION
PATTERN

                                Rating on Questionnaire of
                                “How Much I Talked to Them”
Observed average frequency of
communication given by
subject in:                     0      1      2      3      4
Compatible Condition (M)        .5     1.4    2…     …      …
Incompatible Condition (M)      .6     1.8    3…     …      …
Compatible Condition (L)        .4     1.7    3…     …      …
Incompatible Condition (L)      .6     2.0    4…     …      …

                                Rating on Questionnaire of
                                “How Much They Talked to Me”
Average observed frequency of
communication received by:      0      1      2      3      4
Compatible Condition (M)        .5     1.4    2.3    4.5    7.0
Incompatible Condition (M)      .5     1.2    3.0    5.6    7.7
Compatible Condition (L)        .5     1.7    4.5    7.1    12.1
Incompatible Condition (L)      1.2    2.4    4.2    7.5    12.3


We may now ask whether or not the amount
of communication directed toward specific
persons, which potentially could give cues for
judging how the other person felt, seems to
have been used in this manner by Ss. Table 6
presents the relation between the average
number of communications received and Ss’
estimates of how well the communicator liked
them.

Inspection of Table 6 shows that Ss in both
conditions and in both countries seem to have
used the available cues. In all instances, the
same nonmonotonic relationship that we
observed in Table 4 holds. The Ss judged those
who talked to them least as being indifferent
toward them. Those whom Ss judged to either
like or dislike them direct communications to
them more frequently than those judged to be
indifferent. In two out of the four comparisons
(two conditions in each of two countries) the
difference in amount of communication
received from those judged to be indifferent
(rating of 3) and those judged to dislike them
(ratings of 1 or 2) is significant (p < .05).
The difference between communication received
from those judged to rate them 3 and
those judged to rate them 6 or 7 is significant
at the .01 level in every instance. We may then
conclude that Ss in both conditions, in both
countries, seem to have used the cues, and we
may then expect to find that they know,
significantly better than chance, how others in
the group feel toward them after the discussion.


TABLE 6

RELATION BETWEEN AVERAGE FREQUENCY OF COMMUNICATION
RECEIVED AND FINAL ESTIMATES OF
HOW WELL OTHERS LIKE THEM

Condition            Final Estimates of Preference
Compatible (M)       …
Incompatible (M)     …
Compatible (L)       …
Incompatible (L)     …

Accuracy of knowledge of preference. We may 
now turn our attention to the question of 
whether or not the accuracy of estimation of 
how others felt about them was better than 
chance. 

For each estimate that Ss made concerning
how others felt about them, we may calculate
the average actual rating of preference
received. If there are no differences among
these average ratings for different estimates,
then we would conclude that Ss had no
knowledge of how others really felt about
them. Table 7 presents the data on accuracy
of estimation of preference at the beginning of
the experiment, that is, before Ss knew each
other or had had much opportunity to interact.


TABLE 7

ACCURACY OF ESTIMATION OF PREFERENCE BEFORE
DISCUSSION

                     Estimate of Others’ Preference
                     for Themselves
Condition            1 or 2    3      4      5      6 or 7
Compatible (M)       5.1       4.7    4.7    4.6    4.9
Incompatible (M)     4.7       4.5    4.7    4.8    5.0
Compatible (L)       4.3       4.3    4.5    4.8    4.9
Incompatible (L)     4.2       4.2    4.5    4.7    5.0

Note. Each figure in the table is the average actual rating
given to those who estimated that that person would rate them a
given way.
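
The tabulation described above can be sketched as a simple grouping of (estimate, actual rating) pairs. This is a hedged illustration rather than the authors' procedure: the pairs below are hypothetical, and the function names are illustrative only. The binning of estimates into "1 or 2" and "6 or 7" follows the column headings of Table 7.

```python
from collections import defaultdict


def bin_label(estimate: int) -> str:
    """Collapse the 7-point estimate into the five columns of Table 7."""
    if estimate <= 2:
        return "1 or 2"
    if estimate >= 6:
        return "6 or 7"
    return str(estimate)


def mean_actual_by_estimate(pairs):
    """pairs: iterable of (estimate, actual_rating), where estimate is what
    an S guessed another member would rate him and actual_rating is what
    that member in fact gave. Returns the mean actual rating per column."""
    buckets = defaultdict(list)
    for estimate, actual in pairs:
        buckets[bin_label(estimate)].append(actual)
    return {label: sum(vals) / len(vals) for label, vals in buckets.items()}


# Hypothetical pairs for illustration only.
example_pairs = [(1, 4), (2, 5), (3, 4), (3, 5), (4, 5), (6, 6), (7, 5)]
example_means = mean_actual_by_estimate(example_pairs)
print(example_means)
```

If Ss had no knowledge of how others felt, the means would be about equal across columns; means that rise with the estimate indicate some accuracy.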

Examination of Table 7 shows that there 
was very little knowledge beforehand of how 
others in the group felt about them. This is of 
course to be expected. The groups were 
composed of people who were strangers to one 
another and, at the time these ratings and 
estimates were made, knew nothing about 
each other except their name, what part of 
the country they came from, and what they
were studying. The Minneapolis data show no
relation between estimates made and actual
preference. The Leiden data do show a slight
relationship. The average ratings tend to go 
up as the estimates increase. The differences 
for both conditions between the average 
ratings for estimates of 3 and estimates of 5 
are significant beyond the .05 level. It is 
possible that in Holland, name, part of 
country, and field of study are more important 
items of information than they are in the 
United States. Such a difference, which it is 
plausible to assume, would account for the 
slight evidence of knowledge in the Leiden 
data before the discussion. 

There are, of course, no differences between 
the compatible and incompatible conditions in
either country. It would indeed be strange if
there were differences since the actual manipulation
which created the distinction between
the conditions was performed after the initial
ratings and initial estimates had already been
made. 

The same kind of analysis of the ratings and
estimates made after the discussion will show
us whether or not Ss learned something during
the discussion about how others felt toward
them. These data are presented in Table 8.


TABLE 8

ACCURACY OF ESTIMATION OF PREFERENCE AFTER THE
DISCUSSION

                     Estimate of Others’ Preference
                     for Themselves
Condition            1 or 2    3      4      5      6 or 7
Compatible (M)       5.1       4.9    5.0    4.9    [illegible]
Incompatible (M)     4.5       4.9    5.1    5.0    [illegible]
Compatible (L)       4.6       4.3    4.5    4.9    [illegible]
Incompatible (L)     4.1       4.5    4.6    4.9    [illegible]

In Table 8 there is clear evidence that Ss in 
the incompatible condition had significant 
knowledge, after the discussion, of how others 
in the group felt about them. This is true for 
both countries. The average rating received 
from those they think dislike them is signifi- 
cantly lower than the average ratings received 
from those they think will rate them 3, 4, or 5. 
These differences are significant at the .05 
level or better in each country. Those that the 
incompatible Ss think rate them 6 or 7 (like 
very much) actually do give them higher 
average ratings. In each country, these 
average ratings are significantly higher than 
any of the others (p < .05). 

The evidence is not nearly as clear that Ss 
in the compatible condition have knowledge 
of how others feel about them. In neither 
country is there any difference among the 
average ratings received from those who Ss 
thought rated them anywhere from 1 (dis- 
likes) through 5 (likes me moderately). Those 
that the compatibles think rated them 6 or 7, 
however, do give them somewhat higher 
ratings. In the Leiden data the differences 
between this average rating received from 
those they think will rate them 6 or 7 and any 
of the other average ratings are significant 
(p < .02). In the Minneapolis data the differences
are smaller and only the difference
between the average ratings from those who
Ss think will rate them 6 or 7 and those they
think will rate them 4 or 5 is significant at the
.05 level. But still, the data for both countries
are extremely similar. The compatibles seem 
to have correct knowledge of how others feel 
about them only for those who they think 
like them very much. The incompatibles, on 
the other hand, also have correct knowledge 
about those they think dislike them. 

The differences between the two conditions, 
while in the same direction for both countries, 
are larger for the Minneapolis data than for 
the Leiden data. In the Minneapolis data the 
difference between the compatible and in- 
compatible conditions in the average rating 
that Ss receive from those they think dislike 
them is significant at the .10 level of confidence; 
between those they think like them very much 
it is significant at the .06 level of confidence.
The conjunction of both these differences is
certainly highly significant. In the Leiden 
data the difference between the compatible 
and incompatible conditions in the average 
rating that they receive from those they think 
dislike them is significant at only the .20 
level of confidence. No other differences 
between conditions approach significance. The 
fact that all the differences are in the same 
direction in both countries, however, lends 
strength to the assertion that the difference is 
a real one. 

In comparing the data between the two 
countries one may note that the ratings made 
by the Minneapolis Ss are consistently higher 
than the ratings made by the Leiden Ss. 
Nothing can be made of this, however, since 
one cannot be certain that the translation of 
the rating scales from English into Dutch 
preserved intact the distances between points 
on the scale or the exact location of each point 
on the scale. 

It will be recalled that in the introduction 
the hypothesis was advanced that Ss in the 
incompatible condition, because of their 
instability and uncertainty about their 
relations with others in the group, would be 
more sensitive to the cues that others were 
giving and would know more accurately how 
others in the group felt about them. This part 
of the hypothesis has, in general, been con- 
firmed by the data. It was also stated in the 
introduction that one would expect Ss in the 
compatible condition, because of their greater 
certainty in the situation, to be more able to 
directly express their feelings toward others. 
If this hypothesis were true we would expect 
to find that Ss knew the feelings of those in the 
compatible condition more accurately than 
they knew the feelings of those in the incom- 
patible condition. This is not borne out by 
the data. There is no difference in accuracy of 
knowledge about the preferences of Ss in the
two conditions. 

On the other hand, in Table 4 we observed 
that there was some difference between the 
two conditions in the nature of the cues 
concerning their preferences for others. Those 
in the compatible condition spoke more to 
those they disliked and less to those they 
liked than did Ss in the incompatible condition. 
These data are still consistent with the 
notion that Ss in the compatible condition
were more able to directly express their
dislikes. However, since no difference was 
found in accuracy of knowledge about Ss in 
the two conditions, it seems necessary to as- 
sume that other, equally good cues were made 
available by Ss in the incompatible condition. 
At any rate, the evidence from the experiment 
gives no direct support to the second part of 
our initial hypothesis. 


SUMMARY AND CONCLUSIONS 


The major findings from the experiment 
which has been described may be summarized 
as follows: 

1. If persons in a group feel that those 
members of the group whom they like best 
dislike each other, this tends to make them 
uncertain and unstable about their inter- 
personal relations in the group. 

2. Both those who are stable and those who 
are unstable about their personal relations 
reveal something about how much they like 
others in how often they talk to them. They 
tend to talk least to those toward whom they 
feel indifferent. 




3. There is some difference between the two
conditions in how sensitive individuals are to
the cues which others give off. Those who feel
unstable about their personal relationships
tend to have more accurate knowledge of how
others feel about them. It is perhaps plausible
to infer that those who are unstable about their
personal relations watch more closely and are
able to make better use of the available cues.

4. The results in both countries, Holland 
and the United States, are remarkably similar. 
The consistency of the results tends to 
strengthen confidence in their significance and 
also tends to show that these particular 
findings are independent of the cultural
differences between these two countries. 


REFERENCES 

1. HEIDER, F. Social perception and phenomenal
causality. Psychol. Rev., 1944, 51, 358-374.

2. HEIDER, F. Attitudes and cognitive organization.

3. HEIDER, F., & SIMMEL, M. An experimental study
of apparent behavior. Amer. J. Psychol., 1944,
57, 243-259.


Received June 10, 1953. 


PROJECTION AND DISPLACEMENT: A CROSS-CULTURAL STUDY
OF FOLK-TALE AGGRESSION¹


GEORGE O. WRIGHT²
Graduate School of Education, Harvard University


THE STUDY of societies reveals many
examples of the use of folk tales and
rituals which are aggressive in content.
In varying degrees this aesthetic aspect of cul-
ture seems to provide a device by which ag- 
gressive feelings can be displayed in fantasy 
without the disruptive effects which otherwise 
would result from their direct expression. This 
fantasy aggression, however, presumably has 
some correspondence with real-life aggression. 
In order to understand folk-tale aggression, 
therefore, we will attempt to discover the re- 
lationship between specific antecedent condi- 
tions and the specific outcomes with which 
they are associated in folk tales. 

Part of the theoretical framework for the 
study of folk-tale aggression grows out of the 
approach-avoidance theory of Miller (1). This 
theory gives us a scheme for appraising the 
joint effects of approach and of avoidance in 
conflict situations. While the experimental 
evidence supporting the approach-avoidance 
theory originally came from work with ani- 
mals, a reasonable elaboration will transform 
its application from psychophysical phenomena 
to those which are psychocultural. 

One of the important elaborations of the 
approach-avoidance theory is the result of the 
work of Whiting (3, 4). Whiting’s hypothesis 
asserts that the intensity of the displaced re- 
sponses, as well as the cultural distance of 
displacement, is a function of anxiety. This 
hypothesis is confirmed in the doll-play inves- 
tigations in which it was found that the dis- 
play of large amounts of aggressive behavior 
in fantasy is connected with a high degree of 
anxiety arising from previous punishment for 
aggression (2). 

1 This material is part of a doctoral dissertation sub-
mitted by the author to the Graduate School of Educa-
tion, Harvard University, 1952. The author wishes to
express his appreciation to Dr. John W. M. Whiting
for his guidance in carrying out the research on which
this paper is based.

2 Now at the Wright Air Development Center,
U. S. Air Force.


Another elaboration of the approach-avoidance
theory is the result of the work of Whiting
and Child (4), in which the original theory
as it applied to displacement is extended to 
projection. This work, which examined the 
relationship between aggression anxiety and 
the theories held by members of the society 
regarding the causes of disease, attested the 
validity of the theory and provided further 
evidence that such a theory, in fact, could be 
useful in understanding cultural phenomena. 
The work of Whiting and Child did not 
independently test displacement and projec- 
tion. This present study, however, does make 
this independent test by analyzing folk-tale 
behavior. The choice of folk tales for study was 
somewhat dictated by the fact that such an 
analysis could be carried out on the basis of 
classifying the behavior of the characters in 
the folk tales as either displacement (qualities 
of the objects of aggression) or projection 
(qualities of the agents of aggression). 


PROCEDURE 


The material used in this study has been developed
from an analysis of folk tales from 33 societies, includ- 
ing selections from Oceania, Asia, Africa, North 
America, and South America. Since the child-training 
data of Whiting and Child (4) were to be used as one 
of the variables, folk tales had to be assembled for the 
societies that were among the 75 used in their study. 
The folk-tale literature of these 75 societies was ex- 
amined, and preliminary selections on the basis of 
available material from 37 societies were made. At the 
outset it was decided not to include origin legends be- 
cause of their lack of comparability with ordinary folk 
tales. For the same reason, those folk tales told ex- 
clusively in the third person with the ethnographer as 
narrator were excluded, since they were likely to be 
overlaid with the cultural bias of the ethnographer. In
consequence of one or the other of these restrictions,
the folk tales from four of the initial selections were
discarded. These societies were the Ainu, Arapesh,
Azande, and the Maori.

In order to provide sufficient data for analysis, twelve 
folk tales were selected from each society. Moreover, 
in order to eliminate the inclusion of fragmentary tales, 
each of the folk tales was required to be a
complete episode. An episode was defined as a sequence
of events having a definite beginning and proceeding
to some definite action climax. 

In order to carry out the study, a scheme permitting 
quantitative analysis of the folk tales was needed. In 
addition, it was necessary that the scheme comprise ob-
jective scoring categories so that the subjective esti-
mates of the investigator would be minimized. These
requirements are met in the scoring scheme used by 
the doll-play investigators. Their scheme was modified 
for use with folk tales. 

Central to the scoring system is the concept of 
interaction units. As the term implies, an interaction
unit is a complete behavior sequence between two 
persons. In order to be complete, the sequence must 
contain at least two action elements. First, a person 
must initiate an action toward some other person. 
Second, this initiated action must be responded to by the 
other person. For example, a mother initiates action by 
calling her child to lunch. The child responds to this 
action by stopping his play and coming into the house. 
The first element we can call initiation; the second, 
response. The person starting the action we can call 
the initiator; the person responding to the action we 
can call the respondent.

Now, since many of the actions are likely to be more 
than simple action-reaction, we have decided on a con- 
vention for complex actions. In our example of the 
mother calling the child to lunch, how will the action 
be scored if the child were to say, “I’m busy, mother, 
and can't come in now"? What is the action of the 
mother to be called if she in turn responds to the reply 
of the child by a threat of spanking? The convention 
adopted is to score the calling of the mother and the 
reply of the child as one interaction unit, with mother 
as initiator and child as respondent. In turn, the reply 
of the child and the threat of the mother are scored as 
the second interaction unit, with the child as initiator 
and the mother as respondent. If the threat of punish- 
ment causes the child to come into the house, this is 
scored as the third interaction unit, with mother as 
initiator and child as respondent. Allowing a reaction 
to be both a response and an initiation solves our 
scoring dilemma. 
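
The convention just described can be sketched in code. This is a minimal illustration only: the class name, the function name, and the tuple format for actions are hypothetical and are not part of the original scoring scheme. Each consecutive pair of actions by two different persons is treated as one interaction unit, so a reaction serves as the response of one unit and the initiation of the next.

```python
from dataclasses import dataclass


@dataclass
class InteractionUnit:
    initiator: str
    initiation: str
    respondent: str
    response: str


def score_interaction_units(actions):
    """Turn an ordered list of (actor, action) pairs into interaction units.

    Assumption: two consecutive acts by the same person do not form a unit,
    since a unit is defined as a sequence between two persons.
    """
    units = []
    for (actor_a, act_a), (actor_b, act_b) in zip(actions, actions[1:]):
        if actor_a == actor_b:
            continue
        units.append(InteractionUnit(actor_a, act_a, actor_b, act_b))
    return units


# The mother-and-child example from the text.
episode = [
    ("mother", "calls child to lunch"),
    ("child", "says he is busy and cannot come in"),
    ("mother", "threatens spanking"),
    ("child", "comes into the house"),
]

units = score_interaction_units(episode)
for number, unit in enumerate(units, start=1):
    print(number, unit.initiator, "->", unit.respondent)
```

Four actions yield three units here, with the roles of initiator and respondent alternating as in the worked example above.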

When the interaction unit is used as the vital 
measure in the scoring system, it is possible to build 
additional classifying measures around this unit. For 
example, it is possible to classify the status of the 
initiator, the status of the respondent, the nature of 
the initiation, and the nature of the response. It is pos- 
sible, moreover, to make comparisons from folk tale to 
folk tale by the use of the quantitative measures. 

The interaction sequences were scored as aggressive 
or nonaggressive; each aggressive act was scored for 
intensity (ranging from a score of 1 for nonaccomplish- 
ment of aggressive intent to a score of 5 for killing); 
and finally, each sequence was classified as to the agent 
and object of the act. 

In order to test the reliability of the scoring system, 
four students at the Wilberforce State College (Ohio) 
were asked to score 36 folk-tale episodes selected at 
random from the collection. These students were trained 
in the use of the system for five hours prior to the 
scoring. An index of reliability was computed as fol-
lows: the sum of the agreements for a pair of the
student scorers was divided by the total number of 
items scored by them; the scores of the writer and of 
the students were randomly paired in all possible 
combinations for each of the scoring categories; the 
resultant index of reliability was 0.83, with 1.0 repre- 
senting perfect agreement. 
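
One way to read the index described above is as a proportion of agreement, averaged over all pairs of scorers. The sketch below follows that reading; averaging across pairs, the function names, and the sample scores are assumptions made for illustration and do not reproduce the study's data.

```python
from itertools import combinations


def agreement_index(scores_a, scores_b):
    """Agreements between two scorers divided by the number of items scored."""
    if len(scores_a) != len(scores_b):
        raise ValueError("both scorers must score the same items")
    agreements = sum(a == b for a, b in zip(scores_a, scores_b))
    return agreements / len(scores_a)


def mean_pairwise_agreement(scorers):
    """Average the agreement index over all possible pairs of scorers."""
    pairs = list(combinations(scorers.values(), 2))
    return sum(agreement_index(a, b) for a, b in pairs) / len(pairs)


# Hypothetical scores for six interaction units (aggressive / nonaggressive).
scorers = {
    "writer":   ["agg", "non", "agg", "agg", "non", "agg"],
    "student1": ["agg", "non", "agg", "non", "non", "agg"],
    "student2": ["agg", "non", "non", "agg", "non", "agg"],
}

reliability = mean_pairwise_agreement(scorers)
print(round(reliability, 2))
```

An index of 1.0 would mean that every pair of scorers agreed on every item.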

The scoring of the folk tales provides a compre-
hensive set of data on the display of aggressive feelings
in folk tales. This is our consequent variable. Data also 
were needed for the child-training practices, our ante- 
cedent variable. For this antecedent variable, the 
child-training data of the study of Whiting and Child 
(4) were used. These data, in the form of ratings on 
seven-point scales, were available for 26 features of 
child-training practices. The details of the rating scale 
for the practice that is relevant to the present study 
are as follows (the numbers indicate the ends of the 
rating scale): 
Severity of Aggression Training 
7—Child severely punished for any form 
of aggression; fighting, disrespect, or dis- 
obedience not permitted. 
1—All but the most severe forms of ag- 
gression permitted; fighting encouraged. 

In analyzing the data, the societies were divided 
into two groups, high and low, on the basis of the rat- 
ings for the child-training practice. The data were then 
analyzed for significance of difference between means. 
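
The analysis plan can be sketched as follows. The text does not state the cutting point on the seven-point scale or the exact form of the test, so the cutoff, the pooled-variance t statistic, the function names, and the sample values below are all assumptions made for illustration.

```python
from math import sqrt
from statistics import mean, variance


def split_high_low(ratings, cutoff):
    """Divide societies into high and low groups on a rating (cutoff assumed)."""
    high = [name for name, rating in ratings.items() if rating >= cutoff]
    low = [name for name, rating in ratings.items() if rating < cutoff]
    return high, low


def pooled_t(x, y):
    """Student t for the difference between two independent means."""
    nx, ny = len(x), len(y)
    pooled_var = ((nx - 1) * variance(x) + (ny - 1) * variance(y)) / (nx + ny - 2)
    standard_error = sqrt(pooled_var * (1 / nx + 1 / ny))
    return (mean(x) - mean(y)) / standard_error, nx + ny - 2


# Hypothetical ratings of severity of aggression training (1 to 7)
# and hypothetical folk-tale scores for six societies.
ratings = {"A": 6, "B": 5, "C": 6, "D": 2, "E": 3, "F": 2}
scores = {"A": 30, "B": 32, "C": 34, "D": 24, "E": 26, "F": 28}

high, low = split_high_low(ratings, cutoff=4)
t_value, df = pooled_t([scores[s] for s in high], [scores[s] for s in low])
print(high, low, round(t_value, 2), df)
```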


THE EFFECT OF PUNISHMENT ANXIETY ON
DISPLACEMENT


The first hypothesis to be tested is that ob- 
jects of displaced aggression are chosen farther 
and farther out on the generalization con- 
tinuum as a function of the amount of aggres- 
sion anxiety. 

This hypothesis is derived from the ap- 
proach-avoidance theory, which considers the 
joint habit strength of aggression and ag- 
gression anxiety to be the algebraic difference 
between the two tendencies. As the degree of 
anxiety is increased, this anxiety function 
shifts the point of maximum response toward 
objects less and less like the original object. 

The approach-avoidance theory specifies 
where on the generalization continuum the 
maximum response is likely to occur, but 
Whiting’s theory specifies also how strong this 
maximum response is likely to be. This theory 
asserts that anxiety is frustrating, that is to
say, it interferes with the initial response, and
that this interference is itself drive-producing.
Consequently, the joint effective habit poten-
tial is a function of the product of the joint
habit strength (i.e., of the difference between
the aggression and the aggression anxiety) and
the strength of frustration as indicated by
aggression anxiety.
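
The two statements above can be illustrated with a small numerical sketch. It assumes linear generalization gradients that are cut off at zero, with the avoidance gradient steeper than the approach gradient as in Miller's conflict model; every numerical value, the five positions on the continuum, and the function names are hypothetical and chosen only to show the direction of the predictions.

```python
# Positions on the similarity dimension, from the point of original response outward.
POSITIONS = ["Hero", "friend", "relative", "acquaintance", "stranger"]


def approach(distance, strength=10.0, slope=1.0):
    """Tendency to aggress, falling off slowly with distance (assumed linear)."""
    return max(0.0, strength - slope * distance)


def avoidance(distance, anxiety, slope=3.0):
    """Aggression anxiety, falling off steeply with distance (assumed linear)."""
    return max(0.0, anxiety - slope * distance)


def joint_habit_strength(distance, anxiety):
    """Algebraic difference between the two tendencies."""
    return approach(distance) - avoidance(distance, anxiety)


def effective_potential(distance, anxiety):
    """Whiting's elaboration: the difference multiplied by the strength of anxiety."""
    return joint_habit_strength(distance, anxiety) * anxiety


def peak(anxiety):
    """Position of maximum response and its effective potential."""
    best = max(range(len(POSITIONS)), key=lambda d: joint_habit_strength(d, anxiety))
    return POSITIONS[best], effective_potential(best, anxiety)


for anxiety in (3, 9, 12):
    print(anxiety, peak(anxiety))
```

Under these assumed gradients, raising the anxiety value moves the point of maximum response outward from friend toward stranger and raises the effective potential at that point, which is the pattern the hypotheses below put to the test.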

In order to translate this hypothesis into
terms that can be tested with the folk-tale
data, the punishment of children for aggression
will be called aggression anxiety; the Hero will
be called the point of original response; the
similarity dimension will be considered as be-
ginning at the Hero and extending through
friends, relatives, and acquaintances to stran-
gers at the least similar position; and finally,
the frequency of aggressive acts involving a 
given category along the dimension of similar- 
ity will be called strength of response. Further, 
we will consider our hypothesis supported if 
the following conditions are found: 

1. In the folk tales of a society in which the 
child-training practice is low severity of pun- 
ishment for aggression, the frequency with 
which friends of the Hero are used as objects 
of aggression should be higher than it is in the 
folk tales from societies in which the practice 
is high severity of punishment for aggression. 
This condition should be found because our 
theory predicts that the lower the anxiety, the 
more like the original object will be the dis- 
placed objects. 

2. Conversely, in the folk tales of societies 
in which the child-training practice is high 
severity of punishment for aggression, the 
frequency of the use of strangers as objects of 
aggression should be higher than in those of 
societies in which the practice is low severity 
of punishment for aggression. This condition 
should be found because our theory predicts 
that the higher the anxiety, the less like the 
original object will be the displaced objects. 

The data used for this analysis were the 
antecedent child-training measures of severity 
of punishment for aggression versus the fre- 
quency of use of the friends of the Hero as 
objects of aggression. The scores of frequency 
of use of friends as objects of aggression were 
obtained by counting the number of times the 
friend is used as the object of aggression in 
the folk tales, and dividing this frequency by 
the total aggressive acts. 
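
The frequency score described above can be sketched as a simple proportion. Expressing it as a percentage is an assumption, as are the record layout, the function name, and the sample acts, which are hypothetical.

```python
def object_frequency(acts, category, as_percent=True):
    """Share of aggressive acts in which `category` is the object of aggression."""
    aggressive = [act for act in acts if act["aggressive"]]
    if not aggressive:
        return 0.0
    count = sum(1 for act in aggressive if act["object"] == category)
    share = count / len(aggressive)
    return 100.0 * share if as_percent else share


# Hypothetical scored acts from the folk tales of one society.
acts = [
    {"aggressive": True, "agent": "stranger", "object": "hero"},
    {"aggressive": True, "agent": "hero", "object": "stranger"},
    {"aggressive": True, "agent": "stranger", "object": "friend"},
    {"aggressive": True, "agent": "hero", "object": "stranger"},
    {"aggressive": True, "agent": "relative", "object": "hero"},
    {"aggressive": True, "agent": "stranger", "object": "hero"},
    {"aggressive": True, "agent": "friend", "object": "stranger"},
    {"aggressive": True, "agent": "hero", "object": "relative"},
    {"aggressive": False, "agent": "hero", "object": "friend"},
]

print(object_frequency(acts, "friend"))
```

The nonaggressive act is excluded from both the count and the total, so only aggressive acts enter the score.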

Analysis of the data indicates that for the 
group of societies in which aggression anxiety 
is low (low severity of punishment for ag- 
gression), the mean frequency of the use of 
friends of the Hero as the object of aggression 
is 13.3, whereas the mean of the group of 
societies in which aggression anxiety is high
(high severity of punishment for aggression) is
8.67. The difference is significant at the .005
level (obtained t = 3.8).³ The results of this
and the following tests are shown in Table 1. 

3 In all of the results reported, only one tail of the
t distribution is used, since all of the results are in the
predicted direction.


TABLE 1
SUMMARY OF FINDINGS

                            Aggression Anxiety
Mean Frequencies of         High       Low
                          (N = 12)   (N = 14)      t       p

Agent of aggression
  Hero as agent             22.3      23.0       0.21    .25
  Friend as agent           15.33     18.54      1.18    .13
  Stranger as agent         30.5      25.43      2.46    .01
Object of aggression
  Hero as object            27.33     29.43      0.47    .30
  Friend as object           8.67     13.3       3.8     .005
  Stranger as object        31.66     26.58      2.21    .015
Intensity of aggression     19.58     15.7       2.02    .035
Hero triumphant              2.14      4.85      2.86    .005

Note. This table shows mean scores for frequencies of various
categories of agents and objects of aggression, for intensity of
aggression, and for fate of the hero in the folk tales. (Societies are
dichotomized on the basis of two degrees of aggression anxiety,
high and low.)


The use of the measures for the friends of
the Hero as the object of aggression instead of
the Hero himself is a question that naturally
arises. In applying the test of the hypothesis,
it was felt that the Hero as the object of ag-
gression actually represented self-aggression,
and that this fact might introduce this effect
as an uncontrolled variable. Consequently, the
objects considered closest to the Hero, namely,
his friends, were chosen.
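
The one-tailed significance levels quoted in this section can be checked approximately from the obtained t values. The sketch below integrates the Student t density numerically. The degrees of freedom (24, taken from the group sizes of 12 and 14 shown in Table 1) and the function names are assumptions, not details stated in the text.

```python
from math import gamma, pi, sqrt


def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    constant = gamma((df + 1) / 2) / (sqrt(df * pi) * gamma(df / 2))
    return constant * (1 + x * x / df) ** (-(df + 1) / 2)


def one_tailed_p(t, df, steps=2000):
    """Upper-tail probability for t >= 0, using Simpson's rule on [0, t]."""
    if t < 0:
        raise ValueError("t must be non-negative for the upper tail")
    if t == 0:
        return 0.5
    h = t / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(t, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 - total * h / 3


DF = 12 + 14 - 2  # assumed from the two group sizes
for t_value in (3.8, 2.46, 2.21, 0.47):
    print(t_value, round(one_tailed_p(t_value, DF), 4))
```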

In order, however, to provide information 
that might be useful in refining the hypothesis, 
the analysis of the data on the Hero was carried 
out. In this analysis, the means of the frequency 
of use of the Hero as object of aggression 
were compared for the high aggression-anxiety 
group of societies and the low aggression- 
anxiety group of societies. The difference is 
not significant; that is to say, the difference
in the means which we observed is likely to
have arisen by chance. The obtained t is 0.47;
the significance level is approximately .30. 

The test of the converse condition, that of 
comparing the frequency of the use of stran- 
gers as the object of aggression under the 
conditions of high aggression anxiety and of 
low aggression anxiety, discloses that the 
mean of the frequency of use of strangers as 
the object of aggression for societies in which 
aggression anxiety is high is higher than in 
societies in which aggression anxiety is low. 
The means are respectively 31.66 and 26.58. 
The difference between the means is significant 
at approximately the .02 level (obtained
t = 2.21). Thus, in sum, the societies in which
children are severely punished for aggression,
as compared to the mild societies, represent
friends of the Hero less frequently and stran-
gers more frequently as objects of aggression,
as would be expected from the displacement
hypothesis.


THE EFFECT OF PUNISHMENT ANXIETY ON
PROJECTION


The second hypothesis to be tested is that 
the agents of projected aggression are chosen 
farther and farther out on the generalization 
continuum as a function of the amount of 
aggression anxiety. 

In order for our theory to be consistent, the 
findings for displacement should have their 
counterpart in projection. The child-training 
measures of punishment for aggression, used as
indices of aggression anxiety, should be related 
to projection in such a way that the folk tales 
of societies in which punishment for aggression 
is severe should display more aggression per- 
formance that uses the stranger as agent, since 
the tendency to express aggression under these 
conditions should be projected far out on the 
generalization continuum. Stated another way, 
high aggression anxiety should be associated
with a more frequent use of the stranger as ag-
gressive agent.

The folk-tale data translated into terms of 
our theory are analyzed, using severity of 
punishment for aggression as the degree of 
anxiety about aggression, and the frequency 
of the use of various categories as agents of 
aggression; these were considered an index of 
the strength of the joint effective habit of ag- 
gression and aggression anxiety. We will con- 
sider our hypothesis as supported if the 
following conditions are found: 

1. In the folk tales of societies in which the 
child-training practice is low severity of pun- 
ishment for aggression, the frequency of use of 
the Hero as agent of aggression should be 
higher than in those of societies in which the 
practice is high severity of punishment for 
aggression. This condition should be found be- 
cause our theory states that the lower the 
anxiety, the more like the original object will 
be the projected agents. This condition should 
also be found for the use of friends of the Hero 
as agents of aggression. 

2. In the folk tales of societies in which the 
child-training practice is high severity of 
punishment for aggression, the frequency of 
use of strangers as agents of aggression should 
be higher than in folk tales from societies that
practice low severity of punishment for ag-
gression. This condition should be found be-
cause our theory states that the higher the
anxiety, the less like the original object will
be the projected agents.

The data used for this analysis were the
antecedent child-training measures of severity
of punishment for aggression versus the fre-
quency of the use of the Hero as agent of
aggression. The scores of frequency of use of
the Hero as agent of aggression were obtained
by counting the times the Hero was used as
agent of aggression in the folk tales, and
dividing this by the total aggressive acts.

Analysis of the data indicates that the mean
frequency of use of Hero as agent of aggression
in the high aggression-anxiety societies is
22.3, whereas it is 23.0 in the low aggression-
anxiety societies. The difference between the
means, though in the predicted direction, is
not significant (p is approximately .25; ob-
tained t = 0.21).

Analysis of the data for the use of friends
of the Hero as agent of aggression, using a
procedure similar to that for the Hero, indi-
cates that the difference between the mean
frequency of use of friends as agents of ag-
gression is also in the predicted direction, but
is not significant (p is approximately .13;
obtained t = 1.18).

In the converse condition, namely, high
aggression anxiety accompanied by the more
frequent use of the stranger as agent of ag-
gression, analysis of the data indicates that
for the group of societies in which aggression
anxiety is high, the mean frequency of use of
stranger as agent of aggression is higher than
that of the group of societies in which ag-
gression anxiety is low. The mean frequencies
are, respectively, 30.5 and 25.43. The difference
between the means is in the predicted direction
and is significant at the .01 level (obtained
t = 2.46).

The findings discussed here tend to support
the theorems and to suggest that the individual
may, in fact, use psychologically b[...]
ways to express his aggression in the presence
of aggression anxiety. It is suggested, more-
over, that the individual apparently chooses
harmless agents and objects of aggression on
the basis of their categorical difference from
the agents and objects associated with the
initial anxiety. The findings also suggest that
two qualifications are essential in the inter-
pretation of these data. Both relate to the dis-
crete placement of the cultural categories
along the dimension of similarity. 

The first qualification is that the present 
theory, which relates anxiety to projection 
and displacement, is not refined enough to 
predict the characteristics and the behavior of 
the Hero in folk tales as a function of the anx- 
iety of the society. The explanation proposed 
is that the position which the Hero occupies 
on the generalization continuum is theoreti- 
cally influenced by the transition between real 
life and fantasy and that more things must be 
known about the influence of complex psy- 
chological factors in this transition area before 
our predictions will gain accuracy. 

The other qualification is that a precise 
placement of the intermediate categories of 
relatives and acquaintances cannot be made 
with the use of our presently developed theory. 
The explanation of this condition probably 
lies in the complexity of meaning which various 
societies attach to cultural categories. It is
possible, for example, that in one society the
similarity of relatives to strangers might be
perceived to be greater than between ac-
quaintances and strangers, while in another
society the opposite might be true.


THE EFFECT OF AGGRESSION ANXIETY ON
INTENSITY OF AGGRESSION


Another aspect of the relationship of ag- 
gression anxiety to fantasy aggression is of 
interest, namely, the apparent influence of the 
strength of anxiety on the intensity of aggres-
sion in folk tales. Our theory suggests that the
aggression anxiety will diminish at a more
rapid rate than the intensity of the aggression
as displacements and projections are made 
farther and farther out on the generalization 
continuum. We should be able to foretell from 
this theory that fantasy, since it represents 
situations in many ways dissimilar from real 
life, should provide a vehicle for the expression 
of aggression, and should do so with an inten- 
sity which is directly related to the amount of 
real-life punishment for aggression. 

An examination of this theory is made in 
the third hypothesis to be tested, namely, that 
punishment for aggression increases the in- 
tensity of aggression in situations which are 
distinctly dissimilar from those in which the 
punishment occurs. 

This hypothesis, translated into terms which
are applicable to the folk-tale data, states that
the punishment of aggression leads to aggres- 
sion anxiety. This anxiety reduces the display 
of aggressive behavior toward objects similar 
to those associated with the initial punish- 
ment. As we have seen, in such situations 
displacement and projection of aggression are 
likely to occur. Moreover, the strength of the 
joint effective habit of the aggression and of 
the aggression avoidance is thought to be the 
product of the difference between the two 
tendencies and the degree of avoidance. We 
should expect, therefore, that the greater the 
anxiety, the greater the intensity of the fan- 
tasy aggression. 

Our hypothesis will be supported if it is 
found that the folk tales of societies in which 
high aggression anxiety is generated by severe 
punishment for aggression show a greater in- 
tensity of aggression than exists in the folk 
tales of societies in which low aggression anx- 
iety is generated. 

The data used for this analysis were the 
same child-training measures of severity of 
punishment for aggression used in the previous
analyses, here related to the intensity of the folk-tale
aggression. The scores for intensity were ob- 
tained by taking the sum of the intensity 
measures for each act of aggression in the folk 
tales of a given society and dividing this sum 
by the total acts of aggression in the folk tales. 
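
The intensity score described above is the sum of the intensity ratings divided by the number of aggressive acts. The sketch below follows that description literally, using the 1-to-5 intensity scale given in the Procedure section; the function name and the sample ratings are hypothetical, and any further scaling behind the reported group means is not specified and is not modeled here.

```python
def intensity_score(intensities):
    """Sum of intensity ratings divided by the number of aggressive acts."""
    if not intensities:
        raise ValueError("at least one aggressive act is required")
    for value in intensities:
        if not 1 <= value <= 5:
            raise ValueError("intensity ratings run from 1 to 5")
    return sum(intensities) / len(intensities)


# Hypothetical ratings for the aggressive acts in one society's folk tales.
ratings_for_society = [1, 3, 5, 5]
print(intensity_score(ratings_for_society))
```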

An analysis of the data indicates that the 
mean of the intensity of aggression for the 
group of societies practicing high severity of 
punishment for aggression is 19.58, whereas 
the mean for the group practicing low severity 
of punishment for aggression is 15.7. The 
difference between the means is in the pre-
dicted direction and is significant at approxi-
mately the .03 level (obtained t = 2.02).


THE EFFECT OF AGGRESSION ANXIETY ON THE
FATE OF THE HERO


There is one additional analysis by which 
information may be disclosed about the fan- 
tasy structuring of the aggression in folk tales, 
namely, the relationship between aggression 
anxiety and the fate of the Hero in the folk 
tales. Actually, such an analysis provides an- 
other test of the kind and amount of projection 
and displacement which develop as the con-
sequent variable of aggression anxiety. The
fate of the Hero, moreover, is likely to be a 
more sensitive measure of the expression of
fantasy aggression than those involving sec-
ondary characters.

An examination of this theoretical position
is made in the fourth hypothesis to be tested,
namely, that the Hero should triumph less 
often in folk tales of societies in which the 
child-training practices include severe punish- 
ment for aggression. This hypothesis is a re- 
statement of the second hypothesis that high 
aggression anxiety results in projection far out 
from the Hero on the generalization con- 
tinuum. This theorem is built on the assump- 
tion that the behavior of the Hero is under- 
standable as a projection of the individual. 
Consequently, in cases of high aggression anx- 
iety, it is to be expected that having the Hero 
triumphant is likely to be accompanied by 
large amounts of anxiety, so that triumphant 
characters of the folk tales are likely to be less 
like the Hero. In short, a small degree of tri- 
umphant-Hero aggression should accompany 
high aggression anxiety. 

The data used for this analysis were the 
child-training data of severity of punishment 
for aggression versus the frequency of Hero 
triumphant in the outcomes of the episodes. 

An analysis of the data for the fate of the 
Hero shows that the Hero is triumphant less 
frequently in the high aggression anxiety group 
of societies. The means are 2.14 and 4.85, re-
spectively. The difference between the means is
significant at the .005 level (obtained t = 2.86).

This finding is in accord with the earlier re- 
sults in which it was seen that a comparable 
relationship was true, namely, that of the more 
frequent use of the stranger as agent of ag- 
gression in folk tales of societies in which ag- 
gression anxiety is high. In both cases, the 
theoretical assumptions underlying the func- 
tioning of the phenomenon of generalization in 
conflict are found to be valid. 


SUMMARY 


1. The analysis of the folk-tale data supports 
the essential features of the theory which was 
developed to supply insight into the conse- 
quences and outcomes of action and behavior 
in folk tales. The approach-avoidance theory 
of Miller, restated in terms of Whiting’s modi- 
fication of it for the influence of conflict-pro- 
duced drives, provides a coherent basis for the 
explanation of the phenomena of projection and
displacement in folk tales. Admittedly, the pre-
dictive powers of such a theory are not perfect,
but the results obtained for the tests of the
hypotheses are all in the predicted direction.

2. In general, it is found that high punish-
ment anxiety, when it is connected with the
expression of aggression, leads to the kind of
folk-tale behavior in which the objects and
agents of aggression are far out on the generali-
zation continuum and are least like the Hero.
In addition, the Hero in these folk tales is not
likely to be triumphant in the outcomes.

3. In general, it is found that low punish-
ment anxiety, when it is connected with the
expression of aggression, leads to the kind of
folk-tale behavior in which the objects and
agents of aggression are chosen close to the
Hero on the generalization continuum. In
addition, the Hero is likely to be triumphant
in the outcomes.

4. High punishment anxiety, when it is con-
nected with the expression of aggression, leads
to the display of a more intense kind of ag-
gressive behavior in folk tales than is true for
low punishment anxiety.

5. Finally, the study demonstrates that
folk tales as fantasy material are fruitful for
analysis. Enough consistency of trends has
been found in the present study to suggest
that the relationships of child-training prac-
tices and the expression of aggression in folk
tales are real and not accidental or spurious.
This consistency and its value are somewhat
enhanced by the fact that the variables of
child-training practices used in this analysis
are data from an independent investigation
which were collected and analyzed by workers
who were not directly involved in the scoring
and analysis of the present folk-tale study.


REFERENCES 


1. DOLLARD, J., & MILLER, N. E. Personality and
psychotherapy. New York: McGraw-Hill, 1950.

2. HOLLENBERG, ELEANOR, & SPERRY, MARGARET.
Some antecedents of aggression and effects of
frustration in doll play. Personality, 1951, 1, 32-43.

3. SEARS, R. R. A theoretical framework for per-
sonality and social behavior. Amer. Psychologist,
1951, 6, 476-483.

4. WHITING, J. W. M., & CHILD, I. L. Child training
and personality. New Haven: Yale Univer. Press,
1953.


Received June 22, 1953. 


A MOVEMENT INTERPRETATION OF THREAT¹


ALVIN W. LANDFIELD 
Purdue University 


THE PRESENT investigation of threat was
carried out within a theoretical framework
recently advanced by Kelly (2).
The independent and dependent variables, the
relationship between them, and the method-
ology employed in testing the experimental
hypotheses are derivable from or are consistent
with the following theoretical statements of the
psychology of personal constructs.



A person’s psychological processes evolve toward what 
he construes to be an optimal anticipation of events. 
Definitions: Processes— . .. the subject of psychology 
is assumed at the outset to be a process. Evolve—... 
the patterns of psychological processes undergo pro- 
gressive change in themselves as they function. Toward 
—Directionality of processes is built into the funda- 
mental postulate, What he construes—The direction- 
ality of processes is assumed to be an essential function 
of prior outcomes of those same processes. The person 
himself sets the objectives and he does it, just as the 
psychologist must, by construing, by attributing sig- 
nificance, by framing events with meaning at some 
minimal level. Anticipation—The direction of mental 
processes is established as prediction. This is an extra- 
polation to the domain of the individual of the com- 
monly accepted notion regarding the aim of science 
being prediction and control. Optimal—...some sort 
of reconciliation of potentially contradictory anticipa- 
tions must be assumed as taking place in the individual 
... (2, pp. 29-31).


Threat is the awareness of imminent compre- 
hensive change in oneself. Definitions: Compre- 
hensive change—Employed in the context of 
threat, it refers to a modification in the 
directionality of evolving psychological proc- 
esses. Restated, it refers to the change in the 
direction of change. Not all psychological 
change implies a modification in directionality. 

Self-movement, the independent variable, 
was chosen as one measure of an individual’s 
evolving psychological processes. Self-move- 
ment refers to the direction of change in self- 
conceptualization, inferred from the different 
ways an individual describes himself (a) as he
was in the past, (b) as he is now, (c) as he 
would like to be in the future. The perception 
of threatening and nonthreatening acquaint-
ances, in terms of roles and expectancies,
constituted the dependent variable. Defi-
nitions. Role: the way an individual rates
another person. Expectancy: the way an
individual thinks another person might rate
him. Threatening acquaintance: one whom
an individual wishes to avoid or whose
behavior he would like to modify greatly.
Nonthreatening acquaintance: one with whom
an individual wishes to have contact and whose
behavior he would not care to modify greatly.
The description of acquaintances in terms of
roles and expectancies is consistent with our
theoretical framework. Roles and expectancies
are common ways of anticipating or predicting
other people.


1 This paper is based upon a doctoral dissertation
completed in 1951 at The Ohio State University under
the direction of Dr. George A. Kelly.


Since we have employed self-movement as 
one measure of evolving psychological proc- 
esses, threat may be conceptualized as an 
imminent change in the directionality of self- 
movement. Therefore, according to our 
theory, the threatening acquaintance described 
in terms of roles and expectancies is threatening 
because interaction with him might change the 
directionality of self-movement. Now if threat 
is related to the change in direction of self- 
movement, what kinds of roles and expect- 
ancies might lead to such a change? Our 
hypothetical answer to this question may be 
stated as follows: The threatening other 
person will be perceived by an individual as 
being more like himself as he was in the past, 
or is now but no longer wants to be, than the 
nonthreatening person. Also, the threatening 
other person will be seen by an individual as 
expecting him to be more like he used to be, or 
is now but no longer wants to be, than the non- 
threatening person. The latter formulation, 
known as the expectancy hypothesis, may be 
readily illustrated. The adolescent is threat- 
ened by his parents, not merely because he feels 
he is sometimes perceived as a child, but, be- 
cause the parents seem to perceive him as a 
child, he might begin acting like a child. Because 
the parental expectancy is so plausible, the 
directionality of his self-movement toward 
maturity could easily be reversed. 

If we can show that a relationship exists
between the axes along which an individual
perceives himself changing and his perceptions 
of threatening people, then we may conclude 
that our findings tend to support the more 
comprehensive formulation of threat within the 
psychology of personal constructs. 


METHOD 


A social concept test (1) and rating scales derived 
from this social concept test were employed as research 
tools. In the social concept technique, a subject (S) 
writes the names of certain specified persons he knows, 
30 in all, on separate cards. These cards, a name on 
each one, are then dealt to S three at a time, in certain 
combinations. As three cards are dealt, S is asked, “In
what outstanding way are two of these persons similar
and different from the third?” From S's responses to
this question, rating scales may be created, scales which 
are derived from S’s descriptions of these people and are 
expressed in his own terminology. For example, S may 
state that two of the persons are “gregarious” and the 
other person is “not gregarious.” The terms “gre- 
garious” and “not gregarious” are placed at either end 
of a nine-point rating scale. It should be emphasized at 
this point that Ss do not always give opposite concepts 
in expressing a similarity and difference between three 
people. To insure opposite concepts (i. e., opposite to 
S), the examiner, when in doubt, simply asks whether 
or not the two descriptions are opposites. If the descrip- 
tions are not in opposition for S, he is asked to state 
their opposites. 

The S is then asked to place two or three persons 
with whom he is acquainted at points on each scale, 
depending upon whether he sees the scale as dichoto- 
mous or dimensional. Whenever S feels he can use only 
the poles of his scale, it is considered to be dichotomous 
and only the poles of the scale are used for ratings. Two 
referent persons then are placed on the scale, one at 
each end of the nine-point scale, and scale points 2 
through 8 are crossed out. Whenever S perceives people 
distributed across his scale, it is considered dimensional, 
and all points of the scale may be used for the ratings.
Three referent persons then are placed on the scale, one
at each end of the nine-point scale and one in the middle.
In other words, scale points are represented by people.
(See Fig. 1.)

Both exploratory and validation designs were em- 
ployed. Fifteen adult Ss were used in the exploratory 
phase of the study. Fifty-four volunteer undergraduate 
male and female Ss, aged 18 through 24, drawn from 
economics and psychology classes at The Ohio State 
University, were used in the validation study. 

The social concept test was administered individually
during one experimental session to each S. The 30 names
used in this technique were specified by the experimenter
(E) in order to insure adequate sampling of
persons with whom an S is acquainted. For example,
the following people were specified: teacher you like,
mother, father, sister nearest your age or person like
a sister to you, person with whom you would not care
to go on a vacation, etc.

The rating scales derived from S’s responses to the 
concept test were administered individually during a 


ALvIN W. LANDFIELD 


second experimental session, at which time S was asked
to: (a) rate himself on his own scales as he would like
to be six years from now, if it were possible for him to be
anything he wished; (b) rate himself as he feels he is
now; (c) rate himself as he feels he was at some period
during the past six years when he was most different
from the way he is now; (d) think of a threatening and
a nonthreatening person and rate each of them in terms
of roles and expectancies. Threatening and nonthreatening
persons were of the same sex as S.

As can be seen in Fig. 1, all ratings were done on a
rating sheet and the rating scale labels were placed on
a separate overlay sheet. The S places the right-hand
edge of the first overlay sheet to the left of the first
column, and using the name designations as reference
points, rates the first six scales by encircling the proper
numbers in the first column on the rating sheet. The S
then takes the second overlay sheet with a different
six scales and the second rating sheet and continues
rating in this first column. After all scales have been
rated in the first column, S takes the first overlay sheet
again, moves it one column to the right to the second
column, thereby covering over the first column, and
proceeds as before. This procedure is continued until S
has completed all columns.


SELF-MOVEMENT 
Employing the role and expectancy descrip- 
tions of the threatening and nonthreatening 
persons as dependent variables, we attempted 
to predict these descriptions from the
directionality of self-movements, inferred from the
change in self-description from past, to present, 


[Fig. 1. Rating Forms: sample rating sheet and sample overlay sheet, the
overlay showing each concept description and the acquaintances placed on
the scale as rating references]

A Movement INTERPRETATION OF THREAT 


[Fig. 2. Typical Movements Inferred from Past, Present, and Future
Self-Ratings; the legend distinguishes down movements from up movements]
to future. A self-movement is inferred to have 
occurred within a single concept scale when 
there is a numerical difference between the 
past and present or present and future ideal 
self-ratings. It should be noted that the 
movement shown in Fig. 1 is from right to 
left. Our use of the overlay sheet necessitated 
this arrangement. 

Self-movement may occur between the past 
and the present, directionality up or down; 
between the present and the future ideal, 
directionality up or down; and between the 
past and future ideal, directionality up, down 
or up and down (i.e., the movement between 
both past and present and present and future 
ideal may be up. It also may be down. Then 
again, one movement may be up and the other 
down. The latter is called bidirectional 
movement). (See Fig. 2.) 

The independent variable more specifically
was a unidirectional self-movement within a
concept scale. Where the movement from past
to present to future ideal is unidirectional,
prediction simply is made from the directionality
of this unidirectional movement.
(See Fig. 3.)

[Fig. 3. Unidirectional Movements Inferred from Past, Present, and
Future Self-Ratings]

Where the movement from past to present
to future ideal is bidirectional, prediction is
made from the longest unidirectional movement.
(The frequency with which bidirectional
movement occurred was small. Movement
usually occurred only from past to present,
only from present to future, or in the same
direction from past to present to future.) (See
Fig. 4.)

[Fig. 4. Bidirectional Movements Inferred from Past, Present, and
Future Self-Ratings; the panels mark the longest unidirectional movement
and the last movement]

In the few cases in which the two opposite
movements are the same in length, prediction
is made from the last movement (Fig. 4).
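The prediction rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name is hypothetical, ratings are taken to be the numbers 1 through 9 on a single concept scale, and "up" is taken to mean a numerically increasing rating, which the text does not specify.

```python
def predicted_direction(past, present, future):
    """Return +1 (up), -1 (down), or 0 (no movement) for one concept scale.

    Assumption: "up" means a numerically larger rating.
    """
    first = present - past      # movement from past to present
    last = future - present     # movement from present to future ideal
    if first == 0 and last == 0:
        return 0                # no self-movement, so no prediction
    if first * last >= 0:
        # Unidirectional movement (one of the two steps may be zero).
        return 1 if (first + last) > 0 else -1
    # Bidirectional movement: use the longer step; if the two steps are
    # equal in length, use the last movement.
    chosen = first if abs(first) > abs(last) else last
    return 1 if chosen > 0 else -1


print(predicted_direction(2, 5, 8))   # unidirectional
print(predicted_direction(2, 7, 5))   # bidirectional, first step longer
print(predicted_direction(4, 6, 4))   # bidirectional, equal lengths
```

The three calls cover the unidirectional case, the bidirectional case settled by the longest movement, and the tie settled by the last movement.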

As we previously stated, the threatening and 
nonthreatening other persons were described in 
terms of roles and expectancies. These de- 
scriptions, expressed as numerical ratings, were 
predicted from the direction of a unidirectional 
self-movement, relative to each concept scale.
In other words, a set of predictions was made
for every concept scale within an S’s scale 
repertoire in which movement occurred. 


EXPERIMENTAL HYPOTHESES 


Role hypothesis. The Ss will attribute the
role rating nearest the end of the concept
scale toward which the unidirectional self-movement
is pointing more often to the nonthreatening
person than to the threatening
person.

Expectancy hypothesis. The Ss will attribute
the expectancy rating nearest the end of the
concept scale toward which the unidirectional
self-movement is pointing more often to the
nonthreatening person than to the threatening
person.

The following example may clarify these
hypotheses. An S has created 15 concept
scales within which self-movement may be
inferred. If the direction of movement successfully
predicts the role ratings in ten scales
and fails to predict the role ratings in five
scales, it may be inferred that successful
predictions exceed unsuccessful predictions,
and accordingly, the S conforms to the hypothesis.
However, a significant number of
Ss must follow the same pattern before the role
hypothesis is supported.
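The scoring in this example can be sketched as follows. The function names are hypothetical, a direction of +1 is assumed to point toward the numerically high end of a scale, and scales on which the two persons receive the same rating are simply skipped, a case the text does not discuss.

```python
def scale_outcome(direction, nonthreat_rating, threat_rating):
    """True = successful prediction, False = unsuccessful, None = not scored."""
    if direction == 0 or nonthreat_rating == threat_rating:
        return None  # assumption: ties and scales without movement are skipped
    # Success when the nonthreatening person lies nearer the end of the
    # scale toward which the self-movement points.
    return (nonthreat_rating - threat_rating) * direction > 0


def classify_subject(scales):
    """scales: iterable of (direction, nonthreat_rating, threat_rating)."""
    outcomes = [scale_outcome(*scale) for scale in scales]
    successes = sum(1 for o in outcomes if o is True)
    failures = sum(1 for o in outcomes if o is False)
    if successes > failures:
        return "conforms"
    if successes < failures:
        return "does not conform"
    return "tie"


# Fifteen scales: ten successful predictions and five unsuccessful ones.
example = [(+1, 7, 3)] * 10 + [(+1, 2, 6)] * 5
print(classify_subject(example))
```

A subject is counted once, as conforming, not conforming, or tied, which is the unit the chi-square analysis in the Results then works with.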


RESULTS 


Forty-eight Ss attributed the role rating 
nearest the movement-designated end of the 
scale more often to the nonthreatening person. 
Three subjects attributed the role rating more 
often to the threatening person. In three cases 
successful predictions equaled unsuccessful 
predictions. These latter cases were handled 
statistically by arbitrarily placing one and one- 
half cases with the successful prediction group 
and one and one-half cases with the un- 
successful prediction group. Chi square, with a 
50:50 theoretical frequency and 1 degree of 
freedom, was employed in the analysis. The 
statistical results (χ² = 37.5) allow the null
hypothesis to be rejected at better than the 
.001 level of confidence. 
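The reported statistic can be checked with a short calculation. The sketch below assumes the ordinary goodness-of-fit chi square against an even split, with tied cases divided equally between the two groups as the text describes; the function name is illustrative.

```python
def chi_square_even_split(successes, failures, ties=0):
    """Chi square (1 df) against a 50:50 expectation, splitting ties evenly."""
    a = successes + ties / 2
    b = failures + ties / 2
    expected = (a + b) / 2
    return ((a - expected) ** 2 + (b - expected) ** 2) / expected


# Role hypothesis: 48 conforming Ss, 3 nonconforming Ss, 3 ties.
role_chi_square = chi_square_even_split(48, 3, ties=3)
print(round(role_chi_square, 1))
```

With 54 Ss the expected frequency is 27 in each group, and the observed frequencies after splitting the ties are 49.5 and 4.5.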

Forty-nine Ss attributed the expectancy 
rating nearest the movement-designated end of 
the scale more often to the nonthreatening 
person. One S attributed the expectancy rating 
more often to the threatening person. In four 
cases, successful predictions equaled un- 
successful predictions. These latter cases were 
handled statistically by arbitrarily placing two 
cases with the successful prediction group and 
two cases with the unsuccessful prediction 
group. Chi square again was employed. The 
statistical results (χ² = 42.7) allow the null
hypothesis to be rejected at better than the
.001 level of confidence.


SUMMARY AND CONCLUSIONS


The results of this study support the
hypothesis that a highly significant relationship
exists between the axes along which an
individual perceives himself changing and his
perceptions of threatening people. More
explicitly stated, the threatening person is
perceived by an individual as being more like
himself as he was in the past, or is now but
no longer wants to be, than the nonthreatening
person. Also, the threatening person is
perceived by an individual as expecting him to
be more like he used to be, or is now but no
longer wants to be, than the nonthreatening
person. Within our theoretical framework,
these findings may be interpreted further to
mean that the threatening person is placed by
the threatened individual in the unique
position of being able to reverse the individual's
direction of movement, to interrupt his
movement, or at least to place him under
additional pressure to resist such modification
of his movement.


REFERENCES 


1. Hunt, D. E. Studies in role concept repertory: con- 
ceptual consistency. Unpublished master's thesis, 
Ohio State Univer., 1951. 

2. Kelly, G. A. The psychological construction of life.
Unpublished manuscript, Ohio State Univer.,
1951.


Received June 22, 1953. 


THE COLOR-FORM RESPONSES OF NORMAL, PSYCHOTIC, 
AND NEUROTIC SUBJECTS 


J. D. KEEHN¹
Maudsley Hospital, London 


INVESTIGATIONS into the color-form attitudes
of various special and clinical
groups have been reported by several
writers since Kuhlmann's (17) study of defectives
in 1904. In addition to the large number
of studies of normal adults and children (see
Lindberg [18], Eysenck [7, 8], for reviews),
there have been investigations of the color-form
reactions of defectives (2), organics (11),
spastic children (6), epileptics (1), psychotics
(3, 4, 9, 15, 18), and neurotic patients (5, 7, 19).

Altogether some seven different types of 
color-form tests have been employed by these 
workers. Time of sorting by color or form was
used by Kuhlmann (17) and by Eysenck (9).
Kibler used differential memory of tachisto-
scopically exposed colored shapes, and Braat 
(3) instructed his subjects (Ss) to match a 
given figure with a set of figures shown tachis- 
toscopically. Grouping of symbols by color 
or shape formed the basis of tests employed by 
Goldstein and Scheerer (11), Cotton (6) and, 
in a modified form, by Eysenck (7). A test 
involving the ranking first of a set of colors, 
then a set of shapes, and third a selection of 
colored shapes—such that the most preferred 
shape appeared in the least preferred color— 
was given to a group of neurotics by Eysenck 
(7), and sorting tests were devised and adminis- 
tered by Lindberg (18), Clarke (5), and 
Eysenck (9). Finally the Ring test, designed 
and described by Lindberg (18), was used by 
Lindberg (18, 19), Campailla (4), Eysenck 
(7), and Clarke (5). 

It was not until Eysenck (7) reported that
the average intercorrelation between the three
tests used by him was only .24 that the suggestion
that the various color-form tests might
not all be measuring the same thing was put
forward. Thus Eysenck speculated that either
the tests used by him were all measuring fairly
specific attitudes or that if a general factor of
color-form attitude existed, his particular
tests might be rather poor measures of it.


1 The author wishes to express his gratitude to Dr.
H. J. Eysenck at whose suggestion and with whose advice
and encouragement this study was carried out.
This work was done while the author was a London
University Postgraduate Scholar at the Institute of
Psychiatry.

Further doubts about the comparability of 
color-form tests arise from a study of the 
results obtained from the application of such 
tests to psychotic Ss. Thus while schizophrenics 
appear able to abstract color from form or 
form from color more easily than manic- 
depressives in the tachistoscopic memory 
test (15), no significant differences between 
the attitudes to color or form of the two groups 
were found on the sorting test employed by 
Eysenck (9). Hence the tests seemed more 
likely to be measuring the ability to dissociate 
than to be indicative of attitude to color and 
form when used with these groups. However, 
this interpretation has not been extended to 
normal subjects, for Kretschmer and his 
associates (8) have used response to color and 
form as a distinguishing feature between 
schizothymic and cyclothymic personality 
types which are posited to be the normal proto- 
types of the more extreme psychoses. 

In order to clarify the situation somewhat, 
a factor analysis of a number of color-form 
tests was carried out by Keehn (13, 14). A 
factor of color-form attitude was identified, 
but although this pointed to the comparability 
of the earlier tests in terms of method, the 
correlations between the tests were low enough 
to show that results obtained from the use of 
one test were of little value in predicting 
response to others. Hence it was felt (14) that 
a battery of tests based on the results of the 
factor analysis would give more valid and 
reliable results than those obtainable from
the use of one test alone.


THE TEST BATTERY


From a large battery of color-form tests (13), nine 
were selected for special analysis (14), and it is these 
tests which constitute the present battery. The number 
beside each test refers to its serial position in the larger
battery (13). 

2. Tossing. A disc was tossed like a coin and S could
respond to color or shape in guessing which side would
fall uppermost.

4. Memory: paired associates. Four capital letters
had to be associated with four colored shapes. Colors
and shapes were later interchanged to see which category
had been associated with the letters.

8. Grouping. Twelve squares set out on a background 
could be grouped either by color or by disposition. 

10. Odd one out. Of nine figures on a card one was a 
lone shape and one a lone color. The subject was asked 
to pick out the odd figure. 

12. Stepping stones. The subject had to pick his way 
across a set of colored shapes from one point to another. 
Two correct paths were available, one where all the 
shapes were the same, the other where the colors were 
the same. 

13. Common elements. Three similar figures had to be 
chosen from eight. “Similarity” could be based on 
color or shape. 

19. Weigl sorting. Nine wooden blocks could be
sorted by color or shape.

22. Card sorting. A pack of cards was to be sorted
into four compartments. All but four of the cards could 
be sorted in only one way; the remaining four could be 
placed either by color or by shape. 

24b, Tachistoscopic matching. A given figure had to 
be selected from eight figures exposed tachistoscopi- 
cally. Selection could be by color or form. No figure 
exactly matched the original. 


The factorial analysis of these tests yielded two 
factors, the test saturations being reproduced in Table 1.
The first factor was identified as that of attitude toward
color or form. The obtained first factor saturations 
were used as though they were correlations of the tests 
with the factor (21), and regression weights for maxi- 
mum predictability of the color-form factor by the 
battery were calculated according to the Ledermann 
method as set out in Thomson (21). The resulting 
weights are shown in Table 2. The ensuing multiple 
correlation of the battery with the factor was found to 
be .93 thus showing the battery, weighted in this fash- 
ion, to be a highly valid measure of the factor. 

In order to obtain an estimate of the reliability of
the battery, use was made of the generalized Spearman-Brown
prophecy formula

$$r_T = \frac{Kr}{1 + (K - 1)r}$$

as set out in Gulliksen (12), where $r_T$ is the estimated
reliability of a total test whose inter-unit correlation is
$r$ and where the unit test is increased in length $K$ times.
This formula rests on the assumptions that all the units
are psychologically similar and that the correlations

TABLE 1
FACTOR SATURATION


TABLE 2
REGRESSION WEIGHTS

Test      2     4     8     10    12    13    19    22    24b
Weight    .20   .30   .06   .21   .19   .18   .15   .31   .09



TABLE 3

PERCENTAGE OF THE VARIOUS GROUPS GIVING THE
COLOR RESPONSE TO THE INDIVIDUAL TESTS

Test   Normals   Neurotics   Schizophrenics   Manic-depressives
2      61.5      48          64               60
3      ?         44          60               45
4      32        28          56               35
5      33        44          32               60
6      33        40          20               40
7      46        48          68               45
7a     47        44          56               55
8      42.5      36          68               25
9      61.5      64          80               75
10     38        36          60               55
11     43        -           -                -
12     60        64          72               70
13     35        48          80               55
14     21.5      36          28               20
15     ?.5       52          60               40
16     75        52          56               6?
17a    52.5      -           -                -
17b    63.5      -           -                -
18     55        48          64               55
19     24        16          68               35
20     44        36          56               30
21     40        36          44               70
22     45.5      56          64               50
23     45.5      64          64               55
24a    54        56          60               70
24b    38        28          20               70
25a    48.5      32          32               80
25b    49.5      32          64               50
26     49        60          68               70


between them are all equal. In this case each individual
test was taken as a unit and, as all nine tests have been
shown to measure along a common dimension, the
assumption of psychological similarity is not unreasonable.
However, the correlations between the items
varied between -.18 and +.44 with a mean of +.21,
so that although the average of these correlations was
used in the formula, the obtained reliability estimate
of .71 must not be regarded as anything more than a
general indication of the reliability of the battery.
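The reliability estimate can be reproduced from the figures given in the text. The sketch below is an illustration of the generalized Spearman-Brown formula, with each of the nine tests treated as one unit and the mean inter-test correlation of .21 used as the inter-unit correlation; the function name is hypothetical.

```python
def spearman_brown(r, k):
    """Estimated reliability of a test lengthened k times.

    r is the correlation between units; the formula assumes that all units
    are similar and that all inter-unit correlations are equal.
    """
    return k * r / (1 + (k - 1) * r)


battery_reliability = spearman_brown(r=0.21, k=9)
print(round(battery_reliability, 2))
```

Because the observed correlations ranged from -.18 to +.44 and were not equal, the result is best read as a rough indication, as the text itself cautions.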
A leini bai 

Although attention was primarily centered upon the
nine tests forming the color-form battery, all the tests
described by Keehn (13) were given to 25 schizophrenic,
20 manic-depressive, and 25 neurotic males whose age
ranged from 19 to 39 years. The reason for including
the extra tests was twofold. First, as the original
group had received the whole battery it was deemed
wiser to repeat this procedure with the patients rather
than risk the introduction of an extra relevant variable
and, second, it was anticipated that some of the extra
tests would prove useful in differentiating between the
groups for reasons other than their color content.


RESULTS 


Table 3, by showing the percentage of Ss
giving the color response to the individual
tests, exhibits the sort of information presented
by the earlier studies. Similar percentages
obtained from a group of 200 normal
males are also included. However, the main
interest here is not centered upon the individual
tests but upon the nine-item color-form
test battery as a whole. Consequently,
Table 4 shows means and standard deviations
for the groups where each test was given its
regression weight (see Table 2) and each
person's score was the algebraic sum of his
weighted scores on all the nine tests. A positive
score was given for a color response and a
negative one was given for a form response.
Again the results from the 200 normals are
included.


TABLE 4
GROUP MEANS AND SD'S ON THE COLOR-FORM
BATTERY*

Group               N      Mean     SD
Normals             200    -27.4    81.28
Schizophrenics      25      43.5    98.40
Manic-depressives   20      -2.5    90.53
Neurotics           25     -28.5    76.58


* Positive scores show a preponderance of color responses.
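The battery score described above, the algebraic sum of the weighted test scores, can be sketched as follows. The weights are those of Table 2; coding a color response as +1 and a form response as -1 is an assumption, and the scaling that yields values of the size shown in Table 4 is not stated in the text, so the sketch leaves scores in raw weight units.

```python
# Regression weights from Table 2, keyed by test number.
WEIGHTS = {
    "2": 0.20, "4": 0.30, "8": 0.06, "10": 0.21, "12": 0.19,
    "13": 0.18, "19": 0.15, "22": 0.31, "24b": 0.09,
}


def battery_score(responses):
    """Sum the weights, signed +1 for "color" and -1 for "form" (assumed coding)."""
    sign = {"color": 1, "form": -1}
    return sum(WEIGHTS[test] * sign[resp] for test, resp in responses.items())


all_color = {test: "color" for test in WEIGHTS}
mixed = dict(all_color, **{"4": "form", "22": "form"})
print(round(battery_score(all_color), 2))
print(round(battery_score(mixed), 2))
```

Changing the two most heavily weighted tests (4 and 22) from color to form moves the score much further than changing tests 8 or 24b would, which is the point of weighting.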


DISCUSSION 


Normals vs. neurotics. There is widespread 
feeling among psychologists (10, 20), especially 
those having experience with the Rorschach 
test, that reaction to color is somehow linked 
with emotional instability. Consequently, 
since Eysenck (7), Clarke (5), and others have 
linked neuroticism with lack of emotional 
control it seemed reasonable to expect neu- 
rotics to show greater reaction to color than 
normals. This was shown not to be so on the 
nine-test battery; neurotics reacted slightly 
but insignificantly less to color than the nor- 
mals. 

Only one of the individual tests, number 16,
differentiated between the two groups. As
this was at the .05 level, and as at least one
such differentiation would be expected by
chance, no significance was attached to this
result.
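The chance argument can be made concrete with a small calculation. The sketch assumes that the comparisons are independent, which the text does not claim, and takes 26 as the number of tests on which patient groups were compared (the Table 3 rows that carry patient data); both are assumptions made for illustration.

```python
def expected_chance_hits(n_tests, alpha):
    """Expected number of tests reaching significance by chance alone."""
    return n_tests * alpha


def prob_at_least_one(n_tests, alpha):
    """Chance of one or more such results, assuming independent tests."""
    return 1 - (1 - alpha) ** n_tests


n_tests = 26   # assumed count of tests compared between the two groups
alpha = 0.05
print(expected_chance_hits(n_tests, alpha))
print(round(prob_at_least_one(n_tests, alpha), 2))
```

On these assumptions somewhat more than one significant difference is expected by chance, which agrees with the decision to attach no significance to the single result on test 16.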

Schizophrenics vs. manic-depressives. Differential
reaction to color and form is one of
the bases of Kretschmer's (16) typological
system, and many studies carried out by his
school (see Eysenck [8] for summary) suggest
strongly that manic-depressives react more to
color than schizophrenics. However, not only
is this not supported by the scores of these
groups on the present battery, but the difference,
although statistically nonsignificant,
is decidedly in the reverse direction.

Examination of the individual tests shows
four which do differentiate between the two
groups. Test 25a, a comparison of reaction
time to color with that to shape, and test 24b
separated the groups at the .001 level, and in
both cases the manic-depressives obtained the
higher color scores. Yet tests 8 and 19, differentiating
between the groups at the .01
and .05 levels respectively, show the schizophrenics
to react more to color. Clearly, then,
the factor distinguishing between the groups
is not that of reaction to color, especially as
the three tests appearing in the color-form
battery have the lowest weights in that battery.

One possible explanation of these anomalous
findings would be that manic-depressives
have a lower threshold for perceiving color
in brief exposure tests (numbers 24b and 25a) 
but react to form when time for deliberation 
is allowed, as in tests 8 and 19 where the 
response is made with the test material still 
visible to S. However, the dangers besetting 
phenomenological interpretations of these 
tests will be apparent and no suggestion 
without considerable experimental backing is 
worth taking seriously. 

Psychotics vs. normals. On the color-form
test battery comparison of the normal and
schizophrenic groups shows the latter to be
significantly more color reactive beyond the
.001 level of confidence. The manic-depressives
fall between but are not significantly different
from either group. Since the combined
psychotic group differs significantly from the
normal group also, it is impossible to say from
this evidence whether the battery separates
out schizophrenics in particular, the manic-depressives
behaving on this battery similarly
to the normals, or whether it is measuring
psychoticism in general, the two psychotic
groups being combined.


TABLE 5

SIGNIFICANCES OF THE DIFFERENCES BETWEEN THE
VARIOUS GROUPS ON THE VARIOUS TESTS

        Normals vs.       Normals vs.          Normals vs.
Test    Schizophrenics    Manic-depressives    Psychotics
4       .05
5                         .05
7       .05
8       .05
9                                              .05
10      .05                                    .05
13      .001                                   .001
16                                             .05
19      .001                                   .001
21                        ?
24b                       .003
25a                       .003
26      .05                                    .05

In the light of Eysenck's (9) finding of a 
general factor of psychoticism but no group 
factor separating out the conventional psy- 
chotic subgroups, the latter alternative seems 
to be the most probable. Yet in Table 5, which 
depicts those individual tests differentiating 
between the normals and the schizophrenics, 
five tests occur in the color-form battery, 
whereas only one test in the battery dis- 
tinguishes between normals and manic-de- 
pressives. 

Further examination of Table 5 shows that 
as there is no overlap between the tests dif- 
ferentiating between the respective psychotic 
subgroups and the normals, it would appear 
that the postulation of a single psychotic 
dimension is inadequate. For if schizophrenics 
and manic-depressives lay along a single 
continuum of psychoticism, tests differentiat- 
ing between the closer groups would also dif- 
ferentiate between the extreme groups. Which- 
ever psychotic group lies nearer to the normals, 
in the present study this differentiation is 
not shown. 

Furthermore, as the difference between the 
means of the schizophrenics and manic- 
depressives approaches significance (t = 1.6) 
even with such small numbers, it is not un- 
likely that large groups would differ signifi- 
cantly on the battery. Hence rather than 
conclude that reaction to color on the color- 
form battery is indicative of psychoticism, it 
would seem better to regard responses to the 
battery as a measure of schizophrenia. 
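The quoted t can be approximately recovered from the group means and SDs in Table 4. The text does not say which form of the t test was used, so the unpooled standard error below is an assumption; the function name is illustrative.

```python
import math


def t_from_summary(mean1, sd1, n1, mean2, sd2, n2):
    """t for a difference between two means, using an unpooled standard error."""
    standard_error = math.sqrt(sd1 ** 2 / n1 + sd2 ** 2 / n2)
    return (mean1 - mean2) / standard_error


# Table 4 values: schizophrenics vs. manic-depressives.
t_schizo_md = t_from_summary(43.5, 98.40, 25, -2.5, 90.53, 20)
# Table 4 values: schizophrenics vs. normals.
t_schizo_normal = t_from_summary(43.5, 98.40, 25, -27.4, 81.28, 200)
print(round(t_schizo_md, 1))
print(round(t_schizo_normal, 1))
```

The first value agrees with the t of 1.6 given in the text, and the second is large enough to fit the reported difference between schizophrenics and normals.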

Finally, in the last column of Table 5, six 
individual tests occur that discriminate sig- 
nificantly between the total psychotic group 
and the normals. Since four of these tests also 
differentiate between the normals and one or 
other of the psychotic subgroups, it is likely 
that in these cases the differences are largely 
due to the effect of the relevant subgroup. 
The remaining two tests, numbers 9 and 16,
may be real measures of psychoticism or they
may be exhibiting some of the chance differences
which inevitably occur when a large
number of comparisons are made, as is the
case in the present investigation.


SUMMARY AND CONCLUSIONS 


A battery of nine color-form tests was given
to 25 schizophrenic, 20 manic-depressive,
and 25 neurotic patients, and their responses
were compared with each other and with
those of a normal group of 200 males. Several
[...] differ significantly from normals in their
responses to color on the test battery, and
that manic-depressives respond less than
schizophrenics do to color but not significantly
so. The schizophrenics did, however, react
significantly more to color than did the other
groups, and it was argued that the battery was
measuring schizophrenia specifically rather
than psychoticism.

[...] tests showed that although schizophrenics
sometimes gave significantly more color
responses than manic-depressives, the reverse
was true of tests involving responses to briefly
exposed stimuli. It was therefore suggested
that interpretation of the processes involved
in reaction to these tests must not be based
upon appearances. Thus although Kretschmer's
school seems to be correct in its
assertion that cyclothymes react more to
color than schizothymes on the tachistoscopic-type
color-form tests, it does not follow that
cyclothymes are more reactive to color in
general.

Finally, from the pattern of tests differentiating
between the normals and schizophrenics
and the normals and manic-depressives
respectively, it was argued that a single factor
of psychoticism was insufficient to account
for the variations between the groups. It is
likely that an investigation similar to that of
Eysenck (9) but including some of the present
tests will result, as Eysenck suggests, in the
extraction of a factor of schizothymia-cyclothymia.
However, such an investigation would
need to be carried out with considerable care,
for if two orthogonal factors such as psychoticism
and schizothymia-cyclothymia are to
be posited it would be incorrect to regard
psychotic subjects as necessarily at the extremes
of the latter continuum.
REFERENCES


1. Blomquist, I. Investigations into the color-form
reaction of epileptics. Acta Psychiat., Kbh.,
1949, 24, 149-166.

2. Bolles, M. M. The basis of pertinence. Arch.
Psychol., N. Y., 1937, No. 212.

3. Braat, J. P. Die experimentelle Psychologie und
Kretschmer's Konstitutionstypen. Mschr. Psychiat.
Neurol., 1936, 94, 273-297.

4. Campailla, G. Sulla polarizzazione percettiva
verso il colore rispetto alla forma in ammalati
di mente. (On perceptual polarization toward
color and form in mental patients.) Arch. Psicol.
Neur. Psich., 1943, 4, 1-13.

5. Clarke, A. D. B. The measurement of emotional
instability by means of objective tests. Unpublished
doctor's thesis, London Univer., 1950.

6. Cotton, C. B. A study of the reactions of spastic
children to certain test situations. J. genet.
Psychol., 1941, 68, 27-44.

7. Eysenck, H. J. Dimensions of personality. London:
Kegan Paul, 1947.

8. Eysenck, H. J. Cyclothymia and schizothymia
as a dimension of personality. I. Historical
review. J. Pers., 1950, 19, 123-152.

9. Eysenck, H. J. Schizothymia-cyclothymia as a
dimension of personality. II. Experimental.
J. Pers., 1952, 20, 345-384.

10. Fortier, R. H. The response to color and ego
functions. Psychol. Bull., 1953, 60, 41-63.

11. Goldstein, K., & Scheerer, M. Abstract and
concrete behavior: an experimental study with
special tests. Psychol. Monogr., 1941, 58, No.
2 (Whole No. 239).

12. Gulliksen, H. Theory of mental tests. New York:
Wiley, 1950.

13. Keehn, J. D. Rorschach validation III: An examination
of the role of colour as a determinant
in the Rorschach test. J. ment. Sci., 1953,
99, 410-438.

14. Keehn, J. D. An experimental investigation into
the validity of the use of attitude to colour and
form in the assessment of personality. Unpublished
doctor's thesis, London Univer., 1953.

15. Kibler, M. Experimentalpsychologischer Beitrag
zur Typenforschung. Zbl. ges. Neurol. Psychiat.,
1925, 98, 524-544.

16. Kretschmer, E. Körperbau und Charakter. Berlin:
Springer-Verlag, 1948.

17. Kuhlmann, F. Experimental studies in mental
deficiency: Three cases of imbecility (Mongolian)
and six cases of feeble-mindedness.
Amer. J. Psychol., 1904, 15, 391.

18. Lindberg, B. J. Experimental studies of color and
non-color attitude in schoolchildren and adults.
Copenhagen: Levin & Munksgaard, 1938.

19. Lindberg, B. J. Different mixtures of syntonic
and asthenic personality traits. Acta Psychiat.,
Kbh., 1950, 25, 225-235.

20. Norman, R. D., & Scott, W. A. Color and affect:
a review and semantic evaluation. J. gen. Psychol.,
1952, 46, 185-223.

21. Thomson, G. The factorial analysis of human ability.
London: Univer. of London Press, 1950.

Received June 27, 1953. 


ANXIETY AND THE MODE OF RESPONSE¹ ²


ROBERT E. SILVERMAN³
Indiana University 


RECENT studies in the area of perception
and motivation have often produced
data with contradictory and ambiguous
interpretations (3, 4, 6, 8, 11, 12, 13).
One explanation for this situation may be
that interpretations are made which are burdened
with ad hoc concepts like perceptual
defense and perceptual sensitization that even
now show signs of being eschewed by their
authors (12).

It is felt that these concepts are entirely 
superfluous and arise because investigators 
fail to control the stimulus situation and to 
consider responses apart from any goal-seeking 
or drive-reducing properties. It would seem
that a fruitful approach at this time would 
involve studying the gross responses or a 
general class of responses that persons make 
to the immediate stimulus situation. Once 
we have acquired some information at this 
level, the next step might then be concerned 
with investigating the development or ac- 
quisition of responses as a function of stimulus 
conditions. 

In attempting to treat the first problem, 
that of studying the responses, it is necessary 
to develop a method of grouping or classifying 
responses as they relate to classes of stimuli. 
The classification scheme used in the experi- 
ment reported here simply entails a considera- 
tion of responses as they relate to the relevant 
as opposed to the incidental stimuli in a con- 
trolled stimulus situation. 

Given an experimental setting, we may 
designate the subject’s (S's) task as it is struc- 
tured by the experimenter (E) as being com- 
posed of relevant stimuli. All the controlled 
stimuli in the situation that have no observ- 
able relationship to the relevant stimuli are 
designated as incidental. We may assume that 


¹ This paper is a portion of a thesis submitted to the
Department of Psychology of Indiana University in
partial fulfillment of the requirements for the degree
of Doctor of Philosophy. The writer is indebted to Dr.
Bertram D. Cohen for his advice and assistance
throughout the study.

² A part of this paper was read before the Eastern
Psychological Association at Boston in 1953.
³ Now at New York University.




in the experimental setting an S’s mode of
response can involve responding to: (a) the
relevant or task stimuli, (b) the incidental
stimuli, (c) both, (d) neither.

Bahrick and Fitts (2) have demonstrated
that increasing the reward value of a central
task interferes with performance of a peripheral
task. As motivation was increased Ss
tended to respond more to the relevant as
opposed to the incidental aspects of the
stimulus situation. Bahrick and Fitts’s Ss
were rewarded or not rewarded for each discrete
response. Thus, the positive reinforcement
for each response was explicitly related
to the relevant task.

However, the question arises as to the general
conditions that will elicit relevant as opposed
to incidental responses. What, for example,
will be the effect of anxiety-evoking
variables? Does anxiety enhance the selectivity
of responding as some authors have
predicted (4, 11, 13)? Or, as a second group of
writers have thought (5, 9, 10), do the conditions
which evoke anxiety produce a general
unselective type of responding?

The present study was designed to investigate
the effects of anxiety, defined as the
presentation of the threat of electric shock,
on the mode of response of human Ss. The
experimental situation permitted some degree
of control over the relevant and the incidental
stimuli, and measures of the responses to
these were obtained. If S’s mode of response
emphasized responding to the relevant stimuli
we would expect greater improvement in
performance on the experimental task as compared
to improvement in performance for
Ss who respond to the incidental stimuli or
who respond to neither the relevant nor incidental
stimuli. The Ss who respond to the incidental
stimuli would manifest more incidental
learning than Ss who use the other modes of
response.

The possibility exists that responding to
both the incidental and relevant stimuli could
raise or lower performance with respect to
either class of stimuli. In order to check this,
two types of relevant tasks have been used.



One task was considered as being difficult and 
the second task was a very easy one. In this 
way the relationship between the relevant
and incidental tasks was observed. 


METHOD 


Subjects. The Ss were 146 students from first-year 
psychology courses. They were assigned randomly to 
six groups. The groups each contained 25 Ss with the 
exception of groups E and F, each of which consisted of 
23 Ss. 

Apparatus. The experiment was performed in a 
sound-deadened room 8 ft. wide by 14 ft. long by 8 ft. 
high. The apparatus consisted of a response lever which 
S moved to indicate a response, a film strip projector 
which was used to present the relevant task stimuli, and 
a wire recorder and separate speaker attachment which 
were used to present the incidental stimuli. In addition, 
an inductorium was used to administer a sample shock 
to the shock-threat groups.

The response lever was a 2-in. plastic rod projecting
through an aperture in a 23-in. by 43-in. wooden screen.
The rod was connected to a brass slider which in turn 
was connected to an endless loop of string that moved 
a writing pen attached to an Esterline-Angus recording 
system. The arrangement was such that when the rod 
and slider were moved 9 in. the movement was recorded 
as a 2.2-in. move on the recording tape. 

The stimuli for the relevant task were presented by a 
film strip projector connected to a timing system which 
changed the film strip frame every 10 sec. The stimuli 
were five black lines 0.5 in. in width and varying in 
height from 2.25 in. to 3.25 in. in steps of .25 in. These 
stimuli were presented on a 6 by 8-in. white screen at 
S's eye level. The projector was behind the screen; thus 
only the screen and the response lever were visible to S. 

The incidental stimuli were 20 two-syllable words
randomly selected from 94 spondaic words developed
at the Psycho-Acoustic Laboratory of Harvard University
(7). The words used were: railroad, sidewalk, midway,
whitewash, playmate, buckwheat, sundown,
platform, lifeboat, cookbook, outside, beehive, workshop,
headlight, doorstep, scarecrow, wildcat, baseball,
grandson, stopwatch. Ten repetitions of the words were
recorded in 5-min. intervals on a wire recorder (Radiotechnic,
Model 55A) in a woman’s voice. The incidental
stimuli were introduced by switching on the wire
recorder which played through a muffled 6-in. speaker
placed in the ceiling of the experimental room. The
volume output for the recorder was set at 50. At S’s
position, the sound-level intensity of the words was
approximately as loud as “subdued conversational
speech.”
The sample shock was administered through an
inductorium energized by two 1.5-v. dry-cell batteries
in series. The primary control switch was set for continuous
vibrator operation and the output voltage was
at maximum.

Preliminary training and pretest. Each S in groups A, 
B, C, and D was seated at the apparatus, and the 
projector was switched on to present the training 
stimulus. The S was instructed that this was to be a 
discrimination task and that he was to learn to move the 
key when he saw the line which was now on the screen,
but that he was not to respond when any other line
appeared. A brief training series followed during which 
E reinforced a correct response by saying “that’s right” 
and indicated a wrong response (failing to move when 
the correct line was presented or moving the response 
key when an incorrect line was presented) by stating 
“you should not move the key since that is a longer 
(shorter) line” or “you should move the pointer since 
that is the correct line.” During the training series the 
stimuli were presented in the following order: 3, 3, 3, 
3, 5, 3, 1, 3, 2, 3, 4, 3, where 3 indicates the correct 
line (2.75 in.), 1 is the longest line (3.25 in.), and 5 is the 
shortest line (2.25 in.). Following the training series 
each S received a pretest in which the lines were pre- 
sented in the following order: 3, 5, 3, 3, 3, 2, 3, 4, 3, 1, 
3, 2, 3, 3, 3, 4, 3, 1, 3, 5. 

An S who made more than five errors on the pretest 
was eliminated from the remainder of the experiment. 
Eleven Ss were eliminated. An error was defined as 
responding incorrectly to a line other than 3 or failing 
to respond to line 3. 
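
The error definition and the elimination rule just stated can be written out as a short sketch. The pretest order and the five-error limit are taken from the text; the function names and the use of a list of booleans to stand for lever movements are assumptions made only for illustration.

```python
# Pretest order of the five lines as listed in the text; line 3 (2.75 in.) is correct.
PRETEST_ORDER = [3, 5, 3, 3, 3, 2, 3, 4, 3, 1, 3, 2, 3, 3, 3, 4, 3, 1, 3, 5]
CORRECT_LINE = 3
MAX_ERRORS = 5  # an S with more than five pretest errors was eliminated


def count_errors(stimuli, responded):
    """Count errors: moving the lever to a line other than 3,
    or failing to move it when line 3 is shown."""
    if len(stimuli) != len(responded):
        raise ValueError("one response record is needed per stimulus")
    return sum((s == CORRECT_LINE) != bool(r) for s, r in zip(stimuli, responded))


def passes_pretest(stimuli, responded, max_errors=MAX_ERRORS):
    """Return True if S stays in the experiment (hypothetical helper)."""
    return count_errors(stimuli, responded) <= max_errors


if __name__ == "__main__":
    always_moves = [True] * len(PRETEST_ORDER)
    print(count_errors(PRETEST_ORDER, always_moves))
    print(passes_pretest(PRETEST_ORDER, always_moves))
```

Under this rule an S who moved the lever to every line would err on each of the eight non-3 presentations and so would be eliminated.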

The Ss in groups E and F were exposed to a very 
easy discrimination task during the training and pre- 
test series. They were instructed to respond whenever 
a line (any line) appeared on the screen. The lines were 
the same as those used for the other groups. 

Test series. For all Ss the pretest series was followed 
by a 1-min. rest period and then a test series. In the 
test series the order of presentation was: 3, 5, 1, 2, 4, 5,
4, 1, 2, 3, 4, 1, 2, 5, 3, 1, 4, 3, 1, 5, 2, 3, 4, 4, 5, 1, 3, 2, 5,
4, 1, 4, 2, 1, 5, 3. The Ss in groups E and F again
merely had to respond when any line appeared on the 
screen. 

The experimental conditions were introduced dur- 
ing the test series for all six groups. Group A performed 
the test series while being threatened with electric 
shock. Electrodes were attached to S's left wrist and 
left ankle (the ankle electrodes were "props" in that 
they were not attached to the inductorium). Each S
was told that he would receive an occasional electric
shock during the test series. A sample shock was given
before the series began, but shock was not administered 
again during the experiment. One minute after the 
presentation of the first line, the words (incidental 
stimuli) were started. The words continued until the 
final 15 sec. of the test series. Thus the words were 
presented while visual stimuli 7 through 34 were being 
presented. 

Group B performed the discrimination test under
the same conditions as group A with the exception of
the shock threat. Group C performed the discrimination
test under the threat of shock as did group A, but this
group was not exposed to the words. Group D was
exposed to neither the shock threat nor the words.
Groups E and F performed the easy discrimination.
Group E was subjected to shock threat and exposed to
the words during the test series. Group F received no
threat of shock but was exposed to the words during the
test series.

TABLE 1
MEANS AND STANDARD DEVIATIONS OF THE
RECALL AND RECOGNITION SCORES

                  Difficult              Easy
Measure       A Shock | B Nonshock | E Shock | F Nonshock
Recall
  Mean          6.56  |   12.20    |  11.09  |   19.13
  SD            6.93  |   11.83    |  10.44  |   11.61
Recognition
  Mean          4.31  |    6.19    |   5.09  |    7.25
  SD            4.60  |    4.73    |   4.18  |    3.88

TABLE 2
ANALYSIS OF VARIANCE OF THE RECALL AND RECOGNITION SCORES

                                      Recall                          Recognition
Source                    Sum of    | df | Mean    | F         Sum of       | df | Mean
                          Squares   |    | Square  |           Squares      |    | Square
Difficulty of task          786.75  |  1 |  786.75 | 5.879*      20.25      |  1 | 20.25
Shock and nonshock         1105.71  |  1 | 1105.71 | 8.263**     97.35      |  1 | 97.35
Interaction task X shock     34.58  |  1 |   34.58 |  .258     [illegible]  |  1 |   .47
Within cells              12311.34  | 92 |  133.82 |           1844.25      | 92 | 20.05

* Significant at the .05 level of confidence.
** Significant at the .01 level of confidence.

TABLE 4
ANALYSIS OF VARIANCE OF THE DISCRIMINATION TEST SCORES

Source           Sum of Squares | df | Mean Square | F
Words              [illegible]
Shock              [illegible]
Words X Shock        132.25     |  1 |   132.250   | 9.941**
Within cells        1277.12     | 96 |    13.303

** Significant at the .01 level of confidence.


Test for incidental learning. Following the dis- 
crimination test, Ss who had been exposed to the words 
were given a recall test and a recognition test for the 
incidental words. Recall scores were obtained by allow- 
ing full credit for a word correctly recalled, half credit 
for one syllable of the two-syllable word correctly re- 
called, and one-fourth credit for any two-syllable, 
spondaic word designated. The total score was multi- 
plied by 4 to eliminate fractions. Recognition scores 
were obtained by having Ss select words from a matrix 
containing 80 words. A score was corrected for guessing 
by subtracting from the number correct one-third the 
number of incorrect words. 
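
The two scoring rules just described can be sketched as follows. The credits (full, half, one-fourth, then multiplication by 4) and the one-third correction are taken from the text; the function and argument names are hypothetical, and reading the one-third correction as the familiar "rights minus wrongs over (k - 1)" formula with k = 4 (80 words, 20 of them presented) is an interpretation rather than something the paper states.

```python
def recall_score(n_full, n_half, n_quarter):
    """Recall score after multiplying by 4 to eliminate fractions.

    n_full    -- words correctly recalled (credit 1)
    n_half    -- words with one syllable correct (credit 1/2)
    n_quarter -- other two-syllable spondaic words offered (credit 1/4)
    """
    # (1*n_full + 0.5*n_half + 0.25*n_quarter) * 4, kept in integers
    return 4 * n_full + 2 * n_half + n_quarter


def recognition_score(n_correct, n_incorrect):
    """Recognition score corrected for guessing: correct minus one-third of incorrect."""
    return n_correct - n_incorrect / 3


if __name__ == "__main__":
    print(recall_score(3, 2, 1))      # three words, two half credits, one quarter credit
    print(recognition_score(9, 3))    # nine correct selections, three incorrect
```

Keeping the recall arithmetic in integers mirrors the stated purpose of the multiplication by 4.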


RESULTS 


Incidental learning. The mean recall and
recognition scores for the groups exposed to
the words are presented in Table 1. Inspection
of these tabulated data indicates more evidence
of incidental learning in the groups (E, F)
which performed the easy discrimination task
than in the groups (A, B) which performed
the more difficult discrimination task. Even
more marked is the superiority in incidental
learning, as evidenced by their mean scores,
of the nonshock groups (B, F) over the shock
groups (A, E).


TABLE 3

MEANS AND STANDARD DEVIATIONS OF THE
DISCRIMINATION TEST SCORES

                  Words                 No Words
Measure     A Shock | B Nonshock | C Shock | D Nonshock
Mean         27.96  |   29.56    |  30.00  |   27.00
SD            4.29  |    3.24    |   3.44  |    3.21


The results of the analysis of variance for the
incidental learning scores are presented in
Table 2.⁴ That the degree of difficulty of the
discrimination task was an effective variable
is indicated by the F ratio for the recall scores.
However, the differences between the nonshock
and shock conditions were statistically
significant for both the recall and recognition
scores.
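
As a check on the recall columns of Table 2, each F ratio can be recomputed as the effect mean square divided by the within-cells mean square. The mean squares below are copied from the table; the variable and function names are illustrative only, and no significance test is attempted here.

```python
# Mean squares for the recall scores, as printed in Table 2.
MS_WITHIN = 133.82  # within-cells mean square (92 df)
MS_EFFECTS = {
    "difficulty of task": 786.75,
    "shock and nonshock": 1105.71,
    "task x shock interaction": 34.58,
}


def f_ratio(ms_effect, ms_within):
    """F ratio for a 1-df effect tested against the within-cells mean square."""
    return ms_effect / ms_within


# F ratios rounded to three decimals, the precision used in the table.
F_RATIOS = {name: round(f_ratio(ms, MS_WITHIN), 3) for name, ms in MS_EFFECTS.items()}

if __name__ == "__main__":
    for name, value in F_RATIOS.items():
        print(f"{name}: F = {value}")
```

The recomputed values agree with the tabled entries of 5.879, 8.263, and .258.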


Discrimination task. Discrimination scores
were obtained by tabulating the number of
correct responses made by each S during the
pretest and test series. An analysis of the pretest
data indicated that significant differences
did not exist among the four groups. Thus it
was assumed that the groups were relatively
similar with respect to initial discrimination
ability.

⁴ The variances for the recall scores were heterogeneous.
Transforming the data so as to meet the
assumption of homogeneity does not change the probabilities
(F for shock, nonshock was 8.7795; F for
difficult, easy was 5.035). Therefore the analysis
for the nontransformed data is presented in Table 2.

The means and SD’s for the discrimination- 
test scores are presented in Table 3. The
analysis of variance for these data is presented 
in Table 4. 

A significant interaction effect was found 
between the experimental conditions where no 
over-all differences had been found between 
the words, no-words conditions or between the 
shock, nonshock conditions. This interaction 
would seem to indicate that the two experi- 
mental conditions, shock and words, act dif- 
ferently in combination than either of these 
conditions acts alone. The discrimination of 
Ss exposed to shock threat is slightly impaired 
when the words are present as compared to 
the no-words condition. However, this im- 
pairment cannot be attributed merely to the 
presence of the words, for a comparison of the 
words and no-words groups which are not 
exposed to shock threat reveals that the words 
condition alone improved performance. It is 
only when the words are combined with the 
threat of shock that performance is disrupted. 
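
The interaction described above can be illustrated from the cell means in Table 3. The sketch below assumes 25 Ss per cell (the size given for groups A through D in the Subjects paragraph) and uses the standard single-degree-of-freedom contrast formula for a balanced 2 x 2 design, SS = n L² / 4; the names are hypothetical and the calculation is offered only as a consistency check on the printed tables.

```python
# Cell means of the discrimination test scores, from Table 3.
MEANS = {
    ("words", "shock"): 27.96,        # group A
    ("words", "nonshock"): 29.56,     # group B
    ("no_words", "shock"): 30.00,     # group C
    ("no_words", "nonshock"): 27.00,  # group D
}
N_PER_CELL = 25      # assumed from the Subjects paragraph
MS_WITHIN = 13.303   # within-cells mean square, from Table 4

# Simple effect of the words under each shock condition.
words_effect_shock = MEANS[("words", "shock")] - MEANS[("no_words", "shock")]
words_effect_nonshock = MEANS[("words", "nonshock")] - MEANS[("no_words", "nonshock")]

# Interaction contrast: does the effect of the words depend on shock threat?
contrast = words_effect_shock - words_effect_nonshock

# Sum of squares for a 1-df contrast with coefficients +1, -1, -1, +1.
ss_interaction = N_PER_CELL * contrast ** 2 / 4
f_interaction = ss_interaction / MS_WITHIN

if __name__ == "__main__":
    print(round(words_effect_shock, 2), round(words_effect_nonshock, 2))
    print(round(ss_interaction, 2), round(f_interaction, 3))
```

The words lower the mean by about two points under shock threat and raise it by about two and a half points without it, and the resulting sum of squares and F agree with the Words X Shock row of Table 4 (132.25 and 9.941).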


DISCUSSION


The results of the incidental learning tests 
are generally in agreement with those reported 
by Aborn (1) who used very different stimulus 
materials. His data indicate that “ego threat” 
decreased incidental learning as measured 
by recall and recognition scores. We might 
interpret such findings as offering support for 
the notion that threat acts to increase the
relevant mode of response. In the study reported
here, we would expect an S who
responds predominantly to the relevant
stimuli to show decreased incidental learning,
as would an S who responds to neither class of
stimuli. Responding to both classes should
produce relatively good performance in both
spheres since the facilitating effects of the
words would cancel the interfering effects of
the discrimination task. It is also possible that
responding to some degree in the direction of
the incidental stimuli might increase the relative
efficiency in both tasks.

The mode of response of Ss exposed to
shock threat may be considered as being directed
toward the relevant stimuli since these
Ss show relatively good discrimination when
no words are present and poor discrimination 
as well as poor incidental learning when ex- 
posed to the words. The possibility exists 
that the shock-threat Ss attempt to avoid 
listening to the words and that these avoidance 
responses in turn interfere with discrimination 
accuracy. This bit of speculation stems from 
the fact that the group (A) exposed to both 
words and shock threat shows impaired dis- 
crimination. 

The fact that shock threat elicits a relevant 
mode of response is additional evidence for 
what appears to be the selective and stereo- 
typed patterns of response often attributed 
to the chronically anxious person. Whether 
or not the condition of anxiety per se enhances 
this mode of response is problematical. How- 
ever, by relating the mode of response to the 
discrete responses investigated by Bahrick 
and Fitts (2) the acquisition of the relevant 
response can be investigated within a general 
learning framework. We may consider the 
mode of response as a learned response ac- 
quired in the same manner as any class of 
discrete responses. Accordingly, we would 
predict that persons with a history of clinical 
anxiety would in a situation similar to that 
reported above manifest a strong relevant 
mode of response.⁵

In the study reported here, the relevant 
response is defined by E. The task itself cannot 
be said to have any anxiety-reducing proper- 
ties, but by responding to the structured task 
Ss are literally conforming in a miniature social 
situation. This is to be expected, since, in a 
discussion of human behavior, relevance in 
its broad sense is defined by society. Thus we 
would predict and do find that a large segment 
of the anxious patients are rigid conformists. 
The typical compulsive patient with symptoms 
of manifest anxiety is largely a relevant responder
although the rituals and compulsions
often indulged in by these patients appear to 
be unusual. The explanation for this is that 
relevance is defined by subcultures and even 
by fantasy activity as well as by the mores of 
society at large. Thus a patient who indulges 
in a bizarre ritual is probably making his 
own interpretation of relevance. 


⁵ A study using test scores to establish a history of
anxiety is in progress to test this prediction.


SUMMARY 

This experiment investigated the influence 
of shock threat on the responses to incidental 
auditory stimuli and relevant visual stimuli. 
The Ss were trained to perform a visual dis- 
crimination task and were then tested for 
discrimination accuracy. The Ss who per- 
formed the discrimination test in the presence 
of incidental auditory stimuli were tested for 
recall and recognition of these stimuli. 

The results indicated that Ss exposed to 
the threat of electric shock manifested less 
incidental learning than Ss not exposed to 
shock threat. The shock-threat condition in 
the presence of the incidental stimuli impaired 
discrimination, but the shock-threat condition 
alone improved the accuracy of discrimina- 
tion. 

The results were interpreted as lending
support to the clinical data that show anxious
persons to be relevant responders or conformists.


REFERENCES 


1. Aborn, M. The influence of experimentally induced
failure on the retention of material
acquired through set and incidental learning.
J. exp. Psychol., 1953, 45, 225-231.

2. Bahrick, H. P., Fitts, P. M., & Rankin, R. E.
Effect of incentives upon reactions to peripheral
stimuli. J. exp. Psychol., 1952, 44, 400-406.

3. Bruner, J. S., & Goodman, Cecile C. Value and
need as organizing factors in perception. J. abnorm.
soc. Psychol., 1947, 42, 33-44.

4. Bruner, J. S., & Postman, L. Emotional selectivity
in perception and reaction. J. Pers., 1947,
16, 69-77.

5. Cameron, N. The psychology of the behavior disorders.
New York: Houghton Mifflin, 1947.

6. Carter, L. F., & Schooler, K. Value, need, and
other factors in perception. Psychol. Rev., 1949,
56, 200-207.

7. Davis, H. (Ed.) Hearing and deafness, a guide for
laymen. New York: Murray Hill Books Inc.,
1947.

8. Howes, D. H., & Solomon, R. L. Visual duration
thresholds as a function of word probability.
J. exp. Psychol., 1951, 41, 401-410.

9. Jenkins, J. G. Instruction as a factor in "incidental"
learning. Amer. J. Psychol., 1933,
471-477.

10. Liddell, H. The role of vigilance in the development
of animal neurosis. In P. Hoch & J. Zubin
(Eds.), Anxiety. New York: Grune & Stratton,
1951. Pp. 183-196.

11. McGinnies, E. Emotionality and perceptual
defense. Psychol. Rev., 1949, 56, 244-251.


1949, 18, 252-266. 
Received July 2, 1953. 


AN ANALYSIS OF THE CATTELL-LUBORSKY HUMOR TEST 
INTO HOMOGENEOUS SCALES 


JAMES K. YARNOLD AND MARVIN H. BERKELEY
Human Resources Research Center, Lackland Air Force Base 


THE ANALYSIS of reactions to various
types of humor stimuli has sometimes
been regarded as a potentially valuable
means of personality measurement. The problem
of classification and interpretation of
responses to humor, however, has been a deterrent
to the exploratory use of humor
scales in personality research.

Andrews accomplished the first large-scale 
statistical analysis of a sample of humor 
items (2). His factor analysis of a 24-item 
matrix demonstrated that humor response 
was not of an entirely general nature as main- 
tained by certain early theorists (9). The item 
intercorrelation range (—.10 to .59, tetra- 
choric) indicated the magnitude of covariation. 
In addition, his study showed that the specific 
item structure, whether it be cartoon, joke, 
or limerick, was not of primary importance. 
While this latter information is certainly 
important, it should be pointed out that 
Andrews’ factors would have little utility as 
actual measures of humor response. 

Similarly, the research of Cattell and others
has yielded little in terms of sufficiently homogeneous
and independent measurement devices.
The first clusters (Cattell and Luborsky
[4]) were developed with a too lenient criterion
for including an item in a cluster, resulting in
measures with low reliabilities. A later study by
Cattell, Luborsky, and Hicks (6) was concerned
with the factor analysis of a sample of
humor items. The factor tests resulting from
this analysis also fail to meet the minimum
qualifications for an adequate set of scales.
Moreover, their psychological meaning appears
overly complex. The median reliability (Kuder-Richardson
Formula 20) is .29, the coefficients
ranging from .18 to .51. Although the median
intercorrelation between factor tests is only
.17, the limiting effect of the very low reliabilities
suggests that this degree of independence
may be spurious.
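
The argument about reliability limiting the observed intercorrelation can be made concrete with a brief sketch. It shows the Kuder-Richardson Formula 20 computed from dichotomous item responses, and Spearman's correction for attenuation applied to the median figures quoted above. The correction is not something the authors report; applying the median reliability to both tests, the toy response matrix, and the function names are all assumptions made purely for illustration.

```python
def kr20(responses):
    """Kuder-Richardson Formula 20 for a list of persons, each a list of 0/1 item scores.

    KR-20 = k/(k-1) * (1 - sum(p*q) / variance of total scores),
    using the population variance of the total scores.
    """
    n = len(responses)
    k = len(responses[0])
    totals = [sum(person) for person in responses]
    mean_total = sum(totals) / n
    var_total = sum((t - mean_total) ** 2 for t in totals) / n
    sum_pq = 0.0
    for j in range(k):
        p = sum(person[j] for person in responses) / n  # proportion answering "funny"
        sum_pq += p * (1 - p)
    return k / (k - 1) * (1 - sum_pq / var_total)


def disattenuated_r(r_xy, r_xx, r_yy):
    """Spearman's correction for attenuation: r_xy / sqrt(r_xx * r_yy)."""
    return r_xy / (r_xx * r_yy) ** 0.5


if __name__ == "__main__":
    toy = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]  # invented data, 4 persons x 3 items
    print(round(kr20(toy), 3))
    # Median intercorrelation .17 with median reliability .29 assumed for both tests.
    print(round(disattenuated_r(0.17, 0.29, 0.29), 3))
```

On these assumptions an observed correlation of .17 between two tests of reliability .29 corresponds to a corrected correlation of roughly .59, which is the sense in which the apparent independence of the factor tests may be spurious.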

On the basis of the scaling deficiencies of
the humor clusters and factors derived from
the three major attempts at empirical classification
of humor items, it is apparent that
no adequate measures of humor response yet
exist. The task of the project described here
has been to derive an unspecified number of
homogeneous scales from a heterogeneous
pool of humor items.


RESEARCH DESIGN 


The research design for this task was organ- 
ized as follows: (a) selection of first item set, (b) 
selection of subject sample, (c) test administra- 
tion, (d) analysis into tentative scales, (e) selec- 
tion of second item set, (f) analysis into final 
scales, (g) comparison of the tentative and final 
tests, and (h) psychological interpretations of
final scales. Each of these phases of the research 
design is discussed in the above order. 


Selection of First Item Set 


It is obvious that the adequacy of coverage of the 
personality domain is limited by the coverage of the 
items selected for analysis. Cattell and Luborsky had 
selected the items for their C-L Humor Test, Forms A 
and B, from an extensive pool of 2,000 jokes and limer- 
icks. It was decided to capitalize on their previous work 
by selecting as the initial set one of the two forms of 
their test. Form A (91 items) required the forced-choice 
selection of the funnier of two jokes. Form B (112 items) 
required a response of funny or dull for each item. The 
first approximation of homogeneous scales was based 
on the collection of the more highly intercorrelated 
clusters developed by Cattell and Luborsky (5). Since 
Form A clusters (forced-choice items) presented many 
more interpretative problems than Form B clusters, 
items of the latter form were selected as the initial item
pool. 

Selection of Subject Sample 

The sample on which the Form B homogeneous keys 
were developed consisted of 500 completed answer 
sheets of officer candidates at Lackland Air Force 
Base. The test was administered while most of the 
officer candidates were in their second week of training. 
In order to facilitate computational procedures, the 
sample was decreased to 500 cases from a number 
slightly over that amount, and stratified on the basis
of sex.

The cross-validation sample used to test the stability
of the Form B keys consisted of 293 officer candidates of 
a subsequent class. The proportion of males and females 
as well as other OCS selection procedures remained 
unchanged for the cross-validation sample. 



TABLE 1


PRODUCT-MOMENT INTERCORRELATIONS AND DISTRIBUTIONAL STATISTICS OF THE FINAL HOMOGENEOUS
TESTS WITH AN INDEPENDENT SAMPLE


(Sample: 230 Students, OCS Class 53-A)


Test | Mean  | SD   | K-R 20*     | No. of items
RD   | 13.35 | 3.46 | .70         | 20
RF   |  6.22 | 2.37 | .43         | 15
RG   |  4.27 | 2.65 | .62         | 15
RG   |  6.49 | 2.02 | .45         | 11
R    |  4.91 | 1.88 | .42         | 10
S    |  4.44 | 1.83 | .40         | 9
T    |  3.40 | 2.06 | [illegible] | [illegible]

[The intercorrelation entries and the scale subscripts are illegible in the source.]

* Kuder-Richardson Formula 20. 


Test Administration 


The Form B item set was administered in testing 
rooms containing approximately 40 subjects (Ss) each. 
The Ss were told that their "sense of humor" was being 
measured. (Cattell and Luborsky had labeled Form B 
as “A Humor Rating Quiz.") An unlimited amount of 
time was allowed for test completion, although Ss were 
asked to record their immediate opinion of items. In 
order to reduce social facilitation, Ss were requested to 
inhibit audible laughter. However, in all testing groups, 
a few Ss found it impossible to comply with this request. 
Since Ss answered items at different rates of speed, it 
seems unlikely that individual item responses were 
seriously affected. 


Analysis into Tentative Scales 


A direct and economical means of developing 
independent tests that have an adequate degree of 
discriminating power has been suggested by DuBois, 
Loevinger, and Gleser (8). This technique is known as
maximizing the discriminating power of a test. This 
method avoids a number of difficulties inherent in the 
application of factor analytic techniques to items and 
also appears to lack the principal limitations of other 
methods of maximizing test homogeneity (8, pp. 2-6). 

This technique was thus selected for clustering the set
of humor items into homogeneous tests. The major 
steps are listed below: 

Grouping of items into initial submatrices. It seemed
that the later clusters developed by Cattell and Luborsky
(5) might possibly be better than purely a priori
groupings of items. Thus, Cattell-Luborsky cluster
scores for OCS Class 51-A were intercorrelated, and on
the basis of the intercorrelations, the C-L clusters were
grouped into three matrices containing about 40 items
each.

¹ The details of the procedures and the results have
been filed with the American Documentation Institute.
Order document No. 4286 from the ADI Auxiliary
Publications Project, Photoduplication Service, Library
of Congress, Washington 25, D. C., remitting in
advance $1.75 for 35 mm. microfilm or $2.50 for photoprints.
Make checks payable to Chief, Photoduplication
Service, Library of Congress.



Analysis into tentative scales. Five tentative scales
were developed by the DuBois, Loevinger, and Gleser
method for maximizing the discriminating power of a
test. These scales and others, subsequently derived, will
henceforth be referred to by the letter of the matrix
from which they were evolved and a subscript corresponding
to the order of evolvement. The tentative scales
were labeled D;, Fs, Fs, Gi, and Gs. 

Selection of second item set. The second item set from
which the tentative scales were to be expanded into
final scales consisted of the 91 forced-choice items of
Form A of the C-L Humor Test. However, Ss were not
required to choose one of two paired jokes, but instead
made an independent rating of funny or dull for each
joke. This resulted in a pool of 182 items which, together
with Form B, was administered to a new sample of 400
officer candidates.

Analysis into final scales. It was the purpose of this
phase of the research to increase the discriminating
power of the tentative scales and to develop additional
scales from items of a new sample. In this analysis four
of the five tentative scales were retained and increased
in length. These scales were relabeled RD;, RFs, RGI,
RG;, the R standing for revised. Three additional scales
were evolved with labels R, S, and T. The seven scales
constituted the final homogeneous tests. Their intercorrelations
and distributional statistics are given in
Table 1, for an independent sample of 230 officer
candidates.


Psychological Interpretations of Final Scales 


On the basis of a number of experiments Sears (14) 
concluded that a joke may be analyzed into two ot 
elements: thematic and schematic. The x 
element is the technical structure or form of a Mer 
while the thema "constitutes the meaningful «uiid 
content, or plot, of a joke." It is the joke thema pally 
evokes the asocial or repressed needs, PA m 
Aggression, Superiority, and Sex. Such a jo ue uu 
imaginal presentation of a thema which BY) nace 
suddenly and gratuitously—the end sto 5 
the subjects needs, fused with n Play. The mita 
thema, then, brings added delight by satisfying "| 




fantasy some purely egocentric and usually hostile wish. 
The delight is not inhibited because it can escape, 
disguised by the playful social feelings engendered by 
the mirth-schema in which the mirth-thema is em- 
bedded” (13, p. 584). 


A tentative description within the Sears 
frame of reference of each of the final homo- 
geneous scales is presented below. A typical 
item is presented with each test for illustrative 
purposes.

RD;. The element common to the items of 
RD; is that of sexual play or sexual need. 
Sexual fulfillment is accepted as an integral 
part of life. However, this need for sexual ful- 
fillment results in frustrations which are not 
necessarily resolved by “conventional” social 
mores. 

It seems reasonable to hypothesize that RD;
will be highly weighted on the following factors
or scales: “Social Introversion” (Guilford &
Guilford [11]), “Adventurous Cyclothymia
vs. Inherent Withdrawn Schizothymia” (Cattell
[3]), and “Social Introversion” (Mosier [12]).


Example: The census taker viewed Mrs. Jones and the 
six tots of varying ages around her with a puzzled 
frown. He seemed particularly interested in a squirming
infant in her arms. “I don’t quite understand you,” he 
admitted. “Did you say that your husband died six 
years ago?” “Yes, sir,” she replied emphatically. “He 
died, but I didn’t.” 


RF. The items of this test have in common 
the display of active self-assertion by an in- 
dependent, dominant, and somewhat uncon- 
ventional individual. 

It is hypothesized that RF» affords a subtle 
measure of the “Dominance” factor reported 
by Cattell (3), the “Lack of Agreeableness” 
factor of Guilford and Guilford (11), or inversely,
the “Sociable” factor of Thurstone (15).


Example: One prehistoric man to another: ‘Now that 
we've learned to communicate with each other—shut 
up! 


RGı. In all items of this test, one detects 
the presence of a scheming, manipulative 
individual who is directing the action of his 
associates and taking full advantage of all
opportunities to assert himself. This assertion 
usually takes aggressive form of the extra- 
punitive type. In some cases the aggression 
is explicit, as in a hard, cutting retort to a 




rather stupid, dull, or otherwise uncalled for 
remark by an overly inquisitive, annoying, or 
generally dull character. In other items the
aggression is implicit, as in the case of the 
clever, rather malicious opening statement 
designed to allow the replying individual to 
make himself appear ridiculous. In all cases 
the directing influence of a shrewd, exploita- 
tive, and somewhat aggressive individual is 
evident. 

It is suggested that RG1 may be related to
the variance which the 16PF Questionnaire 
Factor L is purported to measure, “Paranoid 
Schizothymia vs. Trustful Accessibility" (5). 


Example: A famous musician once asked a man why he 
forever sang one tune. 

“Because it haunts me,” replied the man. 

“No wonder,” said the musician, “you are con- 
tinually murdering it.” 


RG2. The items of RG2 are characterized by
a common element consisting of a general 
lack of social insight and sophistication on the 
part of one character. In nearly all cases a 
naive individual (usually juvenile) makes a 
statement which has the effect of upsetting 
the equilibrium or poise of the other individual. 


Example: Willie: “Papa, why don’t you go away so I
can see Mama dump that salesman off the fire escape?”
Papa: “Why, Willie, what on earth are you talking
about?”
Willie: “Well, I heard her tell him that she’d tip
him off as soon as you left.”


R. The central thema of this test deals with 
the ridicule of authority figures (i.e., judges, 
ministers, doctors, teachers, parents, etc.). 

It is hypothesized that individuals obtaining 
a low score on Test R are those who identify 
with the ridiculed authority figure and are 
prototypical of the authoritarian personality 


(Adorno et al. [1]).


Example: “I rise for information,” said a member of the
legislative body.
“I am very glad to hear it,” said a bystander, “for no
man needs it more.”


S. The central element of Test S appears 
to be a passive acceptance of inferior or 
socially disapproved behavior by the leading 
character. In each case the individual reveals 
his own intellectual or moral inertia. There is 
no attempt to alter behavior in a more desirable
direction. Actually, the principal character
often seems satisfied with his resignation. The 
general implication is that the individual is 
too weak to better himself and so merely 
accepts his intellectual or moral inferiority. 


Example: An intoxicated man, walking along the street, 
regarded the moon with much contempt. “You needn’t 
feel so proud,” he said, “you are full only once a month, 
and I am full every night.”


T. The humor of this test seems to have its 
basis in the slightly cynical realism displayed 
by the principal characters. It is hypothesized 
that high scorers on Test T will be those in- 
dividuals who are sensitized to the foibles of 
men or women in everyday real-life situations.
The common element of Test T is one which
deals with the contrast between human pretension
and performance. It appears that this
test will relate to the emotional stability vs.
general neuroticism factor suggested by
Eysenck (10) and Cattell (3), and to the lack
i ae factor as suggested by Guilford 
(11).
Example: “Did he take his misfortunes like a man?” 
“Precisely. He laid the blame on his wife.” 


CONCLUSIONS 

The following conclusions are drawn from 
this study: 

1. An analysis of both Form A and Form B 
of the Cattell-Luborsky Humor Test by a 
recently proposed cluster method has resulted 
in the development of seven scales which 
appear to be superior, at least statistically, to 
the scales previously derived for this test. 

2. The apparent content of each of the seven 
scales has been briefly described and related 
to other factors and scales. 

3. The analysis of reactions to humor stim- 
uli does seem to provide a potentially effective 


and subtle means for the measurement of
personality; however, validation is necessary.


REFERENCES 


1. Adorno, T. W., Frenkel-Brunswik, Else, Levinson,
D. J., & Sanford, R. N. The authoritarian
personality. New York: Harper, 1950.

2. Andrews, T. G. A factorial analysis of responses
to the comic as a study in personality. J. gen.
Psychol., 1943, 28, 209-224.

3. Cattell, R. B. Personality, a systematic theoretical
and factual study. New York: McGraw-Hill,
1950.

4. Cattell, R. B., & Luborsky, L. B. Personality
factors in response to humor. J. abnorm. soc.
Psychol., 1947, 42, 402-421.

5. Cattell, R. B., & Luborsky, L. B. Handbook for
the C-L Humor Test of Personality. Champaign,
Ill.: Institute for Personality and Ability Testing,
1947.

6. Cattell, R. B., Luborsky, L. B., &
Revised handbook for the C-
Personality. Champaign, Ill.: Institute for
Personality and Ability Testing, 1950.

7. Cattell, R. B., Saunders, D. R., & Stice, G.
Handbook for the 16 PF Test. Champaign, Ill.:
Institute for Personality and Ability Testing,
1951.

8. DuBois, P. H., Loevinger, Jane, & Gleser,
Goldine C. The construction of homogeneous
keys for a biographical inventory. USAF Hum.
Resour. Res. Cent., Res. Bull., 1952,

9. Eastman, M. The sense of humor. New York:
Scribner’s, 1921.

10. Eysenck, H. J. Dimensions of personality. London:
Kegan Paul, 1947.

11. Guilford, J. P., & Guilford, Ruth B. Personality
factors S, E, and M, and their measurement. J.
Psychol., 1936, 2, 109-127.
12. Mosier, C. I. A factor analysis of c
tendencies. Psychometrika, 1937, 2, 2 
13. Murray, H. A. Explorations in personality. New
York: Oxford Univer. Press, 1938.

14. Sears, R. N. Dynamic factors in the psychology of
humor. Unpublished doctor's dissertation,
Harvard Univer., 1934.

15. Thurstone, L. L. The dimensions of temperament.
Psychometrika, 1951, 16, 11-20.


Received July 6, 1953. 


SOME EFFECTS OF UNEQUAL DISTRIBUTION OF INFORMATION
UPON GROUP PERFORMANCE IN VARIOUS
COMMUNICATION NETS¹

MARVIN E. SHAW²

University of Wisconsin³


Since Bavelas (1) first presented a mathematical
model for group structures (or communication nets),
several studies (2, 4, 5) have been concerned with finding
behavioral correlates of the various measures
which the model generates. The measure that
has so far proved most significant is centrality.
An index of centrality for any particular group
structure is obtained by first counting the
minimum number of links (communication
channels) which must be crossed in order for
a given position in the net to communicate
with every other position in the net. This
count is called the sum of distances for that
position. Individual centrality is then computed
by dividing the total sum of distances
for the entire net by the sum of distances for
the individual position, and net centrality is
computed by summing the individual centralities
in the net.

Centrality, according to Leavitt (5), correlates
well with the behavior of people in
different kinds of structures. He required
groups to solve simple problems and compared
their performance in different nets. Centrality
accounted reasonably well for differences in
accuracy, total activity, satisfaction of group
members, emergence of a leader, and organization
of the group. In Leavitt’s study, however,
the information from which the group solved
its problems was distributed equally among
members of the group. It is a reasonable
hypothesis that distribution of information,
as well as centrality, may be an important
variable in group behavior. The purpose of
the present study is to examine such a hypothesis.

¹ This research was supported in whole or in part by
the United States Air Force under contract AF 18(600)-,
monitored by the Human Resources Research
Institute, Maxwell Air Force Base, Alabama.

² The present paper represents a portion of the dissertation
presented in partial fulfillment of the requirements
for the degree of Doctor of Philosophy at the
University of Wisconsin, 1953. The writer wishes to
express his appreciation to Dr. J. C. Gilchrist, under
whose supervision this investigation was conducted,
for his guidance, helpful suggestions and constructive
criticisms in every phase of the investigation.

³ Now at The Johns Hopkins University.

The reasoning was as follows: centrality
correlates with problem-solving behavior
because the greater the centrality of a position
the more information is available to the
individual in that position. If this be so, then
distributing information unequally in the
first place by giving one position more information
than another ought to have the same
consequences as increasing the centrality of
that position.⁴ Therefore, an experiment was
designed in which centrality could be studied
simultaneously with information distribution
in order to evaluate the relative contributions
and interactions of the two variables.


METHOD 


Apparatus and materials. In the present study three 
four-cell nets were selected for investigation: the wheel, 
the slash, and the circle. These nets are represented 
diagrammatically in Fig. 1, and their chief characteris- 
tics are given in Table 1. These nets were selected be- 
cause the net centralities progress in equal steps from 
16.0 for the circle to 16.8 for the wheel. 
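
The centrality index defined in the introduction can be checked numerically. The following Python sketch is only an illustration, not the procedure used in the study. The link lists are assumptions inferred from the values in Table 1 (B as the hub of the wheel, B-C as the fifth link of the slash, and an A-B-D-C ring for the circle), and the function names are hypothetical.

```python
from collections import deque

# Assumed link lists (not given explicitly in the text); chosen so that the
# computed values agree with Table 1.
NETS = {
    "wheel": [("A", "B"), ("B", "C"), ("B", "D")],
    "slash": [("A", "B"), ("A", "C"), ("B", "C"), ("B", "D"), ("C", "D")],
    "circle": [("A", "B"), ("B", "D"), ("D", "C"), ("C", "A")],
}


def sum_of_distances(links, start):
    """Minimum number of links from `start` to every other position,
    summed over positions (breadth-first search)."""
    neighbors = {}
    for a, b in links:
        neighbors.setdefault(a, set()).add(b)
        neighbors.setdefault(b, set()).add(a)
    dist = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in neighbors[node]:
            if nxt not in dist:
                dist[nxt] = dist[node] + 1
                queue.append(nxt)
    return sum(dist.values())


def centralities(links):
    """Return (total sum of distances, individual centralities, net centrality)."""
    positions = sorted({p for link in links for p in link})
    sums = {p: sum_of_distances(links, p) for p in positions}
    total = sum(sums.values())
    # Individual centrality: total sum of distances / position's own sum.
    individual = {p: total / sums[p] for p in positions}
    # Net centrality: sum of the individual centralities.
    return total, individual, sum(individual.values())


for name, links in NETS.items():
    total, individual, net = centralities(links)
    print(name, total, {p: round(c, 2) for p, c in individual.items()}, round(net, 2))
```

Under these assumed link lists the sums of distances are 18, 14, and 16, and the net centralities are 16.8 for the wheel and 16.0 for the circle. The unrounded slash value is 16.33; the 16.4 in Table 1 is consistent with summing the individual centralities after rounding each to one decimal place.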

The apparatus consisted of a room which was parti- 
tioned into four cubicles. Each cubicle was connected 
to every other cubicle by means of a slot through which 
message cards could be passed. These slots were fitted 
with overlapping cloth "curtains" so that it was impossible
for one subject (S) to see into another’s cubicle.
The frames supporting these curtains were painted to 
indicate the cubicle into which the slot opened. Each 
cubicle was supplied with an electrical switch which, 
when thrown, turned out a light on E's signal panel. 

The following is an example of the type of problem 
used: 

Statement of problem: 

A small company is moving from one office 
building to another. It must move four kinds of 
equipment: (1) Chairs, (2) Desks, (3) Filing 
Cabinets, and (4) Typewriters. How many 
trucks are needed to make the move in one 
trip? 

⁴ The assumption that increased information is
equivalent to increased centrality is undoubtedly an 
oversimplification of a rather complex process. Altering 
the proportions of the total information assigned to the 
various positions almost certainly changes other charac- 
teristics of the net as well as centrality. However, this 
oversimplification is perhaps justified in this early 
stage of theoretical development. 




Items of information needed to solve: 


1. The company owns a total of 12 desks. 

2. The company owns a total of 48 chairs. 

3. The company owns a total of 12 type- 
writers. 

4. The company owns a total of 15 filing
cabinets. 

5. One truck will carry 12 typewriters and 
nothing else. 

6. One truck will carry 3 desks and nothing 
else. 

7. One truck will carry 5 filing cabinets and 
nothing else. 

8. One truck will carry 24 chairs and nothing 
else. 
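
The arithmetic behind this sample problem can be sketched in a few lines of Python. The dictionary names are hypothetical. Because each truck carries one kind of equipment "and nothing else", the sketch rounds up within each kind and then adds the counts.

```python
import math

# Quantities from items 1-4 and truck capacities from items 5-8.
owned = {"desks": 12, "chairs": 48, "typewriters": 12, "filing cabinets": 15}
per_truck = {"desks": 3, "chairs": 24, "typewriters": 12, "filing cabinets": 5}

# Trucks cannot be shared between kinds of equipment, so each kind is
# rounded up separately before the counts are added.
trucks = {item: math.ceil(owned[item] / per_truck[item]) for item in owned}
total_trucks = sum(trucks.values())

print(trucks)
print("Trucks needed:", total_trucks)
```

With the eight items as stated, this gives 4 trucks for desks, 2 for chairs, 1 for typewriters, and 3 for filing cabinets, or 10 in all. No single item is sufficient, so the group must pool all eight.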


Two other problems of the same type were em- 
ployed. One of these required the Ss to determine 
the time of arrival of a plane at a specified town, and 
the other required them to determine the amount of 
gasoline required to move vehicles from one motor pool 
to another. For each of these three problems, the 
statement of the problem was typed completely on each 
of four separate cards and each item of information 
was typed on a separate card. The materials for a 
single problem consisted of these twelve white cards. 

Subjects. The Ss were students enrolled in psychology
or sociology courses at the University of
Wisconsin. Ninety groups (sixty exclusively female and
thirty exclusively male) were run. Each of these groups
was assigned at random to one of six experimental
conditions, subject to the restriction that five groups of
males and ten groups of females be assigned to each
experimental condition. The six experimental conditions
represented the possible combinations of two types
of information distribution and three patterns of
communication.
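
A restricted random assignment of this kind can be sketched as follows. This is a minimal Python illustration under stated assumptions: the group labels, the seed, and the function name are hypothetical, and the text does not describe how the randomization was actually carried out.

```python
import random
from itertools import product

# Six conditions: two information distributions crossed with three nets.
CONDITIONS = list(product(["equal", "unequal"], ["wheel", "slash", "circle"]))


def assign_groups(seed=0):
    """Assign 60 female and 30 male groups at random, with 10 female and
    5 male groups per condition (hypothetical labels such as 'female-01')."""
    rng = random.Random(seed)
    assignment = {}
    for sex, n_groups, per_condition in [("female", 60, 10), ("male", 30, 5)]:
        # One slot per group; each condition appears `per_condition` times.
        slots = [cond for cond in CONDITIONS for _ in range(per_condition)]
        assert len(slots) == n_groups
        rng.shuffle(slots)
        for i, cond in enumerate(slots, start=1):
            assignment[f"{sex}-{i:02d}"] = cond
    return assignment


if __name__ == "__main__":
    for group, cond in sorted(assign_groups().items())[:5]:
        print(group, cond)
```

Shuffling a fixed list of condition slots, rather than drawing a condition independently for each group, is what guarantees the stated cell sizes of fifteen groups per condition.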

Procedure. Each group was required to solve the 
three problems mentioned above. Under conditions of 
equal distribution of information, for each problem 
each S was given one of the problem cards and two of 
the information cards. Under conditions of unequal 
distribution of information, the S in Position A was 
given one of the problem cards and five of the informa- 
tion cards, and each of the other Ss was given one of the 
problem cards and one of the information cards. The 
distribution of particular items among positions was 
randomized by shuffling the white cards before each 
session. The sequence in which the problems were 
presented was varied according to a systematic latin 
square such that each of the three problems was given 
first, second, and third an equal number of times, and 
each problem preceded and followed every other prob- 
lem an equal number of times. 
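
The two distribution conditions can be illustrated with a short Python sketch that deals the eight information cards of one problem. The card numbering 1-8 and the function name are assumptions made for illustration. Each S also receives one of the four problem cards, which is not modeled here.

```python
import random

POSITIONS = ["A", "B", "C", "D"]

# Number of information cards per position in each condition.
CARDS_PER_POSITION = {
    "equal": [2, 2, 2, 2],
    "unequal": [5, 1, 1, 1],  # Position A receives five of the eight cards
}


def deal_information_cards(distribution, seed=None):
    """Shuffle information cards 1-8 and deal them to positions A-D."""
    rng = random.Random(seed)
    cards = list(range(1, 9))
    rng.shuffle(cards)
    hands, start = {}, 0
    for position, n in zip(POSITIONS, CARDS_PER_POSITION[distribution]):
        hands[position] = sorted(cards[start:start + n])
        start += n
    return hands


if __name__ == "__main__":
    print(deal_information_cards("equal", seed=1))
    print(deal_information_cards("unequal", seed=1))
```

In both conditions every card is dealt exactly once, so the group as a whole always holds the full set of eight items. Only the share held by Position A changes.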

With the exception of communication net and distribution
of information, the procedure was exactly the
same for all groups. Each of the four Ss was brought
into the experimental room separately, seated according
to a randomized schedule, and requested to remain
quiet. The Ss were told the "names" of the members
of the group (i.e., the color used to identify each member),
the ones with whom each could communicate,
the method of communication, and the general nature
of the tasks.

FIG. 1. DIAGRAM OF THE THREE COMMUNICATION NETS
TO BE INVESTIGATED (panels: WHEEL, SLASH, CIRCLE; positions A, B, C, D)

TABLE 1
CHARACTERISTICS OF THE EXPERIMENTAL NETS

         Number of  Sum of     Net         Individual Centrality    Maximum Difference in
Net      Channels   Distances  Centrality    A     B     C     D    Individual Centralities
Wheel       3         18        16.8        3.6   6.0   3.6   3.6           2.4
Slash       5         14        16.4        3.5   4.7   4.7   3.5           1.2
Circle      4         16        16.0        4.0   4.0   4.0   4.0           0.0

When all three problems had been completed, each 
S was asked to fill out a three-part questionnaire.
Part 1 required sociometric choices of the member of
the group most preferred for six different activities
ranging from one almost identical to that just completed
to one almost completely unrelated to the activity
just completed. Part 2 required each S to indicate
on a nine-point scale how well he liked his job
in the group, how he would compare his position with
each of the other positions, how well he thought the
group members cooperated with each other, and how 
he would rate the performance of the group. Part 3 
required an answer to the question, “Did your group 
develop a system ——? If so, describe it briefly.” 


RESULTS 


The results of this study fall under four 
general headings: (a) time scores, (b) com- 
munication units, (c) errors, and (d) question- 
naire results. The first three were taken as 
indicators of efficiency; the questionnaire 
responses were used as indicators of morale 
and leadership. 

Time scores. Time to reach a solution can 
be measured in at least three ways: (a) the 
fastest single correct trial, (b) the average 
time per problem per position, and (c) the 
total elapsed time per problem. Analyses of 
variance applied to the results using the first 
two time measures revealed no significant 
differences. The means of total elapsed time 
per problem are presented in Table 2. Analysis 
of variance showed significant differences 
between trials (p < .001). This probably 
represents learning to use the net, since all 
the problems used were well within the ex- 
periences of the Ss. There were no over-all 
differences between nets or between distributions
of information. However, the nets × trials × distribution
interaction was significant (p < .01).

TABLE 2
MEANS OF TOTAL ELAPSED TIME PER PROBLEM IN MINUTES

          Equal Distribution          Unequal Distribution
Trials    Wheel    Slash    Circle    Wheel    Slash    Circle
1         16.12†   16.22†   17.71†    16.29†   16.57†   16.05†
2         11.68    10.98     9.30     11.57    11.31    10.10
3         10.09     8.52*    8.26*     9.30     8.61*    7.77*

* Separated as stragglers.
† Separated as a homogeneous group.

TABLE 3
MEAN TIME REQUIRED BY EACH POSITION IN MINUTES

                     Positions
Net            A        B        C        D
Wheel
  Equal      11.06     9.73*   11.23    11.14
  Unequal     8.61†   10.91    10.54    11.71
Slash
  Equal      10.50    10.02     9.94    10.43
  Unequal     8.56†   10.17    10.73    11.55*
Circle
  Equal      10.37    10.21     9.98     9.93
  Unequal    [values illegible in source]

* Separated as stragglers.


In order to determine as nearly as possible 
just where the differences occurred, Tukey’s 
gap test (7) was applied to the mean scores.⁵
The results of this test are indicated in Table 
2 by the symbols placed beside the separated 
means. The scores for the first trial differed 
as a group from those for all other trials, 
indicating that most of the learning occurred 
during the first trial. On the last trial, the 
slash and the circle required less time to reach 
a solution than did the wheel, regardless of 
information distribution. However, a precise 
relationship between net centrality and time 
was not found. 

The mean time required by each position is 
shown in Table 3. Analyses of variance for 
each net taken separately yielded essentially 
the same results for all three nets. Distribution 
as a main effect was not significant, but posi- 
tions and the groups X positions interaction 
were both significant (p < .001) for all three 
nets. The results of the gap test are indicated
by symbols. Under conditions of equal dis- 
tribution of information, only Position B in 
the wheel was separated. This means that 
only relatively large differences in individual 
centrality produce differences in time required 
to reach a solution. Under conditions of 
unequal distribution of information, the 
position receiving the most information
(Position A) required significantly less time 
to reach a solution than did any other position. 

Communication units. Communication units
were measured by counting the number of
separate items transmitted by the Ss in a


⁵ In this test and in all subsequent applications of
the gap test, the .05 level of confidence was accepted.


† Separated by a significant gap (Table 3).


TABLE 4
MEAN NUMBER OF ITEMS REQUIRED TO REACH A
SOLUTION BY EACH NET

          Equal Distribution          Unequal Distribution
Trials    Wheel    Slash    Circle    Wheel    Slash    Circle
1         53.5*    74.9     76.7      51.4*    82.6*    72.2
2         58.0     70.1     59.9      48.4*    76.9     68.8
3         44.9*    62.5     60.8      43.3*    65.3     58.1

* Separated as stragglers.


given net or position, since it was possible for 
one card to contain several such items.⁶ An
item is here defined as any simple sentence, 
or any meaningful part of a complex or com- 
pound sentence. Table 4 presents the mean 
number of items transmitted by each net. 
Analysis of variance indicated that there were 
significant net differences (p < .001), trial 
differences (p < .001) and second-order 
interaction effects (p < .001). Examination 
of Table 4 reveals that the chief difference 
between nets was that the wheel required 
fewer items to reach a solution than did the 
slash and the circle. Generally, the number 
of items per trial decreased with practice in 
much the same way as did the time scores. 
The significant interaction is apparently due 
to the relatively large number of items trans- 
mitted by the wheel (equal distribution) on 
Trial 2, and by the slash (unequal distribution)
on Trial 1. For items transmitted, as
for the time scores, net centrality was found
to be an inadequate index.

⁶ Results of an analysis of number of cards used
were essentially the same as the results reported here.

TABLE 5
MEAN NUMBER OF ITEMS COMMUNICATED PER
PROBLEM FOR EACH POSITION

                     Positions
Net            A        B        C        D
Wheel
  Equal       7.8     27.4‡     9.2      7.9
  Unequal    10.9*    24.0†     6.3      6.6
Slash
  Equal      14.9†    19.6     20.6     14.1†
  Unequal    19.5     21.2     20.4     13.7‡
Circle
  Equal      15.8     16.7     17.4     15.6
  Unequal    18.8     15.4     16.8     15.7

* Separated as stragglers.
† Separated by a significant gap as a homogeneous group.
‡ Separated by a significant gap.

The mean number of communication items 
per problem for each position is shown in 
Table 5. Analyses of variance for each net 
taken separately showed that in the slash 
positions differed significantly (p < .001), and 
in the wheel positions and the distribution X 
positions interaction were significant (p < 
.001). Examination of the means in Table 
5 shows that in all nets the individual cen- 
trality indices were related to the number 
of items communicated, with those positions 
having the higher individual centrality send- 
ing more messages. In the wheel and the slash, 
the person receiving the most information 
sent more items than did the equivalent posi- 
tion under conditions of equal information 
distribution. Again, an inverse relationship 
was found between the time required to reach
a solution and the number of items transmitted.

Errors. Two types of errors were computed:
(a) decision errors, defined as an incorrect
answer reported by a member of the group,
and (b) information errors, defined as the
transmission of an item of information different
from that contained in, or derivable
from, the items of information given to the
group as a part of the problem to be solved.
Since the error scores were decidedly skewed
and the variance was heterogeneous, the
Mann-Whitney U test (6) was used instead
of the usual analysis of variance. Only the
differences produced by the information distribution
variable were significant (p < .001).
The mean number of decision errors was 2.6
and 2.1 for equal and unequal distribution,
respectively; the mean number of information
errors was 9.8 and 6.2 for equal and unequal
distribution, respectively. This means that
unequal information distribution decreased
the number of both decision and information
errors.
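
For readers unfamiliar with the rank test named above, the following Python sketch computes the Mann-Whitney U statistic directly from its definition, using midranks for tied scores. The function name is hypothetical and the error counts in the usage lines are invented purely for illustration; they are not data from this study.

```python
def mann_whitney_u(x, y):
    """Return (U_x, U_y) for two independent samples, using midranks for ties."""
    # Pool the scores, remembering which sample each one came from.
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])
    ranks = [0.0] * len(pooled)
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j][0] == pooled[i][0]:
            j += 1
        # Tied scores at sorted positions i..j-1 share the mean of ranks i+1..j.
        midrank = (i + 1 + j) / 2
        for k in range(i, j):
            ranks[k] = midrank
        i = j
    rank_sum_x = sum(r for r, (_, grp) in zip(ranks, pooled) if grp == 0)
    n_x, n_y = len(x), len(y)
    u_x = rank_sum_x - n_x * (n_x + 1) / 2
    u_y = n_x * n_y - u_x
    return u_x, u_y


# Hypothetical, made-up error counts for two small sets of groups.
equal_distribution = [12, 9, 15, 7, 10]
unequal_distribution = [6, 8, 5, 9, 4]
print(mann_whitney_u(equal_distribution, unequal_distribution))
```

The smaller of the two U values is the one ordinarily referred to a table of critical values. Because the statistic depends only on ranks, it is unaffected by the skewness and unequal variances noted for the error scores.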


The corrective power of the nets as measured
by the ratio of decision errors to information
errors is given in Table 6. Analysis of
variance showed significant differences between
nets (p < .05), but information distribution
had no effect. The gap test showed
that the wheel differed significantly from the
other two nets; i.e., the wheel was less effective
in this respect, since a larger numerical
value indicates less corrective power. This
finding is in agreement with the findings with
respect to time scores and number of items
transmitted: the wheel differs from the other
nets but there is no precise relationship between
corrective power and net centrality.
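
The corrective-power index can be sketched as follows. This Python illustration assumes that the ratio is computed for each group and then averaged over groups, which is consistent with the means and SDs reported in Table 6 but is not stated explicitly. The function name and the error counts are hypothetical.

```python
from statistics import mean


def corrective_power(groups):
    """Mean ratio of decision errors to information errors over groups.

    `groups` is a list of (decision_errors, information_errors) pairs.
    Groups with no information errors are omitted, as in the note to Table 6.
    A larger value indicates less corrective power.
    """
    ratios = [d / i for d, i in groups if i > 0]
    return mean(ratios) if ratios else None


# Hypothetical counts for four groups in one net; the third group made no
# information errors and is therefore left out of the average.
example_groups = [(2, 8), (1, 4), (0, 0), (3, 6)]
print(corrective_power(example_groups))
```

In this made-up example the three usable ratios are .25, .25, and .50, giving a mean of about .33.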
Questionnaire results. The data obtained
from the questions requiring sociometric
choices of the S most preferred in various
situations were analyzed for each net taken
separately. In the wheel, 36 chi squares were
computed to test this hypothesis and 28 of
these were significant (p < .05). In the slash,
19 chi squares were significant (p < .05) out
of 24 computed. In the circle, only 4 chi squares
were significant (p < .05) out of 48 computed.
These results offer strong support for the
hypothesis that Ss in the positions having the
higher centralities are chosen more frequently
than Ss in positions having lower centralities.
Comparisons of choices under the two conditions
of information distribution yielded only
nonsignificant chi squares. The S in the position
receiving the most information was
chosen more frequently under conditions of
unequal distribution of information than was
the equivalent position under conditions of
equal distribution of information in 16 of the
18 comparisons made, but the differences were
small and hence did not result in a significant
change in the over-all distribution of choices.

TABLE 6
CORRECTIVE POWER OF NETS: RATIO OF DECISION
ERRORS TO INFORMATION ERRORS*

        Equal Distribution                    Unequal Distribution
        Wheel        Slash        Circle      Wheel    Slash    Circle
Mean    .51          .33          .30         .77      .34      .25
SD      [illegible]  [illegible]  .33         .83      .37      .4

* Groups making no information errors were omitted when
computing corrective power.

Examination of the sociometric choices 
revealed a similar pattern with respect to the 
number of times a single S was chosen unani- 
mously as the one most preferred. Under 
conditions of equal information distribution, 
the mean number of unanimous choices was 
6.5, 3.2, and 0.8 for the wheel, the slash, and 
the circle, respectively; under conditions of 
unequal information distribution, the cor- 
responding means were 6.0, 1.5, and 2.3. The 
nonparametric test developed by Friedman 
(3) and described by Wilcoxon (8, pp. 7-9) was 
used to analyze these data. Over-all differences
were highly significant (p < .001), but when
net and distribution effects were isolated,
only the net differences were significant (p <
.01). However, the net × distribution interaction
was significant (p < .01), indicating
the effects of the net were different for different
distributions.

Analyses of both the number of times 
chosen and the number of unanimous choices 
indicate that emergence of a leader, as indi- 
cated by sociometric choice, is directly related 
to the centrality index when only the equal 
information distribution situation is con- 
sidered. In the unequal distribution of in- 
formation situation, the number of unanimous 
choices was increased in the circle, decreased 
in the slash, and essentially unchanged in the 
wheel. This finding indicates that emergence 
of leadership is determined by the number of 
positions in the net having the highest in- 
dividual centrality (inverse relationship) and 
the maximum difference in individual centrali- 
ties (direct relationship). 

The mean ratings per net for the questions
dealing with job satisfaction, positions comparison,
group cooperation, and group performance
were analyzed by applying analysis
of variance to each question taken separately.
The results may be summarized as follows:
(a) Ss in the wheel enjoyed their job in the
group significantly less than did the Ss in
the slash and in the circle (p < .001). (b) As
a group, the Ss showed no significant differences
in their ratings of the relative value of
positions in the net. (c) The ratings of group
cooperation showed slight but significant net
differences (p < .05); the Ss in the wheel
rated the cooperation of their group lower
than did the Ss in the slash who in turn rated
the cooperation of their group lower than did
the Ss in the circle. (d) The Ss in the wheel
rated the performance of their group lower
than did the Ss in the slash and the circle
under all conditions; under conditions of equal
distribution of information, the slash and the
circle did not differ in this respect, but under
conditions of unequal information distribution,
the Ss in the slash rated the performance of
their group higher than did the Ss in the
circle (p < .05).

Taken together, the rating scale results 
indicate that under conditions of equal in- 
formation distribution, centrality was not 
clearly related to group morale (where morale 
is defined as general satisfaction with the 
group's activity). Unequal distribution of 
information affected morale in only one of 
the three nets; it seems safe to conclude that 
information distribution is not an effective 
variable as far as group morale is concerned. 
These results are in agreement with the find- 
ings with respect to time scores, communica- 
tion units, and corrective power in that the 
wheel differed from the other two nets and 
in that the information distribution variable 
had little effect upon the group scores. 

The rating scale data were also analyzed
according to the ratings of individual positions,
with the following results: (a) There
were no significant differences between the
ratings of job satisfaction and group cooperation.
(b) The ratings of various positions differed
significantly (p < .01) under all experimental
conditions, and the distribution ×
positions interaction was significant (p <
.05) for the wheel and the circle. The positions
having higher centrality indices were rated
higher than those positions having lower
centrality indices under both conditions of
information distribution and in all three
nets. In the wheel and the circle, the position
receiving most of the information was rated
higher under conditions of unequal distribution
than was the equivalent position under
conditions of equal information distribution.
(c) In the slash, all positions rated the group
performance higher under conditions of unequal
information distribution than under
conditions of equal information distribution
(p < .001).

These results lead to the conclusion that 
there is some relationship between individual 
centrality and individual morale, but this 
relationship is tied very closely to the position 
preference component. The information dis- 
tribution variable produced equivocal results: 
additional information either had no effect 
upon individual morale (as indicated by job 
satisfaction and group cooperation ratings) 
or increased the individual morale (as shown 
by the positions comparisons ratings in the 
wheel and the circle and the group performance 
ratings in the slash). 

Responses to the question concerning group 
organization revealed that all groups used one 
of two forms of organization: (a) all informa- 
tion was sent to one S who solved the problem 
and sent the answer out to the other Ss, or 
(b) all information was sent to all Ss and then 
each S solved the problem independently. The 
first method was used by only one group in 
the slash and by only one group in the circle, 
but was used by eleven groups in the wheel. 
These data do not support the earlier findings 
that high net centrality leads to more con- 
sistent group organization. 


Discussion 


The data from this study make it clear that 
centrality correlates with behavioral measures 
only when differences among positions within 
the net are considered. Similarly, altering the 
distribution of information changes the be- 
havior of Ss in the various positions, but has 
little effect upon the performance of the group. 

Let us examine some of the positive rela- 
tionships among the variables in the present 
study. Subject to the limitations imposed by 
the conditions under which the data were 
collected, the results of this experiment offer 
some evidence for the following hypotheses: 

1. The time required for an individual in a 
position to complete an activity varies in- 
versely with individual centrality. 

2. Individual morale, number of items trans- 
mitted by an individual in a position, and the 
probability that an individual in a given
position will be chosen as a leader vary directly
with individual centrality.

3. The emergence of a leader in a net (as 
indicated by unanimous choice) is determined 
by the number of positions having the highest 
individual centrality index in that net and 
the maximum difference in individual centrali- 
ties. 

4. Increasing or decreasing the amount of 
information initially assigned to an individual 
in a given position has an effect similar to 
respectively increasing or decreasing the in- 
dividual centrality index of that position. 

How can these relationships be explained? 
Leavitt believed that centrality was effec- 
tive primarily because it determines the de- 
gree of independence of the individual in a 
position. The present study offers no evidence 
against this explanation, but there is another 
possible explanation of the relationship be- 
tween individual centrality and morale that 
perhaps should be mentioned. This view would 
assume that centrality, because of its correlation
with number of channels,⁷ determines
the amount of communication activity en- 
gaged in by an individual and hence his 
enjoyment of the task. This draws some 
support from an ad hoc analysis of time per 
message which showed that Ss with higher 
centralities spent significantly less time per 
message than did Ss with lower individual centralities
(p < .001). The implicit assumption
is that an individual enjoys a task which keeps 
him occupied more than he does a task 
which permits relatively long periods of inac- 
tivity. 

The reason for the relationship between 
individual centrality and number of items 
transmitted becomes obvious when it is re- 
membered that individual centrality and 
number of channels are perfectly correlated 
and that the most frequent method of solu- 
tion was for each S to send all available in- 
formation to all other Ss with whom he was 
connected. Thus it was inevitable that more 
channels would require more messages. 

It has been pointed out that the emergence 
of leadership in a net is determined by the 
number of positions having the highest 
centrality index in the net and the maximum 


7 Centrality and number of channels are perfectly 
correlated in the nets used in this experiment, but this 
is not true for all possible nets. 


difference in individual centralities. The inter- 
pretation would seem to be that the less 
ambiguity associated with the distinction 
between the position having the highest 
individual centrality and the other positions, 
the more agreement there is among group 
members regarding which S is most desirable 
for the leadership role. 
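
The centrality quantities used in the last few paragraphs can be illustrated with a short sketch. It assumes the usual Bavelas-type index, in which a position's centrality is the sum of all distances in the net divided by the sum of distances from that position; this report does not restate the formula, so that definition is an assumption here. The four-person wheel follows the labeling in which B is the center, the circle's labeling is hypothetical, and the function names are illustrative only.

```python
from collections import deque


def distances_from(net, start):
    """Breadth-first search: number of links from start to each position."""
    dist = {start: 0}
    queue = deque([start])
    while queue:
        here = queue.popleft()
        for neighbor in net[here]:
            if neighbor not in dist:
                dist[neighbor] = dist[here] + 1
                queue.append(neighbor)
    return dist


def centrality_indices(net):
    """Assumed index: total of all distances / distances from the position."""
    sums = {p: sum(distances_from(net, p).values()) for p in net}
    total = sum(sums.values())
    return {p: total / s for p, s in sums.items()}


def leadership_summary(net):
    """Channels per position plus the two quantities named in hypothesis 3."""
    c = centrality_indices(net)
    top = max(c.values())
    return {
        "channels": {p: len(net[p]) for p in net},
        "centrality": c,
        "n_top_positions": sum(1 for v in c.values() if abs(v - top) < 1e-9),
        "max_difference": top - min(c.values()),
    }


# Four-person nets; each position lists the positions it is linked to.
WHEEL = {"A": ["B"], "B": ["A", "C", "D"], "C": ["B"], "D": ["B"]}
CIRCLE = {"A": ["B", "D"], "B": ["A", "C"], "C": ["B", "D"], "D": ["C", "A"]}

for name, net in (("wheel", WHEEL), ("circle", CIRCLE)):
    print(name, leadership_summary(net))
```

Under this assumed definition the wheel has a single position with the highest index and a large maximum difference, whereas every position in the circle has the same index, which is the contrast the leadership hypothesis turns on.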

The effect of the communication net upon 
corrective power requires some explanation.
It appears that in the wheel the only ways in 
which an error could be detected would be 
for an S to detect his own error, for an S to 
pass on to another S a set of computations
containing an error, or for an S to pass two 
conflicting statements to the same S. In the 
slash and in the circle an additional oppor- 
tunity for error detection was possible: an 
S might receive information about the same 
objects or events from two or more different 
sources. This added opportunity for error 
detection and correction probably accounts 
for the superiority of the slash and the circle 
in this respect. 

In summary, it is believed that individual 
centrality determines the behavior of in- 
dividuals in various positions within nets via 
the limitations which it imposes on these 
individuals. Centrality, however, bears no 
simple relationship to independence of action. 
The underlying processes which determine 
behavior vary with the particular aspect of 
behavior which is being measured. 


SUMMARY 

The purposes of the present experiment 
were (a) to test the effects of unequal dis- 
tribution of information upon leadership, 
morale, and problem-solution efficiency in 
various communication nets, and (b) to check
the findings of previous investigators, using 
a set of slightly more complex problems and 
different communication nets. It was believed 
that the primary source of differential forces 
would be centrality, and that increasing 
or decreasing the amount of information 
initially assigned to a given position would be 
equivalent to respectively increasing or de- 
creasing the centrality index of that position. 

Fifteen groups were randomly assigned to 
each of six experimental conditions, representing
the six possible combinations of two
types of information distribution (equal and 
unequal) and three communication nets (the 
wheel, the slash, and the circle). Each group 
was required to solve three eight-item prob- 
lems. 

The results were interpreted as supplying 
some evidence for the following hypotheses: 

1. Time required for an individual in a 
position to complete an activity varies in- 
versely with individual centrality. 

2. Individual morale, number of items 
transmitted by an individual in a position, 
and the probability that an individual in that 
position will be chosen as a leader vary di- 
rectly with individual centrality. 

3. The emergence of leadership in a net 
varies inversely with the number of positions 
having the highest individual centrality index 
in that net and directly with the maximum 
difference in individual centralities. 

4. Increasing or decreasing the amount of 
information initially available to an individual 
in a given position has an effect similar to 
respectively increasing or decreasing the 
individual centrality index of that position 
or the number of channels available to that 
position. 


REFERENCES 


1. BAVELAS, A. A mathematical model for group
structure. Appl. Anthropol., 1948, 7, 16-30.

2. BAVELAS, A. Communication patterns in task-oriented
groups. J. acoust. Soc. Amer., 1950, 22,
725-730.

3. FRIEDMAN, M. Use of ranks to avoid the assumption
of normality implicit in the analysis of variance.
J. Amer. statist. Ass., 1937, 32, 675-701.

4. HEISE, G. A., & MILLER, G. A. Problem-solving by
small groups using various communication nets.
J. abnorm. soc. Psychol., 1951, 46, 327-335.

5. LEAVITT, H. J. Some effects of certain communication
patterns on group performance. J. abnorm.
soc. Psychol., 1951, 46, 38-50.

6. MANN, H. B., & WHITNEY, D. R. On a test of
whether one of two random variables is stochastically
larger than the other. Ann. math. Statist.,
1947, 18, 50-60.

7. TUKEY, J. W. Comparing individual means in the
analysis of variance. Biometrics, 1949, 5, 99-114.

8. WILCOXON, F. Some rapid approximate statistical
procedures. Stamford, Conn.: American Cyanamid
Co., 1949.


Received July 9, 1953.


SOME EFFECTS OF UNEQUAL DISTRIBUTION OF INFORMATION
IN A WHEEL GROUP STRUCTURE¹


J. C. GILCHRIST, MARVIN E. SHAW, AND L. C. WALKER 
University of Wisconsin 


In the immediately preceding report,
Shaw (1) investigated some effects of
unequal distribution of problem-related
information in group structures. In general,
he found that information distribution within
structures had no effect upon over-all structure
measures of time, message units, errors,
or morale, although decreasing or increasing
the amount of information initially available
to an individual within a structure had an
effect similar to respectively decreasing or
increasing the individual centrality index of
that position.

There are, however, other distributions
which affect the generality of this finding. In
any structure having central person(s), information
may be distributed so that a
peripheral person receives more or so that a
central person(s) receives more. Shaw's experiment
tested only the former possibility;
the present experiment tests both possibilities
using the four-person wheel structure.


PROCEDURE 


The apparatus, problems, questionnaires, and pro- 
cedures were identical to those of Shaw. 

The Ss were students enrolled in psychology and/or 
sociology courses at the University of Wisconsin. 
Forty-five groups of four members each were run. 
Thirty of these groups were composed of females and 
fifteen were composed of males. Each of these groups 
was randomly assigned to one of three experimental 
conditions, subject to the restriction that each experi- 
mental condition be assigned ten groups of females and 
five groups of males. The three experimental conditions 
were: (a) equal distribution, (b) unequal distribution
to the periphery, and (c) unequal distribution to the 
center. 

Each group was required to solve three problems 
randomly assigned one at a time. Under the condition 
of equal information distribution, for each problem 
each S was given a card stating the problem and two 
cards giving information required in the problem’s 
solution. Under conditions of unequal distribution, 
the S in the appropriate position (A in the peripheral 
condition and B in the central condition; see the
“wheel” in Fig. 1 of Shaw’s report) was given one of 
the problem cards and five of the information cards, 


1 This research was supported in whole or in part 
by the United States Air Force under Contract AF 
18(600)-340, monitored by the Human Resources 
Research Institute, Maxwell Air Force Base, Alabama. 


and each of the other Ss was given one of the problem 
cards and one of the information cards. 


RESULTS 


A simple within and between analysis of 
variance performed upon the times for the 
single fastest correct trial yielded an insignifi- 
cant value of F. 

Analyses of variance were performed upon 
the times of the fastest position per trial and 
slowest position per trial. Distribution was 
found to have no effect, either main or inter- 
action. The significant effects in both analyses 
were trials and groups within. The former 
indicated that learning was occurring; the 
latter that the sets of four persons differed 
from each other. (The groups within term is 
equivalent to an individual differences term 
in the usual psychological design.) 

A further breakdown of time scores involves 
positions within the structures. The time 
measures here for the main effects of distribu- 
tion and trials and their interactions differ 
from the previous two analyses in that here 
the design results in mean time appearing as 
means for those variables, i.e., the mean of the 
time of completion of the four positions’ times. 
Trials are again significant, and show a learn- 
ing effect. Positions and distribution X posi- 
tions are also significant in addition to in- 
dividual differences terms. Because position 
when tested by its interaction with distribution 
is not significant, only the interaction means 
are given in Table 1. Note particularly that 
the central position is affected adversely as to 
time under unequal central distribution. 

The measure of communication was the 
message unit defined as a simple sentence or a 
meaningful part of a compound or complex 
sentence. The significant terms of the analysis
of variance are discussed below. Trials show,
once again, a learning effect, i.e., fewer message
units per trial. Position, remaining significant
over and above the distribution X position
interaction, shows that the central person
sent 3.3 times as many message units as the
other positions. Table 2 summarizes the means
of the distribution X position interaction.




TABLE 1

MEANS AND RESULTS OF TUKEY'S GAP TEST (2) OF
TIME SCORES BROKEN DOWN BY POSITIONS

                  Information Distribution
Position     Equal     Unequal        Unequal
                       Peripheral     Central
A            11.04      9.05*         11.14
B             9.74*    10.91          10.54
C & D        11.18     11.13          11.37

* Separated as a straggler (p < .05).


TABLE 2

MEANS AND RESULTS OF TUKEY'S GAP TEST OF THE
DISTRIBUTION X POSITION INTERACTION OF
NUMBER OF MESSAGE UNITS TRANSMITTED

                  Information Distribution
Position     Equal     Unequal        Unequal
                       Peripheral     Central
A             9.69     12.24†          7.31
B            29.60*    27.00*†        31.82*
C & D        10.11      7.88           7.73

* Separated by a significant gap (p < .05) as a homogeneous
group.
† Separated as a straggler (p < .05).


Errors were analyzed as in Shaw's experi- 
ment. No significant differences were found on 
any of the error measures. 

Analysis of the sociometric questions using 
chi square demonstrates that information dis- 
tribution did not change the pattern of soci- 
ometric choices on the six questions. Of the 24 
chi squares computed between distributions, 
only one was significant. These chi squares
were computed for (a) the A position choosing,
(b) the B position choosing, (c) the C and D
positions combined choosing, and (d) all
positions combined choosing for each question.


On the other hand, chi squares computed
within information distribution groups show
that the choices reflect the group structure:
Positions A and C plus D chose B (p < .05 for 
25 of 36 chi squares). Position B chose ran- 
domly among A, C, and D. 
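
As a rough illustration of what "chose randomly" means in a chi-square setting, the sketch below tests a set of choice counts against equal expected frequencies. The counts are hypothetical, not data from this study, and the report does not describe exactly how its chi squares were set up, so the goodness-of-fit form used here is an assumption.

```python
def chi_square_uniform(observed):
    """Chi-square statistic for observed counts against equal expected counts."""
    n = sum(observed)
    expected = n / len(observed)
    return sum((o - expected) ** 2 / expected for o in observed)


# Tabled chi-square value for 2 degrees of freedom at the .05 level.
CRITICAL_05_DF2 = 5.991

# Hypothetical counts of B's choices among A, C, and D (illustration only).
b_choices = [6, 4, 5]
stat = chi_square_uniform(b_choices)
print(round(stat, 3), stat > CRITICAL_05_DF2)
```

A statistic below the critical value, as with these made-up counts, gives no grounds for rejecting random choice; counts piled on a single position would exceed it.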

The rating-scale data were subjected to
analyses of variance. It was found that position
was effective on three of the four questions.
In each instance, the B position yields
the high mean, i.e., is more satisfied with the
job, considers its status higher, and believes
the group cooperated better than the other
three positions. Of most interest, however, are
the significant interactions of distribution X
position on the first two questions. The means
of these interactions are given in Table 3. Note
that every rating change follows the connected
propositions that individual morale is directly
related to individual centrality and that increased
or decreased information status is
similar to respectively increased or decreased
individual centrality.

TABLE 3

MEANS OF DISTRIBUTION X POSITION INTERACTION
ON JOB SATISFACTION AND POSITION
STATUS RATINGS*

                 Job Satisfaction                  Position Status
Position    Equal   Unequal      Unequal      Equal   Unequal      Unequal
                    Peripheral   Central              Peripheral   Central
A            5.2     6.5          5.2         10.4    12.9         10.8
B            6.2     5.9          6.8         17.5    16.2         18.9
C & D        5.4     4.2          4.9         11.7     9.0         10.3

* Tukey's gap test yields no more information in this instance
than does the analysis of variance.

Categorizing the answers to a question re- 
garding the groups’ development of a system 
showed no differences between the information 
distribution conditions when tested by chi 
square. Sixty-seven per cent of the groups, 
however, adopted the system of sending all 
information to everyone and checking each 
other's answers. In the remaining 28 per cent 
which developed a system, the central posi- 
tion, B, received all information and trans- 
mitted only answers. Two groups claimed that 
no system was developed. 


Discussion 


The data show that individual centrality 
is (a) directly proportional to individual 
morale and to the probability that an in- 
dividual will be chosen as a leader and (b) 
inversely proportional to individual problem- 
solution times where there is equal distribu- 
tion of information, but that there are devia- 
tions under other information-distribution 
conditions. Since the argument will be based 
primarily upon individual-position time scores, 
they are discussed first. 

Table 1, it will be recalled, presented the
means of the position X information-distribu- 
tion time scores. Shaw's formulation that 
increasing or decreasing information at a 
position is similar to respectively increasing
or decreasing the centrality index of that
position is upheld there as long as comparisons
are made within distribution conditions.²
Once comparisons are made between distribu- 
tions, however, failure to follow Shaw's propo- 
sition is apparent. The most noticeable failure 
is at position B in the unequal central distribu- 
tion which should be the lowest mean and, 
instead, is the third lowest. This negative 
instance must be accounted for, and a probable 
answer is found in the communication require- 
ments placed upon B by the nature of the 
information distribution. 

The groups usually adopted the system of 
“information to everyone.” Under this sys- 
tem, B must send a minimum of 17, 18, and 21 
message units in the unequal peripheral, 
equal, and unequal central conditions re- 
spectively. As seen in Table 2, the message 
units actually sent by B order directly in the 
sequence given by these minima. Times of 
the B position, though, order equal, unequal 
central, unequal peripheral. These orderings 
suggest that there is an interaction between 
individual centrality and communication re- 
quirements: that, when the number of required 
message units passes a certain optimal output 
level, it begins to counteract the effects of 
individual centrality. Such a statement may 
best be summarized with the term “saturation,”⁴
since it is not simply and directly related
to the number of message units required
to be transmitted, as is shown by the A position
in the unequal peripheral distribution.
Thus, information distribution is indirectly an
effective variable by virtue of its effects upon
what is, perhaps, a more basic phenomenon,
saturation, or the input and output requirements
which it places upon positions within a
group structure.

2 There is some question as to the comparative time
of positions A and B in the unequal peripheral condition.
Should this distribution so readily overcome the
structural centrality effect as to produce the time differences
between those positions that it did? The difference
between these means is associated with a
p < .05.

3 The differences between the means are almost
identical with the differences between the minima.
This suggests, since the absolute values of the means
are higher than those of the minima, that the channels
carry a constant amount of “noisy” and/or redundant
messages. Examination of Shaw’s data for three nets
shows the same effect appearing there.

4 Two kinds of what we have called saturation may
be distinguished: one we call “channel saturation”
and refers to the number of channels with which a
position must deal; the other, discussed above, is
“message unit saturation.” The former should be
closely related to both centrality and group size and the
latter to centrality, problem size, and information distribution.
Each of the two main classes may break
down into “input” and “output” saturations. If these
variables are basic ones and the above textual statements
are correct, they should ultimately affect leadership
emergence and morale. Experiments now in
progress bear upon these points.
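
The minima of 17, 18, and 21 message units quoted above for position B can be reproduced with a little arithmetic. The sketch assumes that each information card counts as one message unit, that every peripheral position must end up with all eight items, and that B forwards to each neighbor only the items that neighbor lacks; the card counts come from the Procedure section, while the function name is illustrative.

```python
TOTAL_ITEMS = 8  # information cards per problem (2 x 4 in the equal condition)

# Information cards initially held by each position; B is the center.
DISTRIBUTIONS = {
    "unequal peripheral": {"A": 5, "B": 1, "C": 1, "D": 1},
    "equal": {"A": 2, "B": 2, "C": 2, "D": 2},
    "unequal central": {"A": 1, "B": 5, "C": 1, "D": 1},
}


def center_minimum(holdings, center="B", total=TOTAL_ITEMS):
    """Fewest units the center must send so every other position has all items."""
    return sum(total - held for pos, held in holdings.items() if pos != center)


for condition, holdings in DISTRIBUTIONS.items():
    print(condition, center_minimum(holdings))
```

Under these assumptions the three conditions give 17, 18, and 21 units, in the same order as the observed means for B in Table 2.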

One other comment is pertinent. Although 
morale follows almost perfectly Shaw’s 
original statement as to the effects of informa- 
tion distribution, the emergence of leader- 
ship does not: information distribution did 
not affect leadership emergence. Examination 
of Shaw’s data indicates that that conclusion 
was based upon the findings in structures 
where each position has two or more channels 
terminating upon it. Information distribu- 
tion, then, has an effect upon leadership 
emergence in structures having lesser dif- 
ferences in centrality than does the wheel 
structure. 


SUMMARY AND CONCLUSIONS 


An experiment was conducted using a four- 
person wheel group structure to test the effect 
of information distribution upon problem 
solving, leadership, and morale within the 
structure. Three conditions of information 
distribution were used: equal, unequal with 
most information initially located at the 
center of the structure, and unequal with a 
peripheral position initially receiving the 
most information. 

The data substantiate the position that 
the individual centrality index is directly re- 
lated to the emergence of leadership in the 
structure and individual morale. The findings 
with respect to information distribution, how- 
ever, suggest that it is not simply and di- 
rectly related to the individual centrality 
index. Rather, it appears to be only indirectly 
related thereto and mediated by a more basic 
phenomenon which has been termed “satura- 
tion." 


REFERENCES 


1. SHAW, M. E. Some effects of unequal distribution of
information upon group performance in various
communication nets. J. abnorm. soc. Psychol.,
1954, 49, 547-553.

2. TUKEY, J. W. Comparing individual means in the
analysis of variance. Biometrics, 1949, 5, 99-114.


Received August 25, 1953. 



THE EFFECT OF ONE PARTNER'S SUCCESS IN A RELEVANT
TASK ON THE INTERACTION OF OBSERVER PAIRS¹


BERNARD MAUSNER
University of Massachusetts 


The present report is based on one of a
series of studies of the determinants of
social interaction in judgment. The
paradigm for these investigations is that of 
the Sherif experiment in which it was demon- 
strated that the judgments of subjects (Ss) 
reporting the extent of autokinetic effect 
converged toward a group norm. It has al- 
ready been found that the degree of this con- 
vergence can be affected by such factors as 
prior reinforcement of the responses (2, 4, 7), 
variation in the psychophysical method em- 
ployed (1, 5), variation in the nature of the 
stimuli (3), and manipulation of the prestige 
of one of the partners through his identification
as an “expert” or “nonexpert” (6). With
reference to the last of these factors, there 
exists the problem of specifying in as precise 
a manner as possible the variables determining 
prestige. When this is done, prestige may serve 
as a useful intervening variable in the theoret- 
ical treatment of social interaction. 

The writer (8) has suggested that con- 
vergence in judgment is a function of the 
summation of two conflicting response tenden- 
cies: (a) a tendency to continue to make 
previous judgments, and (b) a tendency to 
agree with a partner. The partner's prestige 
is probably one of the determinants of the
latter tendency. Prestige may be strengthened
where the partner gives specific evidence of
competence in the kind of judgment being 
made; it may be weakened where evidence of 
incompetence is presented. Other things being 
equal, this variation in prestige should result 
in a concurrent strengthening or weakening
of the tendency to agree with the partner. 
Where the opposing tendency, that to maintain
an earlier range of judgments, is of known
strength, or is judged to be randomized in a
group of Ss, convergence should be predictable
from information concerning the partner’s
demonstrations of competence.

1 The present report is the fourth in a series entitled
“Studies in Social Interaction.” The data were presented
at the 1953 meetings of the Eastern Psychological
Association. The experiment was carried out in the
laboratory of the Department of Psychology, Washington
Square College, New York University. The
assistance of the following students who served both as
confederates and as experimenters is gratefully acknowledged:
Leon Dreier, Peter Sciaki, Marilyn
Reisman, Sandra Skolnick, and Barbara Diamond.

On the basis of these considerations a 
specific hypothesis was tested: Ss will show a 
greater tendency to converge in judgment toward 
partners who have previously demonstrated 
success in a related task than toward partners
who have previously demonstrated failure. 


METHOD 


Subjects. The Ss were 28 undergraduate students in 
elementary psychology classes at Washington Square 
College, New York University. All were naive with 
respect to the purposes of the experiment and the area 
of the investigation. 

Procedure. In a preliminary session each S judged 
the length, in inches, of each of a series of 100 lines. 
These consisted of ten lines varying in length from 10 
to 19 in., each presented ten times. Each block of ten 
trials contained all ten lengths arranged in a different 
permutation from that used in other blocks. The
judgments were made in a dark room; the lines were 
presented by means of an apparatus in which a lighted 
slot }4 in. wide and varying in length was exposed to S 
for approximately 1 sec. (for complete description cf. 4). 

Following this session (the Alone session), a confederate
of E was presented as a second "subject."
Both Ss were given 20 trials in which they were in- 
structed to move a stylus through a 17-cm. slot placed 
behind a curtain. They then attempted to reproduce 
this length by moving the stylus through an equivalent 
distance in a free slot, also hidden. The actual S com- 
pleted the task first and was followed by the confed- 
erate. Each S was requested to observe and record his 
partner's performance. A success was defined as re- 
production within 1 cm. of the correct distance. The 
confederates were instructed to perform very skillfully 
for some Ss, very unskillfully for the remainder. Those 
Ss for whom the partner's reproductions had an SD 
of 1.5 cm. or better were considered to have had a 
"successful" partner (group S); those for whom the 
SD of the reproductions was more than 1.5 cm., an
"unsuccessful" partner (group U). The Ss were as- 
signed to these groups in rotation. Group S contained 
17 Ss, group U, 11 Ss. 

After this procedure had been completed, both Ss 
judged the lengths of the same series of lines previously 
presented (Together series). They alternated as first
to respond to each stimulus. The confederates had
been instructed by E to make their judgments vary
consistently by 4 to 5 in. from those given by the actual
S in the alone situation. 

Treatment of results. Since one of the two persons in
each pair was E's confederate, and, by instruction, was
forced to maintain a relatively constant range of judgments,
the degree of social interaction in judgment was
estimated by measuring the degree of shift in judgments
made by the other member of the pair, the actual S,
in two ways. First, a D score was calculated for each S.
This measure is derived from the difference in frequency
with which each number response was made in the alone
and together situations. A high D score indicates considerable
shift, and, where this is toward the partner,
considerable convergence. A low D score indicates
maintenance of the same range of judgments in the
together as in the alone session (see 4, 6, for detailed
method of calculation). In the second method of analysis,
graphs of median judgments per block of ten
trials were used to show direction of shift and as a
basis for qualitative descriptions of the progress of
interaction through the course of the experiment.


RESULTS AND DISCUSSION

Table 1 gives D scores for the Ss of both
groups. The mean D score for group S is
62.1, for group U, 20.2. This difference is
significant at the .01 level (t = 3.10, n = 26,
p < .01). This indicates greater shift in judgment
from alone to together situations for
the Ss whose partners were successful (group
S) than for those whose partners were unsuccessful
(group U). Inspection of the median
graphs shows a tendency to formation of a
social norm of judgment in the pairs of group
S, a lack of such tendency in the pairs of
group U. Figure 1 shows three such plots for
Ss in group U; Fig. 2 four plots for Ss in
group S. These are representative of the 10
Ss in group U and the 12 in group S whose
behavior corresponded with the prediction.²
Thus the hypothesis presented above is substantiated
by the data of this experiment.

TABLE 1

INTERACTION BETWEEN S AND E'S CONFEDERATE
(P) GIVEN BY D SCORE; ACCURACY OF SLOT REPRODUCTION
BY SD OF TRIALS; GROUP S: PARTNER'S
SD < 1.50, GROUP U: PARTNER'S SD > 1.50

                       Slot Reproduction
Subject      D        SD(S)      SD(P)

Group S
ZVI          67       3.17        .38
KLE          58       1.51        .70
HEN          78       2.21       1.39
KEL          84        .80        .90
BRE          68       1.84        .83
...          ...      ...        ...
DEB          60       1.73       1.39
...          ...      ...        ...
BRA          162      2.88        .83
FRI          60       2.93        .78
SWI*         30       2.35        .92
COH*         4        2.00       ...
...          ...      ...        ...
CHA*         13       2.63       1.39

Group U
M...         7        2.26       ...
AUE          ...      ...        3.12
BAS          24        .40       2.47
CAL          19       2.95       1.88
FOT          12       7.10       2.93
HYS          ...      ...        ...
...          16       ...        ...
JUS          31       1.75       1.84
ECK          32       2.24       1.58
HOL*         66       3.43       2.87

* Ss whose results did not conform to the prediction.

2 ... with the American Documentation Institute. Order
Document No. 4113 from the ADI Auxiliary Publications
Project, Photoduplication Service, Library of
Congress, Washington 25, D. C., remitting in advance
$1.25 for 35 mm. microfilm, or $1.25 for 6 X 8
in. photocopies. Make checks payable to Chief, Photoduplication
Service, Library of Congress.

Six of the 28 Ss do not show the predicted
behavior. Four of the Ss in group S have D
scores below the mean for group U; one (SWI)
is borderline. These are clearly distinguished
from the remainder of the Ss in this group
both by their D scores, and by the failure of
their graphs to show formation of a social
norm. The Ss' own performance on the interpolated
task was not at all related to the
degree of social interaction. Among the five Ss
in group S who failed to converge none was
particularly accurate in “blind” slot reproduction.
The two most accurate Ss have the
highest and the third highest D scores. The
most inaccurate S in the entire experimental
group (FOT) is found in group U, and has an
appropriately low D score (12); i.e., he failed
to converge.

Unfortunately, there is no further information
available in the data of this experiment to
throw light upon the behavior of these deviant
Ss. However, their occurrence may be predicted
from the theoretical scheme alluded to
above. In any group of Ss chosen at random it
is likely that, owing to personality factors
not immediately identifiable,
some have a high tendency to maintain
previous judgments. The opposing tendency
to agree with the partner is probably not too
high to begin with where the partner is an
anonymous fellow student. It would be unlikely,
therefore, that a manipulation of the
partner's prestige, such as that here attempted,
would be effective 100 per cent of the time.
The fact that it was effective in 71 per cent
of the cases in group S, and that a significant
difference in D scores was found between the
two groups, argues for the sensitivity of the
convergence experiment as a measure of social
interaction.

Apparently the effect of a partner's failure
in lowering the tendency to agree is even more
pronounced than the effect of his success in
raising it, since all but one of the Ss in group
U failed to converge. That deviant case
(HOL) had a D score (66) actually above the
mean for group S. Although there is no corroborating
evidence, it is possible that the
fact that HOL is the sole Negro S is significant.
There may be factors in the developmental
history of Negro Ss which would lead to a
heightened tendency to agree with a white
partner.

FIG. 1. MEDIAN JUDGMENTS PER BLOCK OF TEN TRIALS FIRST ALONE AND THEN WITH A PARTNER
UNSUCCESSFUL IN SLOT REPRODUCTION. PLOTS FOR THREE Ss ARE INCLUDED.
(Ordinate: median judgment in inches; abscissa: trials by tens, alone and together; o = S, x = partner, P(U).)


Thus two kinds of conditions antecedent to 
prestige can now be identified. Prestige as 
measured in terms of convergence in judgment 
has been anchored to a partner's group mem- 
bership (7) and may now also be anchored to 
a partner's demonstrations of competence in 
relevant tasks. The next step in these investi- 
gations should be an attempt to identify and 
measure some of the genetic factors which 
relate prestige to tendency to agree. It may be
suggested that, for most people, agreement 
with prestigeful individuals has been posi- 
tively reinforced, and disagreement negatively 
reinforced. That is, most Ss used in these 
experiments have received reinforcements 
from agreeing with members of an “expert” 
group, or with people who demonstrate 
competence. This argument leads to a testable 
hypothesis. It may be predicted that Ss will 
converge toward partners with whom agree- 
ment has previously been positively reinforced 
in some other context, but will not converge 
toward partners with whom agreement has 
been negatively reinforced. If such a relationship
were found to hold, the kinds of prestige
effects demonstrated in the present and earlier
studies could logically be related to similar
histories of differential reinforcement in our
Ss’ development.

Fig. 2. MEDIAN JUDGMENTS PER BLOCK OF TEN TRIALS FIRST ALONE AND
THEN WITH A PARTNER SUCCESSFUL IN SLOT REPRODUCTION. PLOTS FOR
FOUR Ss ARE INCLUDED.
[Axes: median judgment (inches) against trials by tens, alone and
together.]


SUMMARY AND CONCLUSIONS 


Twenty-eight undergraduate Ss judged the 
lengths of 100 lines first alone and then with 
a confederate of E. These confederates had 
demonstrated failure in a similar task for 11 
of the Ss, and had demonstrated success for 
17 of them. 

The Ss who worked with previously suc- 
cessful partners converged significantly more 
toward these partners in the group judgment 
situation than did the Ss who worked with 
previously unsuccessful partners. 

This finding is held to indicate that demonstrations
of success and failure in relevant
tasks may be used as one kind of antecedent
condition anchoring the intervening variable
“prestige.” A partner’s prestige is considered
one of the determinants of the tendency of an
S to agree with him, and thus finally of the
degree of convergence in judgment.
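The convergence counts behind these conclusions can be tallied in a few lines. The sketch below is illustrative only. The counts are those stated or implied in the text: 17 Ss with successful partners, of whom five failed to converge, and 11 Ss with unsuccessful partners, of whom all but one failed to converge. The one-sided exact test on the resulting 2 X 2 table is an assumption added here for illustration; it is not the comparison of D scores reported in the paper, and the function names are hypothetical.

```python
from math import comb

# Counts taken from the text; the dictionary layout is illustrative.
GROUP_S = {"n": 17, "converged": 12}   # partner previously successful
GROUP_U = {"n": 11, "converged": 1}    # partner previously unsuccessful


def percent_converged(group):
    """Percentage of Ss in a group who converged toward the partner."""
    return 100.0 * group["converged"] / group["n"]


def exact_one_sided_p(a, b, c, d):
    """One-sided exact (hypergeometric) probability for the table
    [[a, b], [c, d]]: the chance of a or more cases in the top-left
    cell when all four margins are held fixed."""
    row1, row2, col1 = a + b, c + d, a + c
    total = comb(row1 + row2, col1)
    upper = min(row1, col1)
    hits = sum(comb(row1, k) * comb(row2, col1 - k)
               for k in range(a, upper + 1))
    return hits / total


if __name__ == "__main__":
    print(round(percent_converged(GROUP_S)))   # 71
    print(round(percent_converged(GROUP_U)))   # 9
    # Rows: group S, group U. Columns: converged, failed to converge.
    p = exact_one_sided_p(12, 5, 1, 10)
    print(f"one-sided exact p = {p:.4f}")
```

The first figure reproduces the 71 per cent quoted for group S; the exact test merely shows that the split between the groups is lopsided even when only converged versus not converged is counted.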


REFERENCES 

1. BERENDA, RUTH W. The influence of the group on the
judgments of children. New York: King’s Crown
Press, 1950.

2. KELMAN, H. Effects of success and failure on “suggestibility”
in the autokinetic situation. J. abnorm. soc. Psychol.,
1950, 45, 267-285.

3. LUCHINS, A. S. On agreement with another’s judgments.
J. abnorm. soc. Psychol., 1944, 39, 97-111.

4. MAUSNER, B. The effect of previous reinforcement
on the interaction of observer pairs. Univer.
Microfilm, Ann Arbor, Mich., Publ. No. 2118, 1950.

5. MAUSNER, B. The effect of variation in psychophysical
methods on the interaction of observer
pairs. Paper read at Eastern Psychol. Ass., Atlantic
City, N. J., April, 1952.

6. MAUSNER, B. Studies in social interaction: III.
The effect of variation in one partner’s prestige
on the interaction of observer pairs. J. appl.
Psychol., 1953, 37, 391-393.

7. MAUSNER, B. The effect of prior reinforcement on
the interaction of observer pairs. J. abnorm. soc.
Psychol., 1954, 49, 65-68.

8. MAUSNER, B. Studies in social interaction: I. A
conceptual scheme. J. soc. Psychol., in press.


Received July 15, 1953. 


SOME PERSONALITY CORRELATES OF STIMULUS 
GENERALIZATION UNDER STRESS 


CHARLES W. ERIKSEN 
The Johns Hopkins University 


IN THEIR recent review of research on psychological stress,
Lazarus, Deese, and Osler (11) have pointed out the importance of
the individual differences component in performance under stress
conditions. In some subjects (Ss) the introduction of stress
results in poorer performance, but in others performance is
unaffected or even improves. The purpose of the present experiment
was to investigate some of the personality correlates of these
individual differences in response to stress. Specifically, the
problem here was the relation of the hysteria-psychasthenia
dimension, as measured by the respective scales in the Minnesota
Multiphasic Personality Inventory (MMPI), to the degree of stimulus
generalization shown under the stress of strong electric shocks.
The Ss were trained to make a simple horizontal arm movement when a
square of a certain size was presented, and the amount of
generalization of this response to squares of other sizes was
determined.

Previous research has indicated that learning and memory
performance under stress produced by “self-esteem” threat is related
to the hysteria-psychasthenia variable. The Ss scoring high on
hysteria have been found to have poorer recall of failures on
self-esteem-involving tasks (6, 15) and poorer memory for
anxiety-arousing stimuli (1) than Ss scoring high on psychasthenia.
However, in the present experiment not only was a different aspect
of performance measured, but the method of inducing stress was the
threat of painful physical stimulation rather than a threat to the
Ss’ self-esteem. To see what relationship, if any, might be expected
between performance and the hysteria-psychasthenia variable under
these changed conditions, we need to consider briefly some of the
factors involved in the concept of psychological stress.

We can equate psychological stress with anxiety and analyze it into
the following sequential components:

Stimulus situation → anxiety response → S’s response to his
internal anxiety-produced stimuli


Such an analysis helps us to bear in mind some 
of the factors involved in stress and to see what 
effect changes in the stress conditions will 
have upon different performance measures. 

The stimulus situation alone provides ample 
opportunities for individual differences to 
occur in stress research. It has been shown 
(12) that differences in goals and prior ex- 
periences may render a given stimulus anxiety 
arousing for some Ss and not for others. More 
pertinent to our present problem is how dif- 
ferences in the stimulus situation limit or 
affect S’s response to his internal anxiety cues. 
Responses are made not only to the internal 
anxiety cues, but they must also be adapted 
to the external anxiety-provoking situation. 

Although a number of different stimulus 
situations have been used to stress Ss, most, 
if not all, represent variations upon one or the 
other of two basic types. One is to threaten 
Ss with painful physical stimulation such as 
electric shocks. The other is to threaten self- 
esteem by failure or the threat of failure on 
some highly competitive task. Both types of 
situation and their variations have been suc- 
cessful in making Ss anxious, but it is to be 
noted that a given response may not be equally 
effective to both types of threat. People differ 
in the defenses they use and in the way they 
have learned to handle and interpret anxiety. 
We may expect that some of these learned 
responses are more appropriate to one type 
of stress than to the other. Avoidance defenses 
such as forgetting failures or “psychologically 
leaving the field” may be effective methods of 
evading anxiety under situations of self-esteem 
threat, but they are quite useless when it 
comes to evading electric shocks or other 
physical punishment. 

In the present experiment, where anxiety 
was induced by electric shocks, the prediction 
was made that Ss scoring high on the hysteria 
scale would show more stimulus generaliza- 
tion than Ss scoring high on psychasthenia. 
This prediction was arrived at from two con- 
siderations. First, Ss high on hysteria, de- 
prived of using their avoidance-type of defense 
by the nature of the present stress situation, 


562 


should be more subject to uncontrolled anx- 
iety. The Ss high on psychasthenia, on the 
other hand, would appear from previous re- 
search and from the content of the MMPI 
items to be more accustomed to coming to 
grips with anxiety on the conscious level. 
Their supposed defenses of rationalization and 
intellectualization should be more compatible 
with the present stress situation. 

The second consideration was that the 
greater the uncontrolled anxiety, the greater 
the stimulus generalization. Meyer (13) has 
recently summarized evidence indicating that 
one effect of anxiety is to lower the threshold 
for responses, and experiments by Hilgard, 
Jones, and Kaplan (10) and by Rosenbaum 
(14) have suggested a relation between anxiety 
and stimulus generalization. 


METHOD 

Design. The Ss were trained to respond with a hori- 
zontal arm movement when a square of a certain size 
(training stimulus) was presented. After this response 
had been well established, squares of different sizes 
(generalization stimuli) were occasionally presented to 
Ss instead of the training stimulus, and Ss’ responses
or tendencies toward responses were recorded. Two 
experimental conditions were used. Under the avoidance 
condition, Ss were instructed that on some of the trials 
when the training stimulus appeared a shock circuit 
would be on. On these trials they would receive a shock 
if their response to the training stimulus was not rapid 
enough. Under the nonavoidance condition, Ss were 
also told that some of the time when the training stim- 
ulus appeared a shock circuit would be on, but here
they were given to understand that the shocks were 
completely unrelated to their performance. In other 
words, no matter what they did, when the shock circuit 
was on, they would be shocked. 

These two experimental conditions were used to 
determine what effect the factor of evitability of shock 
might have upon performance and also as a control 
for Ss who might deliberately decide to respond to every
stimulus in order better to avoid receiving shocks. As a 
further control against this latter possibility, Ss in 
both conditions were told that, although there was no 
shock for incorrect responses, the experimental session 
would continue until their performance satisfied a 
criterion of speed and accuracy. 

The stimuli were three squares having a projected size of .75,
1.12, and 1.50 in. square. These sizes were found to be equally
spaced on the equal discriminability scale described by Garner and
Hake (8). Between Ss the large and small square were alternated as
the training stimulus. The Ss were run individually. Before starting
the experiments, S was shown the three stimuli, one at a time, to
insure that he could correctly identify each. Twenty training trials
were then given in which the training stimulus was the only stimulus
presented. Five test trials for the training stimulus and for each
of the two generalization stimuli followed the training trials
without interruption. An additional 42 training trials were
interspersed among the test trials in such a manner that each test
trial was separated from the other test trials by three training
trials. Thus for an S who was trained on the large square, the
experimental sequence would be: 20 successive presentations of the
large square, a test presentation of the large, medium, or small
square, three more presentations of the large square, and so on. Two
counterbalanced sequences were employed with the stimuli in the test
trials to minimize sequence effects in the generalization gradients.

Subjects. A shortened version of the MMPI, containing the items for
the hysteria, psychasthenia, lie, and K scales as well as the items
for Taylor’s scale of manifest anxiety (17), was administered to 120
males enrolled in the introductory course in psychology. The Ss used
in the experiment were selected from among those who scored above
the 80th percentile (hysterics) and below the 20th percentile
(psychasthenics) on the distribution of scores obtained by
subtracting each individual's score on the psychasthenia scale from
his score on the hysteria scale. This subtractive procedure was used
since it had been previously found to be effective (1, 6) as a way
to partial out a general factor of “neuroticism” which contributes
to high scores on both scales. A total of 18 hysterics, 12 in the
avoidance condition and 6 in the nonavoidance, and 21
psychasthenics, divided 14 and 7 between the avoidance and
nonavoidance conditions, constituted the experimental Ss.

Apparatus. The S was seated at a table 2 ft. away from a 3 X 4-ft.
vertical plywood partition. In the lower left side of the partition
was a horizontal slot 14 in. wide and 11 in. long. In the right-hand
side of the partition, at eye level, was an 8 X 8-in. ground glass
window. The stimuli were presented to S by projecting them on the
window. On a ready signal from E, S inserted a pencil into the slot
on the left-hand edge where the pencil made contact with a
microswitch. When the training stimulus appeared, S made a rapid
horizontal movement of the pencil to the right, all the while
keeping the pencil in the slot. The extent of the required movement
was 23 cm. (approximately 9 in.), which was indicated by a marker on
S's side of the partition. The S’s response was recorded by the
tracing of the pencil on recording paper mounted behind the
partition. The stimuli were projected by a 35-mm. projector. Through
the use of a condenser-time-delay circuit the duration of exposure
of each stimulus was 3 sec.

The latency of S's response was measured to the nearest hundredth
of a second by a Standard Electric Timer. The timer was activated
simultaneously with the projector and was stopped when S broke
contact with the microswitch at the left-hand edge of the slot. If S
failed to respond, the timer automatically stopped when the
projector circuit was interrupted by the timing-delay apparatus.

Electric shock was administered to the back of S’s left hand by
hand electrodes. The source of shock was a faradic current from an
inductorium supplied by a 6-v. dry cell. Strength of shock varied
slightly between Ss, since the attempt was made to keep the strength
of shock just below the level where S would refuse to continue with
the experiment.

1 Although the above procedure is quite similar to that used by
Rosenbaum (14), it differs from his in two important respects.
Rosenbaum failed to provide a control against Ss who may
deliberately decide to respond to every stimulus, and he also did
not inform Ss that the shock circuit would not always be on when the
training stimulus appeared. An attempt by the present experimenter
to duplicate exactly Rosenbaum's procedure revealed that a number of
Ss deliberately responded to every stimulus since they “had nothing
to lose” because errors were not punished. Some Ss also discovered
that the shocks were arbitrarily administered by E since, in accord
with the arbitrary schedule, they would sometimes be shocked when
they had made a very rapid response and not shocked when their
response was unusually slow.
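Two procedural rules in the Method lend themselves to a short sketch: the selection of extreme groups on the hysteria-minus-psychasthenia difference score, and the spacing of test trials among training trials. The code below is a minimal illustration under stated assumptions, not the author's procedure. The function names, the percentile interpolation rule, and the random ordering of test stimuli are all hypothetical (the paper used two fixed counterbalanced orders whose content is not given).

```python
import random
from statistics import quantiles


def select_extreme_groups(scores, lower_pct=20, upper_pct=80):
    """Split Ss into extreme groups on the hysteria-minus-psychasthenia score.

    scores maps a subject label to (hysteria, psychasthenia) scale scores.
    Cut-offs come from the whole distribution of difference scores; the
    interpolation used by statistics.quantiles is an assumption.
    """
    diff = {s: hy - pt for s, (hy, pt) in scores.items()}
    cuts = quantiles(diff.values(), n=100, method="inclusive")
    low_cut, high_cut = cuts[lower_pct - 1], cuts[upper_pct - 1]
    hysterics = sorted(s for s, d in diff.items() if d > high_cut)
    psychasthenics = sorted(s for s, d in diff.items() if d < low_cut)
    return hysterics, psychasthenics


def build_trial_sequence(training, stimuli=("small", "medium", "large"),
                         n_initial=20, tests_per_stimulus=5, spacing=3,
                         seed=0):
    """Initial training block, then test trials separated from one
    another by `spacing` training trials."""
    rng = random.Random(seed)
    tests = [s for s in stimuli for _ in range(tests_per_stimulus)]
    rng.shuffle(tests)  # assumption: stands in for the fixed test orders
    sequence = [("train", training)] * n_initial
    for i, stimulus in enumerate(tests):
        if i > 0:
            sequence.extend([("train", training)] * spacing)
        sequence.append(("test", stimulus))
    return sequence


if __name__ == "__main__":
    seq = build_trial_sequence("large")
    interspersed = sum(1 for kind, _ in seq[20:] if kind == "train")
    print(len(seq), interspersed)   # 77 42
```

With 15 test trials there are 14 gaps of three training trials each, which accounts for the 42 additional training trials mentioned in the text.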


RESULTS 


For each S the mean response amplitude on 
the test trials was computed for each of the 
three stimuli. The means of the individual S 
means are plotted in Fig. 1, by experimental 
condition, for the hysteric and psychasthenic 
groups. While there is essentially no dif- 
ference in response amplitude between groups 
or conditions to the training stimulus (0), the 
amount of generalization to the two non- 
training stimuli (stimulus 1 and 2) is con- 
sistently greater in the hysteric group under 
both experimental conditions. Also both the
hysteric and psychasthenic groups show more
generalization under the avoidance than under
the nonavoidance condition. There is little
difference, however, in the amount of generali- 
zation shown to stimulus 1 and 2. 

Statistical analysis supports the graphical interpretation of the
data. Since the variance of the response amplitudes was
significantly less in the nonavoidance condition, it was not
appropriate to combine the two experimental conditions in a common
analysis of variance. Instead a modified three-way analysis (groups,
the two generalization stimuli, and individuals) was performed
separately for the avoidance and nonavoidance conditions. The
results of these analyses are summarized in Table 1.

Fig. 1. MEAN RESPONSE AMPLITUDES TO THE TRAINING AND GENERALIZATION
STIMULI
[Axes: response (centimeters) against stimuli. Separate curves for
the hysteric and psychasthenic groups under the avoidance and
nonavoidance conditions.]

TABLE 1
SUMMARY OF THE ANALYSES OF VARIANCE FOR THE AVOIDANCE AND
NONAVOIDANCE CONDITIONS

Source                             Sum of squares   df   Mean square    F

Avoidance Condition
  Total                                2024.35      51
  Groups                                280.72       1      280.72     4.76**
  Stimuli                                26.90       1       26.90     2.15
  Individuals                          1415.32      24       58.97
  Groups X Stimuli interaction             .77       1         .77      .06
  Pooled Ss X Stimuli interaction       300.64      24       12.53

Nonavoidance Condition
  Total                                 350.17      25
  Groups                                 70.77       1       70.77     3.30*
  Stimuli                                  .30       1         .30      .08
  Individuals                           235.72      11       21.43
  Groups X Stimuli interaction             .68       1         .68      .18
  Pooled Ss X Stimuli interaction        42.70      11        3.88

* Significant at the .05 level (one-tailed test).
** Significant at the .02 level (one-tailed test).
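The entries of Table 1 can be rechecked from the sums of squares and degrees of freedom alone. The sketch below is a hedged illustration: the choice of error terms (Groups tested against Individuals; Stimuli and the interaction tested against the pooled Ss X Stimuli term) is inferred from the printed ratios rather than stated by the author. Two entries that are hard to read in the printed table are taken at the values implied by the totals: .77 for the avoidance Groups X Stimuli sum of squares, and 70.77 for the nonavoidance Groups mean square.

```python
# (sum of squares, df) for each source, as read from Table 1.
TABLE_1 = {
    "avoidance": {
        "total": (2024.35, 51),
        "groups": (280.72, 1),
        "stimuli": (26.90, 1),
        "individuals": (1415.32, 24),
        "groups_x_stimuli": (0.77, 1),
        "ss_x_stimuli": (300.64, 24),
    },
    "nonavoidance": {
        "total": (350.17, 25),
        "groups": (70.77, 1),
        "stimuli": (0.30, 1),
        "individuals": (235.72, 11),
        "groups_x_stimuli": (0.68, 1),
        "ss_x_stimuli": (42.70, 11),
    },
}


def component_sum(condition):
    """Sum of the component sums of squares (everything except the total)."""
    return sum(ss for name, (ss, _) in condition.items() if name != "total")


def f_ratios(condition):
    """F ratios using the error terms inferred from the printed table."""
    ms = {name: ss / df for name, (ss, df) in condition.items()
          if name != "total"}
    return {
        "groups": ms["groups"] / ms["individuals"],
        "stimuli": ms["stimuli"] / ms["ss_x_stimuli"],
        "groups_x_stimuli": ms["groups_x_stimuli"] / ms["ss_x_stimuli"],
    }


if __name__ == "__main__":
    for name, condition in TABLE_1.items():
        ratios = {k: round(v, 2) for k, v in f_ratios(condition).items()}
        print(name, round(component_sum(condition), 2), ratios)
```

Under these assumptions the components add up to the printed totals and the recomputed ratios round to the printed F values.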

For both experimental conditions, differences between the hysteric
and psychasthenic groups were significant beyond the .05 level for a
one-tailed test in the predicted direction. In neither condition was
there a significant difference between the generalization stimuli.
Nor were the interactions significant. In Table 2 will be found the
means of the individual S means by groups and experimental
conditions along with their accompanying t tests.

TABLE 2
MEAN RESPONSE AMPLITUDES IN CENTIMETERS BY GROUPS AND EXPERIMENTAL
CONDITIONS TO THE TRAINING AND GENERALIZATION STIMULI

                     Avoidance Condition            Nonavoidance Condition
                          Stimuli                          Stimuli
Group              0      1      2    1 and 2       0      1      2    1 and 2
                                      combined                        combined
Hysteric         24.0   14.4   12.7    27.1       25.0    4.8    4.7     9.6
Psychasthenic    24.1    9.5    8.3    17.7       25.2    1.1    1.7     2.8
t                [illegible] 2.17** 1.81* 2.18**   .42   2.28** 1.34    1.85*

* Significant beyond the .05 level (one-tailed test).
** Significant beyond the .02 level (one-tailed test).

TABLE 3
MEANS OF THE MEDIAN RESPONSE LATENCIES IN HUNDREDTHS OF A SECOND BY
GROUPS AND EXPERIMENTAL CONDITIONS TO THE TRAINING AND
GENERALIZATION STIMULI

                     Avoidance             Nonavoidance
                      Stimuli                 Stimuli
Group              0     1     2          0     1     2
Hysteric          .26   .23   .23        .34   .26   .26
Psychasthenic     .30   .24   .26        .36   .35   .31

Because of a significant difference in the variance of responses
under the avoidance and nonavoidance conditions, it was not possible
to compare the difference in over-all generalization between these
two conditions by the standard t test. Instead the method suggested
by Cochran and Cox (2) was used; this yielded a t value of 5.31,
significant beyond the .001 level for a two-tailed test. Thus it
seems safe to conclude that the avoidance condition results in
greater stimulus generalization.
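The unequal-variance comparison mentioned above can be sketched as follows. This is a minimal illustration of the approximation usually attributed to Cochran and Cox, assumed here to be the one the author had in mind: the t' statistic uses separate variance estimates, and its critical value is a weighted average of the tabled critical t values for the two samples. The function name and the scores in the usage example are hypothetical; they are not data from the experiment, whose individual scores are not reported.

```python
import math
from statistics import mean, variance


def cochran_cox(a, b, t_crit_a, t_crit_b):
    """Return (t_prime, critical_value) for two samples with unequal variances.

    t_crit_a and t_crit_b are the tabled critical t values for
    len(a) - 1 and len(b) - 1 degrees of freedom at the chosen level.
    """
    w_a = variance(a) / len(a)   # squared standard error of the mean of a
    w_b = variance(b) / len(b)   # squared standard error of the mean of b
    t_prime = (mean(a) - mean(b)) / math.sqrt(w_a + w_b)
    critical = (w_a * t_crit_a + w_b * t_crit_b) / (w_a + w_b)
    return t_prime, critical


if __name__ == "__main__":
    # Hypothetical amplitude scores (cm.), NOT data from the experiment.
    avoidance = [30.1, 12.4, 25.0, 18.7, 40.2, 9.5, 22.3, 27.8]
    nonavoidance = [6.0, 2.5, 9.1, 4.4, 7.3]
    # Two-tailed .05 critical t for 7 and 4 df, from a standard t table.
    t_prime, critical = cochran_cox(avoidance, nonavoidance, 2.365, 2.776)
    print(f"t' = {t_prime:.2f}, critical value = {critical:.2f}")
    print("significant" if abs(t_prime) > critical else "not significant")
```

Passing the tabled critical values in as arguments keeps the sketch free of any statistics library beyond the standard one; the weighting means the critical value always lies between the two tabled values.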
The median latency for positive responses 
during the test trials was determined by Ss 
for each stimulus. The means of these median 
latencies are given in Table 3 by group and 
experimental condition. Three trends can be 
noted in this table. First, the latencies to the 
generalization stimuli are shorter than to the 
training stimulus. Second, the response laten- 
cies for the hysterics are consistently shorter 
than for the psychasthenics, and third, the 
latencies for the avoidance condition are 
shorter than for the nonavoidance. While
none of these trends is supported by statistical 
test, the shorter latencies for the responses to 
the generalization stimuli are similar to the 
findings of Gibson (9) and Rosenbaum (14). 
They are probably to be expected since the 
less time an S takes for discrimination, the 
greater the likelihood of a wrong response. 


DISCUSSION


The above results are in good agreement
with the prediction. The hysterics under both
avoidance and nonavoidance conditions show
a greater degree of stimulus generalization 
than the psychasthenics. The demonstration 
of a difference between these two groups, how- 
ever, does not insure that our theoretical 
basis for expecting this difference was correct. 
While the prediction for the present study was 
arrived at from the assumption of a difference 
in the way hysterics and psychasthenics 
respond to the internal cues of anxiety, other 
explanations are readily available for the 
present results. Further study of such extreme 
groups is required before we can be confident 
as to the factors underlying the differences in 
performance. 

Irrespective of what may be the basis for 
the obtained differences in stimulus generaliza- 
tion, the present findings are in keeping with a 
growing amount of experimental work show- 
ing the relation of the hysteria-psychasthenia 
variable to various aspects of performance 
under stress. In addition to the previously dis- 
cussed relation of this personality variable to 
learning and memory performance, Davis 
(3, 4) has shown characteristic differences in 
performance on a psychomotor task between 
hysterics and psychasthenics when stress was 
induced by rapid pacing on the task. Results 
closely related to those of the present study 
are reported in an experiment by Deese, 
Lazarus, and Keenan (5). They found that Ss 
high on the MMPI hysteria scale showed 
impairment in nonsense syllable learning under 
stress produced by electric shocks, whereas 
the performance of Ss high on the psychas- 
thenia scale was little affected. 

The success of the hysteria-psychasthenia 
variable in predicting various aspects of 
performance under different stress conditions 
would appear to establish it as an important 
dimension of personality. As such there is a 
need for a clearer definition of the variable in 
personality and behavioral terms. In a previ- 
ous article (6) the similarity of the hysteria- 
psychasthenia variable and Eysenck’s factor 
of introversion-extroversion (7) was pointed 
out, but as yet there is insufficient evidence 
for positively identifying it with Eysenck’s 
factor.

It is interesting to note that the composite hysteria-psychasthenia
scale used in the present study is quite highly correlated with the
Taylor scale of manifest anxiety (r = -.72, N = 120). The magnitude
and direction of the correlation are such as to show that Ss on the
hysteria pole are low scorers on the anxiety scale and those on the
psychasthenia pole are high scorers. Taylor (16) and Taylor and
Spence (18) have shown that high- and low-anxiety groups selected on
the scale of manifest anxiety differ in performance in conditioning
and verbal maze learning. They have interpreted these results along
Hullian lines with anxiety being considered as a generalized drive
influencing performance through the building of habit strength.
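The size of the reported correlation can be put in perspective with two standard conversions. The sketch below is illustrative only: it takes the values given in the text (r = -.72, N = 120) and computes the proportion of shared variance and the usual t statistic for testing a correlation against zero. Neither figure is reported in the paper, and the function names are hypothetical.

```python
import math


def shared_variance(r):
    """Proportion of variance common to the two scales (r squared)."""
    return r * r


def t_for_r(r, n):
    """t statistic, on n - 2 degrees of freedom, for testing r against zero."""
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)


if __name__ == "__main__":
    r, n = -0.72, 120   # values given in the text
    print(f"shared variance = {shared_variance(r):.2f}")
    print(f"t({n - 2}) = {t_for_r(r, n):.2f}")
```

On these figures roughly half the variance of the composite scale is shared with the anxiety scale, which is the sense in which the two are "quite highly correlated."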

The present experiment and that of Deese 
et al. (5) suggest that the Taylor and Spence 
interpretation of the anxiety scale is too simple 
an interpretation. If the drive interpretation 
of anxiety were correct, we would have ex- 
pected the hysteric group in the present ex- 
periment to show the least stimulus generaliza- 
tion and the psychasthenic group the most. 
In fact, however, the opposite held true. This 
finding, together with the other evidence mentioned
above, suggests that the Taylor scale of
manifest anxiety is not so much a measure of
anxiety as it is a measure of a personality
variable relating to the way individuals respond
to or express their anxiety.


SUMMARY 


The amount of stimulus generalization under experimentally induced
anxiety was determined in an hysteric and a psychasthenic group of
Ss, selected on the basis of these respective scales in the MMPI.
Three different size squares were used for the training and
generalization stimuli, and the response was a simple horizontal arm
movement. Two experimental conditions were used. Under the avoidance
condition Ss were instructed that failure to respond rapidly would
result in their receiving a strong electric shock. Under the
nonavoidance condition Ss were told that they would receive electric
shocks randomly, irrespective of their performance. In fact,
however, both conditions had the same number of shocks administered
in a predetermined order.

The hysteric group of Ss showed significantly more stimulus
generalization under both experimental conditions. Also for both the
hysteric and psychasthenic groups, the avoidance condition resulted
in significantly more generalization. The results were considered in
relation to a broad concept of stress, and it was suggested that the
Taylor scale of manifest anxiety was not so much a measure of
anxiety level as it was a measure of how Ss respond to or handle
their anxiety.


REFERENCES 


1. CARSON, V. R. Individual differences in the recall
of word-association test words. Unpublished
doctor’s dissertation, The Johns Hopkins
Univer., 1953.

2. COCHRAN, W. G., & COX, G. M. Experimental
designs. New York: Wiley, 1950.

3. DAVIS, R. Neurotic predisposition and the disorganization
observed in experiments with the
Cambridge Cockpit. J. Neurol. Neurosurg.
Psychiat., 1946, 9, 119-124.

4. DAVIS, R. Increase in strength of a secondary
drive as a cause of disorganization. Quart. J.
exp. Psychol., 1948, 1, 22-28.

5. DEESE, J., LAZARUS, R. S., & KEENAN, J. Anxiety,
anxiety reduction, and stress in learning. J.
exp. Psychol., 1953, 46, 55-61.

6. ERIKSEN, C. W. Psychological defenses and “ego
strength” in the recall of completed and incompleted
tasks. J. abnorm. soc. Psychol., 1954,
49, 45-50.

7. EYSENCK, H. J. Dimensions of personality. London:
Kegan Paul, 1947.

8. GARNER, W. R., & HAKE, H. W. The amount of
information in absolute judgments. Psychol.
Rev., 1951, 58, 446-459.

9. GIBSON, ELEANOR J. Sensory generalizations with
voluntary reactions. J. exp. Psychol., 1939, 24,
237-253.

10. HILGARD, E. R., JONES, L. V., & KAPLAN, S. J.
Conditioned discrimination as related to anxiety.
J. exp. Psychol., 1951, 42, 94-100.

11. LAZARUS, R. S., DEESE, J., & OSLER, SONIA F.
The effects of psychological stress upon performance.
Psychol. Bull., 1952, 49, 293-317.

12. MAHL, G. F. Anxiety, HCl secretion, and peptic
ulcer etiology. Psychosom. Med., 1950, 12,
158-169.

13. MEYER, D. R. On the interaction of simultaneous
responses. Psychol. Bull., 1953, 50, 204-220.

14. ROSENBAUM, G. Stimulus generalization as a
function of level of experimentally induced
anxiety. J. exp. Psychol., 1953, 46, 35-43.

15. ROSENZWEIG, S., & SARASON, S. An experimental
study of the triadic hypothesis: reaction to
frustration, ego-defense, and hypnotizability, I.
Correlational approach. Charact. & Pers., 1942,
11, 1-19.

16. TAYLOR, JANET A. The relationship of anxiety to
the conditioned eyelid response. J. exp. Psychol.,
1951, 41, 81-93.

17. TAYLOR, JANET A. A personality scale of manifest
anxiety. J. abnorm. soc. Psychol., 1953, 48,
285-290.

18. TAYLOR, JANET A., & SPENCE, K. W. The relationship
of anxiety level to performance in serial
learning. J. exp. Psychol., 1952, 44, 61-64.


Received July 23, 1953. 


TACTUAL PERCEPTION OF FORM BY THE BRAIN-INJURED¹


ALAN O. ROSS
Yale University²


MOST of the tests used by the psychologist for the diagnosis of
brain injury employ visual or auditory stimuli (8, 9, 17). The
neurologist, on the other hand, frequently uses tactual stimuli for
the examination of patients suspected of being brain-injured,
particularly when the lesion is believed to be in the posterior
parietal area, where injury is said to result in an inability to
recognize forms tactually (astereognosis) (3, 6, 11). These
neurological techniques, however, are rarely objective and never
quantitative. It is the purpose of this study to determine whether
an objective, quantitative tactual test of perception of raised
outlines of forms can be used to discriminate the performances of
brain-injured from those of normal human adults.

A test was designed on the principle that outlines of a form can be
presented in varying degrees of completeness and that the degree of
completeness can be expressed in terms of the number of tactual
stimuli of which the outline is composed and the distance between
these stimuli. Since brain-injured patients have been reported to
exhibit impaired tactual perception, especially when the parietal
lobe is involved, and since brain-injured patients are said to have
difficulty in responding to two simultaneously presented stimuli (2,
5, 13, 17), two hypotheses concerning their performance on the
tactual test may be advanced.

Hypothesis I. Patients with brain injury require a greater degree
of completeness of a form outline than normals in order to recognize
the form tactually.

Hypothesis II. Patients with brain lesions differentiate less well
than normals between two forms presented simultaneously through the
tactual modality.


1 Part of a dissertation presented for the degree of
Doctor of Philosophy at Yale University. The valuable
advice and critical comments of Col. Charles S. Gersoni
and the members of the doctoral committee, Drs. S. B.
Sarason, H. E. Rosvold, and J. R. Wittenborn, are
gratefully acknowledged.

2 Now at Brooke Army Hospital, Fort Sam Houston,
Texas.


METHOD 


The Tests 


The experiment consisted of two test series. In the 
tactual tests, the stimuli were thumbtack heads out- 
lining letters, numbers, and well-known symbols on 
individual plywood boards, presented in one or two 
open-ended boxes which screened the boards from S’s
view.³

Identification series. The main test made use of eleven 
forms which S had to identify. Six of the forms were 9 
in. by 9 in., five were 4 in. by 4 in. in their largest di- 
mension. Each form could be presented in four steps of 
decreasing order of difficulty. The first step of each form 
consisted of the outline of the form made up of the 
least number of tacks which could be used without de- 
stroying the essential characteristics of the form. On 
each subsequent step the outline consisted of more tacks 
until, with the fourth step, the form was completely 
outlined by adjoining tacks. The four steps thus dif- 
fered in the number of tactual stimuli and the distance 
between these stimuli. 

Each of the 11 forms of the main test was followed 
by a tactual multiple-choice recognition test bearing 
upon the form which it followed. One of these forms 
was a miniature wire replica of the form represented by 
the particular set, the other three were different forms 
which had some aspect, such as a distinct angle, in 
common with the form represented in the set. 

A visual multiple-choice recognition test was pre- 
sented once, at the end of the identification series. This 
test consisted of a card on which the 11 forms used in 
the main test were reproduced in line drawings. The 11 
test forms were randomly scattered among 29 other 
forms from which they were to be chosen. 

Discrimination series. This consisted of ten paired 
figures of which each pair was presented simultane- 
ously. Using both hands, S had to say whether the two 
figures were the same or different. This series consisted 
of letters or numbers outlined with tacks, placed adja- 
cent to one another. Five pairs contained identical 
forms, while the other five contained forms which were 
similar in some respect. 


Procedure 


Subjects were examined individually. Each S was 
told that the examination was in the nature of a survey, 


3 Details of the procedure as well as illustrations and 
descriptions of the stimulus boards, the boxes, and other 
material are contained in an appendix which has been 
deposited with the American Documentation Institute. 
Order Document No. 4164 from ADI Auxiliary Publi- 
cations Project, Photoduplication Service, Library of 
Congress, Washington 25, D. C., remitting in advance 
$2.00 for 35 mm. microfilm or $3.75 for photocopies. 
Make checks payable to Chief, Photoduplication Serv- 
ice, Library of Congress. 


TACTUAL PERCEPTION OF FORM BY THE BRAIN-INJURED 


assurance being given that the results would not in any 
way bear upon his status as a patient or as a soldier.⁴ 
He was then asked to indicate his dominant hand and 
to place this hand in one of the screening boxes. A brief 
tactual screening test was given in which S had to 
identify by touch three out of four items of everyday 
use. 

Identification series, main test. For this test the in- 
structions stressed the fact that each of the boards had 
a figure or a form on it, that these forms had a meaning 
which could be described, and that S was expected to 
tell what the figure was and to reproduce it by drawing. 
The S was assured that the form was always a single 
object which could be recognized if seen. Two sample 
boards were shown visually to acquaint S with the con- 
struction of the forms. A time limit of one minute was 
set for each board and S was encouraged to guess if 
necessary. 

The first step of a form was placed in the box. As 
soon as S gave a response, or at the end of one minute, 
the board was withdrawn and S was asked to draw 
what he thought the form looked like. If S identified the 
form correctly, either by naming or by drawing, or 
both, the tactual multiple-choice board was placed in the 
box and S was asked to examine the four figures and to 
point to the one which had the same shape as the main 
test form just completed. This was followed by the first 
step of the next form in the series. 

An incorrect response or a failure to respond to a 
board on the main test was followed by the next step 
in the set of four boards until either a correct response 
was obtained or the time limit for the fourth step had 
expired, and S had thus failed to recognize the particu- 
lar form on any of its four steps. In this case the next 
form in the series was presented, starting with the 
first step. 

After completion of the last form of the main test, 
the visual multiple-choice recognition test was presented 
and S was instructed to point to all the forms he remembered 
having worked on. A time limit of half a minute 
was set for this test. This completed the identification 
series. 

Discrimination series. For this series S was instructed 
to place one hand in each box and to state whether the 
two forms on the boards he was to feel were the same or 
different. Time limit for each pair was half a minute. If 
S delayed his response, he was encouraged after 23 seconds; 
after a response was obtained or the time limit 
had expired, the boards were replaced by the next pair 
in the series. 

After completion of the discrimination series, S was 
given the Bender-Gestalt Test (1, 12) as part of another 
study and the CVS Individual Intelligence Scale (10). 
The CVS contains the Comprehension and Similarities 
subtests of the Wechsler-Bellevue scale (20) and a short 
vocabulary test, based on the Stanford-Binet word list 
(16, 18). The entire experimental procedure never 
lasted longer than 1 hour and 45 minutes. 


Scoring 


The main test of the identification series was scored 
by assigning a value of 4 to a correct response on the 


4 The verbatim instructions are contained in the 
appendix (see footnote 3). 

5 For criteria see section on scoring and appendix 
(see footnote 3). 


first step of a form; a value of 3 to a correct response 
on the second step; 2 for the third; and 1 for the fourth. 
Failure to give a correct response on all steps of a form 
earned the score of zero.⁶ Since one form was used as an 
introduction and was not scored, the maximum total 
score for the main test of the identification series was 
40, the minimum score zero. This score will be referred 
to as the main score in the following discussion. 
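
The scoring rule just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure as the author carried it out: the function name `main_score` and the encoding of each scored form as the step (1 to 4) at which it was first recognized, or `None` for failure on all four steps, are hypothetical.

```python
from typing import Optional, Sequence

STEPS_PER_FORM = 4  # each form could be presented in up to four steps


def main_score(steps_recognized: Sequence[Optional[int]]) -> int:
    """Sum the credits over the scored forms of the identification series.

    Each entry is the step (1-4) at which the form was first recognized,
    or None if S failed on all four steps (hypothetical encoding).
    """
    total = 0
    for step in steps_recognized:
        if step is None:
            continue  # failure on all steps of a form earns zero
        if not 1 <= step <= STEPS_PER_FORM:
            raise ValueError(f"step must be 1-{STEPS_PER_FORM}, got {step}")
        # Step 1 earns 4 points, step 2 earns 3, step 3 earns 2, step 4 earns 1.
        total += STEPS_PER_FORM + 1 - step
    return total
```

With ten scored forms all recognized on the first step, the function returns the stated maximum of 40; with ten failures it returns zero.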

On the tactual multiple-choice recognition test, a 
score of 1 was given for each correct choice, leading to a 
possible maximum score of 10. The visual multiple- 
choice recognition test was scored by giving 1 point for 
each design correctly recalled. The maximum score for 
this test was 11. Each of these subtests was scored and 
analyzed separately and no over-all total score was 
calculated for the series. 

The discrimination series was scored by crediting 1 
point for each correct discrimination, for a total possible 
score of 10. 


Subjects 

The Ss were patients from the neurosurgical, neuro- 
logical, and rehabilitation services of three Army hos- 
pitals. The test was administered to every patient 
whose major diagnosis was craniectomy, regardless of 
cause, and who was present during the week in which 
the hospital was visited. A total of 31 patients were 
examined. Seven of these had to be discarded before 
analysis of the data, leaving a sample of 24.⁷ Discards 
were necessitated by the diffuse nature of some of the 
lesions which made localization impossible, by the pa- 
tients’ emotional state interfering with testing, or by 
the fact that diagnoses on which the initial selection had 
been based had implied surgical procedures which, upon 
closer study of the medical history, were revealed as not 
having taken place. 

Detailed abstracts of the medical and operative his- 
tories of the patients were obtained and, wherever pos- 
sible, the Army General Classification Test score (19) 
or the Armed Forces Qualification Test score was se- 
cured from personnel records. The site of the lesion was 
ascertained by close scrutiny of operative notes and 
from skull X rays, encephalograms, and surgeons' statements. 

The normals were obtained from the service person- 
nel stationed at an Army hospital. In order to find a 
match for each patient, controlling for intelligence, age, 
years of Army service, and years and geographic region 
of education, 42 normals were selected from the avail- 
able population by reference to a personnel roster. 
These Ss were screened for histories of trauma or dis- 
ease which might have affected their brain structure. 
Five cases had to be eliminated because some residual 
brain damage was suspected, leaving 37 normals who 
were tested. 

From this pool of normals, a match for each of the 
24 patients was drawn. The intelligence measure used 
for the matching was the CVS Individual Intelligence 
Scale (10). The data pertaining to the matching variables 
are summarized in Table 1. The mean difference t 
test and the F test for homogeneity of variance indicate 
that the two groups were probably drawn from the same 
population. The high product-moment correlations 
show that the individuals within the groups were 
closely matched. 


6 Methods and criteria used in scoring are described 
in the appendix (see footnote 3). 

7 Detailed information on the 24 patients and on the 
discarded cases is contained in appendix Tables 1 & 2 
(see footnote 3). 

8 See check list in the appendix (see footnote 3). 


TABLE 1 
MATCHING VARIABLES 
(N = 24 pairs; variables matched for same pairs throughout) 

Variable              Mean     SD            t       F       r 
Intelligence 
  Normals             29.58    6.20 
  Patients            29.42    6.25          .78     1.02    .99 
Age (years) 
  Normals             24.5     4.5 
  Patients            23.9     [illegible]   .79     1.12    .75 
Education (years) 
  Normals             10.9     1.9 
  Patients            11.0     2.5           .24     1.68    .74 
Service (years) 
  Normals             [illegible] 
  Patients            3.3      2.2           [illegible] 

Note.—None of the differences is significant. All of the correlations 
are significant below the .01 level. 

Civilian occupations of normals and patients were 
similar in distribution,⁹ so that a possible bias because 
of occupational experience with tactual tasks could be 
ruled out. All Ss passed the tactual screening test, thus 
eliminating peripheral anesthesia as explanatory of the 
findings. 

RESULTS 


Identification series, main test. According to 
the principle on which the test was designed, 
the main score is an expression of the degree of 
completeness required by an S in order to 
recognize a tactually presented form outline. 
Should the patient group score significantly 
lower than the normal group, Hypothesis I, 
that patients with brain injury require a 
greater degree of completeness of a form out- 
line than normals in order to recognize the 
form tactually, would be confirmed. The 
main-score mean for the patients is signifi- 
cantly below the mean for the normals with a 
p below .01 (Table 2), supporting Hypothesis 
I. 
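
Because patients and normals were matched pair by pair, a comparison of this kind is commonly computed as a t test on the pairwise differences. The sketch below assumes that paired form; the paper itself says only "mean difference t test," and the function name `paired_t` and the example scores are hypothetical, not data from the study.

```python
import math
import statistics
from typing import Sequence


def paired_t(normals: Sequence[float], patients: Sequence[float]) -> float:
    """Return t for the mean of the pairwise differences (normal minus patient)."""
    if len(normals) != len(patients) or len(normals) < 2:
        raise ValueError("need two equally long samples with at least two pairs")
    diffs = [n - p for n, p in zip(normals, patients)]
    mean_diff = statistics.mean(diffs)
    # Standard error of the mean difference, using the sample SD (n - 1 divisor).
    std_err = statistics.stdev(diffs) / math.sqrt(len(diffs))
    return mean_diff / std_err  # degrees of freedom = number of pairs - 1


# Hypothetical scores for four matched pairs, for illustration only.
t_value = paired_t([5, 6, 7, 8], [4, 4, 6, 6])
```

A positive t indicates that the normals scored higher on average; for a directional hypothesis the value is referred to one tail of the t distribution with (pairs - 1) degrees of freedom.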

Both verbal and graphic responses were 
credited as correct in order to avoid a handicap 
to the patients who might have had verbal 


9 See appendix Table 3 (see footnote 3). 

10 Since the hypotheses predict differences in one 
direction, one tail of the distribution of t was used in 
all comparisons deriving directly from the hypotheses. 
All other tests of significance are based on two tails of 
the distribution. 


ALAN O. Ross 


difficulties of an aphasic nature. The number of 
Ss in each group who drew the figures before 
naming them should indicate the extent of 
such a handicap. A chi-square analysis shows 
that there is no significant relationship between 
this manner of responding and membership 
in the patient group (chi square = .82, 
df = 2). 

The lower main score of the patients might 
be the result of their slowness in responding. A 
comparison of the patients' mean response time 
per board with that of their respective controls 
indicates that there is no significant difference 
between the groups (Table 2). A further com- 
parison, relevant to the question of slowness, 
deals with the number of times Ss had boards 
withdrawn from them because of expiration of 
the time limit. A chi square of .003, df = 2, 
indicates that there is no association between 
brain injury and slow performance. The time 
factor was apparently not responsible for the 
poorer performance of the patients. 

Patients had significantly lower scores than 
normals when only the first five forms of the 
main test of the identification series are con- 
sidered (Table 2). Since this includes no more 
than 20 minutes of testing time, the fatigue 
factor was apparently not responsible for the 
lower score of the patients on the test as a 
whole. 

The correlation between the main scores 
and the CVS scores, which represent a post- 
injury intelligence measure for the patients, 
is .53 (p < .01) for the normals and .26 (p 
> .10) for the patients. The two correlations 
do not differ significantly (z = 1.06, p = .29). 
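
The paper does not say how the two correlations were compared. One standard approach, assumed here, is Fisher's r-to-z transformation for two independent samples. The function name is hypothetical, and the inputs are the rounded values printed above, so the result (about 1.05) differs slightly from the reported 1.06.

```python
import math


def compare_independent_correlations(r1: float, n1: int, r2: float, n2: int):
    """Compare two independent Pearson correlations via Fisher's z.

    Returns (z, two-tailed p). Assumes independent samples, which is a
    simplification for matched groups.
    """
    z1 = math.atanh(r1)  # Fisher transformation of the first correlation
    z2 = math.atanh(r2)  # Fisher transformation of the second correlation
    std_err = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    z = (z1 - z2) / std_err
    # Two-tailed probability under the standard normal distribution.
    p_two_tailed = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p_two_tailed


# Rounded values from the text: r = .53 (normals), r = .26 (patients), N = 24 each.
z, p = compare_independent_correlations(0.53, 24, 0.26, 24)
```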


TABLE 2 
SUMMARY OF SCORES FOR IDENTIFICATION SERIES 

                                      Mean 
Score                        Normals     Patients          t 
                             (N = 24)    (N = 24) 
Main                          25.1        20.1             3.74** 
First 5 forms (main)          16.1        13.9             2.63** 
Mean time per board           41.8        43.1              .15 
Tactual multiple choice        8.6         7.9             1.50 
Visual recognition 
  Number right                 5.4         4.0             3.60** 
  Number pointed to            7.1         5.5             [illegible]** 
  Number wrong                 2.3         [illegible]     [illegible] 
  Accuracy ratio              72.9        73.[illegible]    .02 

Note.—Tests of significance based on one tail of distribution 
for first three comparisons; on two tails for tactual multiple-choice 
and visual recognition data. 

** Significant at the .01 level. 




Tactual multiple-choice recognition test. The 
score on this test is a measure of immediate 
memory. The comparison reveals no significant 
difference between patients and normals 
(Table 2). 

Visual multiple-choice recognition test. Pre- 
sented at the end of the identification series, 
this test is a further measure of memory. 
Patients chose significantly fewer correct 
figures than normals (Table 2). The number of 
correct figures chosen is subject to the effect 
of several variables. The greater the total 
number of figures an S points to within the 
half minute available, the greater will be the 
probability of his including figures which are 
scored as correct. Normals pointed to sig- 
nificantly more figures than did the patients 
(Table 2). When the number of correct figures 
is weighted by the total number of figures 
pointed to, the resulting accuracy ratio fails 
to reveal a difference between patients and 
normals (Table 2). It appears that the greater 
number of correct figures chosen by the nor- 
mals is a result of their pointing to more 
figures within the time available. It may be 
possible to correct for this in the future by 
requiring a limited number of choices. Normals 
also pointed to more wrong figures than pa- 
tients, but this difference is not significant 
(Table 2). 
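
The weighting described above amounts to dividing the number of correct figures by the total number of figures pointed to. A minimal sketch follows; expressing the ratio as a percentage and the function name `accuracy_ratio` are assumptions, and the example counts are hypothetical.

```python
def accuracy_ratio(number_right: int, number_pointed_to: int) -> float:
    """Return correct choices as a percentage of all figures pointed to."""
    if number_pointed_to <= 0:
        raise ValueError("S must have pointed to at least one figure")
    if not 0 <= number_right <= number_pointed_to:
        raise ValueError("number_right must lie between 0 and number_pointed_to")
    return 100.0 * number_right / number_pointed_to


# Hypothetical S who pointed to 8 figures, 6 of them correct.
example_ratio = accuracy_ratio(6, 8)
```

Two Ss with different output, say 6 of 8 and 3 of 4, receive the same ratio, which is the sense in which the measure separates accuracy from productivity.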

Occasionally an S failed to recognize a form 
on all four steps of the identification series' 
main test. Patients tend to have more such 
failures than normals, as indicated by a chi 
square which approaches significance (chi 
square = 9.37, df = 4, p > .05). This raises 
the possibility that patients were not able to 
make as many correct choices as normals because 
they were familiar with fewer forms 
which they might recognize. Having failed to 
recognize the letter R tactually, for example, 
they were unable to recognize the R on the 
visual multiple-choice test. Accordingly, performance 
on the visual multiple-choice test 
was rescored considering only those forms 
which an S had recognized on either the main 
test or the tactual multiple-choice test of the 
identification series. The percentage scores so 
obtained indicate that normals performed 
significantly better than patients (p < .01 
on Wilcoxon's paired replicates test [21]). 

Discrimination series, main test. This test 
was designed as a test of Hypothesis II, regarding 
the differentiation between two simultaneously 
presented forms. Four patients were 
unable to perform on this test because they 
had only one functioning hand. The remaining 
20 patients had a mean score of 6.8, compared 
to a mean score of 7.6 for the matched normals. 
The difference between the two means is 
significant at the .05 level with a ¢ of 2.10, 
supporting Hypothesis II. Further analysis 
indicated that Hypothesis II is related to Hy- 
pothesis I. The discrimination scores corre- 
late with the identification main scores with an 
r of .47 (p < .05) for the 24 normals and with 
an r of .48 (p < .05) for the 20 organics. It is 
possible to evaluate the difference between the 
groups on the discrimination series, holding 
tactual identification ability, as measured by 
the main score, constant. A ratio which meas- 
ures performance on the discrimination series, 
independent of performance on the main test 
of the identification series, was obtained by 
dividing an S’s standard score for discrimina- 
tion by his standard main score on identifica- 
tion. The mean difference for this ratio, pa- 
tients versus normals, is not significant with a 
t of .25. 

The parietal area is said to be critical for 
tactual recognition of form (6, 11). An attempt 
was, therefore, made to determine whether 
patients with parietal lesions required a more 
complete outline of forms than patients with 
lesions in other areas in order to recognize 
the form tactually. The patient group con- 
tained only three cases with lesions restricted 
to the parietal area, but a group of 10 patients 
could be formed when all other patients where 
the parietal area was involved in the injury, 
but not exclusively, were added to the purely 
parietal cases. Only four of these cases had an 
injury in the dominant hemisphere and used 
the dominant hand. The mean main score of 
this heterogeneous group of 10, who had some 
form of parietal involvement, is significantly 
lower than the mean main score of the re- 
maining 14 patients where the parietal area 
was not involved in the injury (t = 1.80, 
df = 22, p < .05). The fact that the patients 
in the parietal group had injuries of various 
sizes implicating other areas to varying de- 
grees makes it impossible to attach much 
weight to this result. 

If tactual perception is considered as primarily 
localized in the parietal area, it might be 
suspected that the presence of parietal cases 
in the sample of patients resulted in the low 
score of the patient group on the test on which 
the answer to Hypothesis I was based. In line 
with Hypothesis I, a new comparison of the 
main scores for the identification series was 
made, using only the 14 patients who had no 
parietal involvement. The mean for this group 
was 22.0, compared to a mean of 25.3 for their 
normal controls. The difference is significant 
at the .05 level (t = 2.62), indicating that the 
difference between patients and normals re- 
mains when all cases with parietal involve- 
ment are excluded so that the conclusion 
relevant to Hypothesis I is not changed. 
Differences in test performance are frequently 
attributed to differences in the length 
of time since operation. The mean time since 
operation (not including cranioplasties) was 
7.9 months, with a standard deviation of 3.3; 
the median was 8 months, the range from 2 
to 15 months. To test for a possible dependence 
between time since operation and main score 
on identification series, a chi-square analysis 
was made, dichotomizing the patient group 
at the median of time since operation and at 
the median of identification main scores. A chi 
square of .001, df = 1, indicates that there is 
no significant tendency for long or short time 
since operation to vary with high or low main 
scores. When a Pearsonian correlation coefficient 
was computed for length of time since 
operation and main score, the nonsignificant r 
of .11 (p = .70) pointed to a similar conclusion. 
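
The median-split analysis can be sketched as a 2 x 2 chi square. This is an illustration under stated assumptions rather than the author's computation: cases falling exactly at a median are counted as "low," no continuity correction is applied, and the function name and example values are hypothetical.

```python
import statistics
from typing import Sequence


def median_split_chi_square(x: Sequence[float], y: Sequence[float]) -> float:
    """Chi square (df = 1) for two variables dichotomized at their medians."""
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    med_x, med_y = statistics.median(x), statistics.median(y)
    a = b = c = d = 0  # cell counts of the 2 x 2 table
    for xi, yi in zip(x, y):
        high_x, high_y = xi > med_x, yi > med_y  # ties count as "low" (assumption)
        if high_x and high_y:
            a += 1
        elif high_x:
            b += 1
        elif high_y:
            c += 1
        else:
            d += 1
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        raise ValueError("a row or column of the 2 x 2 table is empty")
    # Shortcut formula for a 2 x 2 table, without continuity correction.
    return n * (a * d - b * c) ** 2 / denom


# Hypothetical months since operation and main scores for four patients.
chi_square = median_split_chi_square([1, 2, 3, 4], [1, 4, 2, 3])
```

A value near zero, as in this example, means the high and low halves of one variable are spread evenly over the high and low halves of the other.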
Inasmuch as the patients took the Army 
classification test at the time of their induction 
into the service, the scores they obtained at 
that time represent a measure of their pre- 
morbid performance, while the CVS scores 
measure their performance after injury. The 
Army classification scores are based on one 
of two tests, the Army General Classification 
Test (AGCT) (19), or the Armed Forces 
Qualification Test (AFQT), which has super- 
seded the AGCT in recent years. The two tests 
correlate on the order of .92, and the scores 
are comparable in terms of the performances 
measured. In the following, classification test 
scores will be referred to as “AGCT” scores, 
regardless of which of the two tests they were 
based on. 
Alternate basis for matching. By matching 
the groups on the basis of the patients' postinjury 
intelligence scores it was shown that the 
lower tactual scores of the patients were independent 
of any general lowering of mental 
ability. In order to investigate differences in 
tactual scores between patients and normals 
when preinjury mental ability is held constant, 
a new matching was accomplished by pairing 
each patient with a normal who had the same 
"AGCT" score. Age and years of education 
were controlled for at the same time. Table 
3 shows that the mean differences for the 
matching variables were not significant. When 
the principal comparisons relevant to Hypotheses 
I and II are repeated with preinjury 
intelligence held constant in this manner, the 
differences in identification main scores and 
discrimination scores remain significant at the 
.01 and .05 levels respectively. 


TABLE 3 
GROUPS MATCHED FOR PREMORBID INTELLIGENCE 

                                  Mean 
Comparison                Normals    Patients     t 
"AGCT" score               59.0       59.6         .62 
Age                        24.0       22.8        1.27 
Education (years)          11.3       10.6        2.00 
Identification (main)      25.8       20.4        3.43** 
Discrimination              7.5        6.8        1.81* 

Note.—N = 20 for each group, except for discrimination scores 
where N = 16 for each group. Tests of significance based on two 
tails of distribution for first four comparisons, on one tail for 
analysis of identification and discrimination scores. 

* Significant at the .05 level. 

** Significant at the .01 level. 


DISCUSSION 


A tactual test was designed on the principle 
that outlines of a form can be presented in 
varying degrees of completeness and that the 
degree of completeness may be expressed in 
terms of the number of tactual stimuli of 
which the outline is composed and the distance 
between these stimuli. Two hypotheses con- 
cerning the performance of the brain-injured 
on this test were advanced. The hypothesis 
that patients with brain injury require a 
greater degree of completeness of a form out- 
line than normals in order to recognize the form 
tactually was confirmed. The hypothesis that 
patients with brain lesions differentiate less 
well than normals between two forms presented 
simultaneously through the tactual modality 
was also confirmed. 




With the present form of the test, the 
scores of patients and normals have too great 
an overlap for the technique to be valuable 
for differential diagnosis of brain injury, but 
the study suggests that with greater refine- 
ment and extensive standardization, a test 
of this kind may be made clinically useful. 

The results of the study are relevant to a 
number of theories, opinions, and practices 
current in the field of brain injury. One of 
these is the view that disturbed tactual recog- 
nition of form is associated with a lesion in the 
parietal area of the cortex (3, 6, 11). The data 
obtained in this experiment indicate that func- 
tions as complex as those required in the 
tactual test used can be interfered with by 
lesions in other areas as well. Inferior tactual 
ability can be found in patients who have 
undergone brain operations, grouped without 
regard to the cerebral area affected, although 
there is indication that patients with lesions 
involving the parietal area exhibit a greater 
decrement than patients with lesions in other 
areas. 

It would appear that the technique em- 
ployed in the present study is sensitive to dis- 
turbances of a general psychological function, 
under which tactual recognition of form can be 
subsumed. If the function of the intact cere- 
brum is viewed as one of integration or syn- 
thesis (4), the present findings might be 
interpreted as indicating a disturbance of the 
integrative function of the brain-injured subjects, 
manifesting itself in poor test performance. 

The present findings are further relevant 
to the statements by several writers (2, 5, 13, 
17) that patients with brain injuries experience 
difficulty when required to differentiate be- 
tween two simultaneously presented stimuli. 
It was shown that normals perform better 
than patients when they had to differentiate 
tactually between two simultaneously presen- 
ted forms. The patients’ deficit on the dis- 
crimination task can, however, be reduced to 
the more basic deficit which operates in 
tactual recognition of form so that simul- 
taneity, as such, does not seem to be of primary 
importance in reducing the performance level 
of brain-injured. 

Impulsiveness is another characteristic oc- 
casionally ascribed to brain-injured (14, 15). 
The data for the visual multiple-choice recognition 
test show that normals are significantly 
more productive than patients in terms of the 
number of forms pointed to within the time 
limit, but the greater number of wrong forms 
chosen suggests a lack of precision on the part 
of the normals. While patients point to fewer 
forms, the forms they do point to are more 
often correct than the many forms pointed to 
by the normals, so that the relative accuracy 
of the two groups is the same. The evidence 
thus points against impulsiveness on the part 
of the patients. It has also been suggested that 
brain-injured patients restructure their en- 
vironment on the level of their reduced capaci- 
ties (7). Such a phenomenon may be involved 
in the findings on the visual multiple-choice 
task. Patients appear to have reduced the 
quantity of their performance, thus main- 
taining the quality. 

It might be argued that the psychological 
reactions of the patients to the experience of 
brain surgery and hospitalization resulted in 
their lower tactual scores. Several factors 
refute such an explanation. The patients and 
the normals reacted to the testing situation in 
essentially the same manner. The instructions 
aimed at reducing test anxiety seemed to have 
had the desired effect on all but one patient, 
and he was eliminated. Besides this clinical im- 
pression, there is quantitative evidence which 
speaks against the presence of the so-called 
catastrophic reaction in the patients. Anxiety 
might slow down the performance of the pa- 
tients so that they would exceed the time 
limit significantly more often than the normals 
or show a longer mean response time per board. 
This was not the case. The opposite effect, 
that patients were giving rash and indiscrim- 
inate responses in order to escape the situation, 
is disproved by the mean response time data 
and by the data on the visual multiple-choice 
test, where patients showed more accuracy 
and discrimination of choice than normals. 

It might be argued alternatively that the 
tactual scores are an expression of intelligence 
and that they might have been predicted from 
a knowledge of intelligence scores based on 
more traditional tests. Since the differences 
between patients and normals predicted by 
the hypotheses prevailed both when preinjury 
and when postinjury intelligence was held 
constant, it would appear that the tactual 
scores are relatively independent of intelligence 




scores. It is possible that for diagnostic pur- 
poses a knowledge of both tactual and intel- 
ligence scores would be useful. 


SUMMARY 


A tactual test of perception of raised out- 
lines of forms was designed on the principle 
that such outlines can be presented in varying 
degrees of completeness and that the degree of 
completeness can be expressed in terms of the 
number of tactual stimuli of which the outline 
is composed and the distance between them. 
The power of this test to discriminate the per- 
formances of 24 brain-injured patients from a 
like number of normal adults was examined. 
Patients with brain injury were shown to re- 
quire a greater degree of completeness of form 
outline than normals in order to recognize the 
form tactually. When two stimuli were pre- 
sented simultaneously for tactual discrimina- 
tion, patients performed worse than normals, 
but when tactual recognition of form was held 
constant, the simultaneity factor appeared to 
contribute little to the difference between 
patients and normals. It was suggested that 
the technique might be refined and standard- 
ized to make it useful for differential diagnosis 
of brain injury. 


REFERENCES 


1. BENDER, LAURETTA. Disturbances in visuomotor 
Gestalt function in organic brain disease associated 
with sensory aphasia. Arch. Neurol. Psychiat., 
1933, 30, 514-537. 

2. BENDER, M. B., SHAPIRO, M. S., & TEUBER, H. L. 
Allesthesia and disturbance of body scheme. 
Arch. Neurol. Psychiat., 1949, 62, 222-236. 

3. BING, R., & HAYMAKER, E. Compendium of regional 
diagnosis in lesions of the brain and spinal 
cord. (11th Ed.) St. Louis: Mosby, 1940. 

4. BRICKNER, R. M. An interpretation of frontal lobe 
function based upon the study of a case of partial 
bilateral frontal lobectomy. Proc. Soc. Res. nerv. 
ment. Dis., 1934, 18, 259-351. 

5. CAMPORA, G. Astereognosis: its cause and mechanism. 
Brain, 1925, 48, 65-71. 

6. DEJONG, R. N. The neurologic examination. New 
York: Hoeber, 1950. 

7. GOLDSTEIN, K. The organism. New York: American 
Book Co., 1939. 

8. GOLDSTEIN, K., & SCHEERER, M. Abstract and concrete 
behavior: an experimental study with special 
tests. Psychol. Monogr., 1941, 53, No. 2 
(Whole No. 239). 

9. HALSTEAD, W. C. Brain and intelligence, a quantitative 
study of the frontal lobes. Chicago: Univer. 
of Chicago Press, 1947. 

10. HUNT, W. A., & FRENCH, ELIZABETH G. The CVS 
Abbreviated Individual Intelligence Scale. J. 
consult. Psychol., 1952, 16, 181-186. 

11. NIELSEN, J. M. Agnosia, apraxia, aphasia; their 
value in cerebral localization. (2nd Ed.) New 
York: Hoeber, 1946. 

12. PASCAL, G. R., & SUTTELL, BARBARA J. The Bender- 
Gestalt Test; quantification and validity for adults. 
New York: Grune & Stratton, 1951. 

13. PORTEUS, S. D., & KEPNER, R. D. Mental changes 
after bilateral prefrontal lobotomy. Genet. Psychol. 
Monogr., 1944, 29, 3-115. 

14. PORTEUS, S. D., & PETERS, H. N. Maze test validation 
and psychosurgery. Genet. Psychol. Monogr., 
1947, 36, 3-86. 

15. STRAUSS, A. A., & WERNER, H. Disorders of conceptual 
thinking in the brain-injured child. J. 
nerv. ment. Dis., 1942, 96, 153-172. 

16. TERMAN, L. M., & MERRILL, MAUD A. Measuring 
intelligence. New York: Houghton Mifflin, 1937. 

17. TEUBER, H. L. Neuropsychology. In R. E. Harris, 
et al., Recent advances in diagnostic psychological 
testing; a critical summary. Springfield, Ill.: 
Charles C Thomas, 1950. Pp. 30-52. 

18. THORNDIKE, R. L. Two screening tests of verbal 
intelligence. J. appl. Psychol., 1942, 26, 128-135. 

19. UNITED STATES WAR DEPARTMENT, ADJUTANT 
GENERAL'S OFFICE, CLASSIFICATION AND REPLACEMENT 
BRANCH, PERSONNEL RESEARCH 
SECTION. The Army General Classification Test. 
Psychol. Bull., 1945, 42, 760-768. 

20. WECHSLER, D. The measurement of adult intelligence. 
(3rd Ed.) Baltimore: Williams & Wilkins, 
1944. 

21. WILCOXON, F. Some rapid approximate statistical 
procedures. Stamford, Conn.: American Cyanamid 
Co., 1949. 


Received August 30, 1953. 


THE CONSISTENCY OF INTERPERSONAL BEHAVIOR JUDGMENTS 
MADE ON THE BASIS OF SHORT-TERM INTERACTIONS IN 
THREE-MAN GROUPS¹ 


ROBERT R. BLAKE, JANE SRYGLEY MOUTON, AND BENJAMIN FRUCHTER 
The University of Texas 


Direct interpersonal judgments made 
by strangers on the basis of short-term 
interactions may provide one 
basis for measuring individual personal and 
social characteristics of the kind that cannot 
be easily evaluated through objective tests. 
Is such behavior sufficiently consistent and 
can such judgments be made with sufficient 
reliability to justify their further consideration 
in connection with technical problems such as 
assembling individuals into effective work 
groups? Since previous studies (e.g., Barker 
[1], Bell and French [2], and Haythorn [3]) 
bear on this issue only indirectly, it seems 
desirable to seek a direct answer to these 
questions. 

This paper reports an experiment designed 
to determine the extent to which one technique 
of direct assessment produces reliable judg- 
ments concerning the personal and social 
characteristics in individual behavior. The 
approach to estimating the reliability employed 
in this investigation is somewhat different 
from the usual method. In the more conven-
tional reliability study, conditions in the
first administration are repeated in the second
administration. In the present investigation
the estimate of reliability obtained is depend- 
ent not only on the reliability of the judgments 
but also on the consistency of the behavior 
and similarity of the group structures in the 
two situations. Since these three sources of
variation are confounded in the design, high
consistency may be interpreted as indicating
little variation in any of the three sources.
In cases of low agreement, however, it cannot 
be determined which of the sources of variation 
account for it. The values obtained therefore 
can be considered estimates of the amount of 


¹ This research was supported in part by the United
States Air Force under Contract Number AF 18(600)-
602, monitored by the 3305th Research and Develop-
ment Group (Combat Crew Training Research
Laboratory), Human Resources Research Center,
Randolph Air Force Base, Texas. Permission is granted
for reproduction, translation, publication, use, and
disposal in whole and in part by or for the United
States Government.


consistency to be expected over situations 
where raters, tasks, and groups vary. 


EXPERIMENTAL DESIGN 


The general plan called for direct assessment 
and observer evaluation of the behavior of each 
of 33 subjects who interacted as members of 
three-man groups in two sessions. Changes 
between the two group sessions included
varying the task on which members worked, 
the composition of the groups, and the identity 
of the observers. These changes were introduced 
by the use of the following experimental plan. 

The general plan shown in Table 1 called 
for eleven interaction groups for each of two 
tasks. Every group consisted of three members 
who were unacquainted with one another 
at the beginning of the two discussion periods, 
with the same individuals participating in 
different groups during both periods. That is 
to say, in the second discussion period the 
membership of the groups was changed so that 
the three who interacted with one another 
during the second period were also unac- 
quainted with one another at the beginning 
of their interaction period. In addition the 
interaction was evaluated by trained observers 
whose schedules were arranged so that no 
observer watched the same person functioning 
in two different sessions. 
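
The two constraints described above (no two subjects meet in both sessions, and no observer watches the same subject twice) can be checked mechanically. The following is a minimal Python sketch, using only the four groups actually listed in Table 1; the function names `repeated_pairs` and `observer_repeats` are illustrative and are not part of the original study.

```python
from itertools import combinations

def repeated_pairs(session_a, session_b):
    """Return pairs of subjects who share a group in both sessions."""
    def pairs(groups):
        return {frozenset(p) for g in groups for p in combinations(g, 2)}
    return pairs(session_a) & pairs(session_b)

def observer_repeats(obs_a, obs_b):
    """Return {observer: subjects} for observers who saw a subject twice."""
    repeats = {}
    for observer, group in obs_a.items():
        seen_twice = set(group) & set(obs_b.get(observer, ""))
        if seen_twice:
            repeats[observer] = seen_twice
    return repeats

# Groups I-IV from Table 1, keyed by observer number (illustrative encoding).
session_1 = {1: "ABC", 2: "DEF", 3: "GHI", 4: "JKL"}
session_2 = {4: "AEI", 3: "BFJ", 2: "CGK", 1: "DHL"}

print(repeated_pairs(session_1.values(), session_2.values()))  # set()
print(observer_repeats(session_1, session_2))                  # {}
```

Both checks come back empty for the listed groups, which is what the plan requires.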

The behavior of individuals composing each 
group was evaluated in two ways. At the end 
of both discussion periods the three members 
filled in a 12-item direct-assessment question- 
naire describing their own behavior and 
that of the other two members. The observers 
also filled in the same questionnaire to describe 
the behavior they had observed in the mem- 
bers. The rankings received on these question- 
naires constitute the basic data for assessing 
the consistency of behavior between and 
within the two discussion periods. 

METHOD 
Subjects 


The Ss were 33 college students. Two restrictions 
were imposed in selecting them. The first was that they 




TABLE 1
EXPERIMENTAL PLAN USED TO INTRODUCE VARIATIONS
IN TASK, GROUP COMPOSITION, AND IDENTITY OF
OBSERVERS FOR TWO SESSIONS INVOLVING
THE SAME 33 SUBJECTS

Session |          | Group No.
        |          | I   | II  | III | IV  | ... | XI
1       | Members  | ABC | DEF | GHI | JKL |
        | Observer | 1   | 2   | 3   | 4   |
2       | Members  | AEI | BFJ | CGK | DHL | etc.
        | Observer | 4   | 3   | 2   | 1   |


should not be acquainted with one another prior to the 
beginning of the study. The second was that they 
should be undergraduate students, a restriction intro-
duced in order to insure homogeneity in the sample of
Ss who were being asked to rate one another. Twenty-
four of the Ss were men and nine were women.
Groups were arranged so that men and women worked
separately.


Observers 


The observers (Os) were 11 members of a class in 
social psychology. They were pretrained for making 
these kinds of observations by participating as Ss in 
three previous discussion groups where they filled out 
questionnaires similar to the ones used in this study.²


Group Task and Procedure 


Two tasks were used to produce group discussion. 
The groups were instructed to discuss the assigned 
topics, to come to agreement regarding a solution, and 
to write a brief report concerning the agreement 
reached. The first discussion was based on the question,
“Should college grades be abolished?” The second
task required the three Ss to construct one meaningful
sentence which contained the following three words:
to, too, and two. The second task was not so satisfactory
as the first for producing interaction. After 15 minutes
of interaction on the first task the groups were broken
up and each individual was given a questionnaire. It
included 12 items in terms of which each participant 
could rank himself and the two other members. Items 
were selected to permit assessment of personal and 
social characteristics that appeared in the small group 
discussions. In each case S was required to rank himself 
as one of the three members. The items used are shown 
in Table 2. 

On completion of this phase, new three-man groups 
were created so that each individual interacted with 
two individuals different from those who had been 
members of his first group. After a 15-minute discussion 
period on the second task these groups were broken 
up and the same questionnaire was administered to 


² The class members who conducted the sessions and
served as Os were Richard Austin, Dorothy Babin,
Robert Bell, John Bowen, Jack Brackeen, Jack Hain,
Julian Hill, Bette Perry, Glenn Rosson, Alice Smith,
and Jane Woodbridge.




them. After both sessions, Os filled in the same ques- 
tionnaire to judge the personal and social characteristics 
of the three group members. Each O ranked three 
different people on each of the two occasions, making 
their rankings without knowledge of the rankings 
made by the group members themselves or by other Os. 
Control of the previously mentioned factors pre- 
cludes the possibility of obtaining high consistency 
between rankings due to the introduction of factors 
such as previous acquaintance, recall on the second 
occasion of rankings made of other members on the 
preceding occasion, or continuity in performance based 
on discussion of the same topic. Significant agreement 
between rankings can be interpreted as due to consist- 
ency in the judgments of the behavior of Ss as evaluated 
by the technique employed. Absence of significant 
agreement cannot be interpreted as an inconsistency of 
judgments since the lack of agreement may be due to 
differences in the tasks or structure of the groups. 


Statistical Treatment of Data 


The distributions of rankings were trichotomized 
and chi square was used as the statistical test in making 
the various item comparisons, such as between ad- 
ministrations, between Ss, between Os, and so on, 
and the contingency coefficient (C) was computed as 
the index of the degree of relationship. A limitation 
of the scoring method is that the rankings within 
any one group are not independent of one another 
since ties were not permitted. The contingency coeffi- 
cient was preferred to the Pearson product-moment 
coefficient since it seemed improbable that the distribu- 
tion characteristics needed for applying the latter 
method, i.e., continuous normal distributions, were 
obtained in the data. They do meet the conditions for 
applying the chi-square test, however, with the excep- 
tion of low expected frequencies in some cells. 
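
As an illustration of the statistical treatment just described, the sketch below computes chi square and the contingency coefficient for a 3 × 3 table of trichotomized ranks. It is a sketch under stated assumptions: the article does not print the formula for C, so Pearson's contingency coefficient, C = sqrt(χ² / (N + χ²)) with N = 33, is assumed here (it reproduces tabled values such as C = .62 for χ² = 20.92), and the cell counts are hypothetical.

```python
import math

def chi_square(table):
    """Pearson chi square for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    return chi2

def contingency_coefficient(chi2, n):
    """Pearson's C = sqrt(chi2 / (n + chi2)); assumed, not quoted from the article."""
    return math.sqrt(chi2 / (n + chi2))

# Hypothetical counts: rows = rank received in session 1, columns = session 2.
table = [[7, 3, 1],
         [3, 5, 3],
         [1, 3, 7]]
chi2 = chi_square(table)
c = contingency_coefficient(chi2, n=33)
print(round(chi2, 2), round(c, 2))
```

With only 33 subjects spread over nine cells, every expected frequency in this example is below 5, which is the limitation on low expected frequencies noted in the text.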


RESULTS 
The Consistency of Judgments between Sessions 


The design of the study permits determina- 
tion of the consistency of judgments between 
discussion periods by comparing rankings 
received on the questionnaire during the 
first period with those received during the 
second. It will be recalled that the people Ss 
interacted with on the second occasion were 
different from those on the first, and that no S 
was acquainted with the other two members at 
the beginning of the two interaction periods. 
Furthermore, the topic of discussion was 
different.

The consistency of judgments between 
sessions can be looked at from different points 
of view, including the individual’s evaluation of
his own behavior (self-self), the evaluation of 
his behavior by two other group members with 
whom he was participating (member-member), 


and the Os’ evaluations of his behavior
(observer-observer). In addition the combined


TABLE 2 
ITEMS USED FOR THE DIRECT ASSESSMENT OF INDIVIDUAL BEHAVIOR 


Instructions: This questionnaire is to be filled out as you saw the members of your group interacting during this
period. For each item you are to rank the members of the group in order of preference. Use the number of each
person to identify the members. Include yourself in the ranking.

Item | Abbreviation used in text
1. Rank the group members in terms of who contributed most to the group decision. | Contribution to group decision
2. Rank the group members in terms of who did most of the clerical work. | Clerical work
3. Rank the group members in terms of who most often functioned as the leader. | Leadership
4. Rank the group members in terms of who did most to keep the group “on the ball.” | On ball
5. Rank the group members in terms of who was most agreeable and willing to go along with what the others wanted. | Agreeable and willing
6. Rank the group members in terms of who was the most dominant in the group situation. | Dominant
7. Rank the group members in terms of who most wanted to do a good job. | Good job
8. Rank the group members in terms of who was the most cooperative. | Cooperative
9. Rank the group members in terms of who was most satisfied with the group decisions. | Satisfied with group decision
10. Rank the group members in terms of who was most frustrated by the group situation. | Frustrated
11. Rank the group members in terms of who showed the most interest. | Interest
12. Rank the group members in terms of who yielded most to others. | Yielded

TABLE 3 
CONSISTENCY OF JUDGMENTS BETWEEN SESSIONS 
Item | Self-Self | Member-Member | Observer-Observer | Group-Group
     | χ² | p | C | χ² | p | C | χ² | p | C | χ² | p | C
1. Contribution to group decision | 3.60 | — | .31 | 9.57 | .05 | .48 | 7.09 | .20 | .32 | 7.07 | .20 | .42
2. Clerical work | 10.01 | .05 | .48 | 4.61 | — | .35 | 7.64 | .20 | .33 | 12.31 | .02 | .52
3. Leadership | 20.92 | .01 | .62 | 9.33 | .10 | .47 | 10.90 | .05 | .50 | 9.48 | .10 | .47
4. On ball | 10.39 | .05 | .49 | 6.37 | .20 | .40 | 4.36 | — | .34 | 18.38 | .01 | .60
5. Agreeable and willing | 3.05 | — | .29 | 5.57 | — | .38 | 5.99 | .20 | .39 | 6.12 | .20 | .40
6. Dominant | 13.18 | .02 | .53 | 3.76 | — | .32 | 10.36 | .05 | .49 | 19.41 | .01 | .61
7. Good job | 7.14 | .20 | .32 | 4.53 | — | .35 | 2.18 | — | .25 | 4.42 | — | .34
8. Cooperative | 8.16 | .10 | .45 | 1.80 | — | .23 | 11.99 | .02 | .52 | 3.13 | — | .29
9. Satisfied with group decision | 10.22 | .05 | .49 | 2.88 | — | .28 | 2.73 | — | .28 | 2.39 | — | .26
10. Frustrated | 5.25 | — | .37 | 6.35 | .20 | .40 | 3.81 | — | .32 | 6.94 | .20 | .42
11. Interest | 11.45 | .05 | .51 | 9.71 | .05 | .48 | 3.81 | — | .32 | 2.68 | — | .27
12. Yielded | 5.04 | — | .36 | 5.17 | — | .37 | 2.18 | — | .25 | 2.83 | — | .28


ratings of the three group members (the S’s 
self-judgments and the judgments by the two 
other members) were used as a basis for
evaluating the individual's behavior on a
group basis (group-group). The statistical
summaries for these comparisons are shown in
Table 3.
An examination of Table 3 shows that 
there is some consistency in judgments as 
assessed by certain items on the questionnaire. 
By way of summarizing the findings reported in
Table 3, it seems clear that reliable judgments 


about characteristics of individual behavior 
can be made even on the basis of extremely 
short periods of interaction. Items like leader-
ship (No. 3), interest (No. 11), and dominance
(No. 6) were reliably judged in a number of the 
comparisons made. This finding supports the 
view that direct judgments about personal 
and social characteristics can be used as 
measures of certain aspects of behavior under 
conditions of short-term interaction when 
conditions in the first administration are 
different from those in the second administra- 


TABLE 4 
CONSISTENCY OF JUDGMENTS WITHIN EACH OF THE Two SESSIONS 
Item | Observer-Member (1) | Observer-Member (2) | Observer-Group (1) | Observer-Group (2)
     | χ² | p | C | χ² | p | C | χ² | p | C | χ² | p | C
Contribution to group decision | 27.27 | .01 | .67 | 11.48 | .05 | .51 | 29.03 | .01 | .68 | 15.46 | .01 | .57
Clerical work | 18.54 | .01 | .60 | 14.68 | .01 | .56 | 33.27 | .01 | .71 | 24.44 | .01 | .65
Leadership | 26.72 | .01 | .67 | 14.94 | .01 | .56 | 18.85 | .01 | .60 | 20.93 | .01 | .62
On ball | 18.88 | .01 | .60 | 5.48 | — | .38 | 16.92 | .01 | .51 | 5.96 | — | .39
Agreeable and willing | 2.00 | — | .24 | 5.50 | — | .39 | 10.12 | .05 | .48 | 9.61 | .05 | .48
Dominant | 16.62 | .01 | .58 | 5.13 | — | .37 | 12.05 | .02 | .52 | 14.64 | .01 | .55
Good job | .55 | — | .13 | 8.72 | .10 | .46 | 1.79 | — | .23 | 4.05 | — | .33
Cooperative | 1.73 | — | .22 | 2.48 | — | .26 | 8.28 | .10 | .45 | 6.47 | .20 | .40
Satisfied with group decision | 4.20 | — | .34 | 2.68 | — | .27 | 2.70 | — | .28 | 1.76 | — | .23
Frustrated | 2.35 | — | .26 | 5.72 | — | .38 | 5.17 | — | .37 | 2.30 | — | .26
Interest | 1.33 | — | .20 | 5.33 | — | .37 | 6.86 | .20 | .42 | 7.59 | .20 | .43
Yielded | 8.93 | .10 | .46 | 14.40 | .01 | .55 | 2.68 | — | .27 | 8.04 | .10 | .44

Item | Self-Observer (1) | Self-Observer (2) | Self-Member (1) | Self-Member (2)
     | χ² | p | C | χ² | p | C | χ² | p | C | χ² | p | C
Contribution to group decision | 24.28 | .01 | .65 | 16.68 | .01 | .58 | 10.67 | .05 | .49 | 9.99 | .05 | .48
Clerical work | 34.10 | .01 | .71 | 24.60 | .01 | .65 | 21.59 | .01 | .63 | 14.14 | .01 | .55
Leadership | 15.61 | .01 | .57 | 24.37 | .01 | .65 | 11.90 | .02 | .52 | 12.88 | .02 | .53
On ball | 23.75 | .01 | .65 | 1.60 | — | .22 | 18.46 | .01 | .60 | 2.51 | — | .27
Agreeable and willing | 9.65 | .05 | .48 | 3.80 | — | .32 | 7.02 | .20 | .42 | 15.03 | .01 | .56
Dominant | 7.24 | .20 | .42 | 13.30 | .01 | .54 | 6.44 | .20 | .40 | 7.20 | .20 | .42
Good job | 4.33 | — | .34 | 4.67 | — | .35 | 3.19 | — | .30 | 3.15 | — | .30
Cooperative | 6.75 | .20 | .41 | 7.12 | .20 | .42 | 4.50 | — | .35 | 7.75 | .20 | .44
Satisfied with group decision | .40 | — | .11 | .42 | — | .11 | 2.39 | — | .26 | .97 | — | .17
Frustrated | 8.61 | .10 | .46 | 3.12 | — | .29 | 4.37 | — | .34 | 2.96 | — | .29
Interest | 5.41 | — | .38 | 8.29 | .10 | .45 | 1.58 | — | .21 | 5.84 | — | .39
Yielded | 5.99 | .20 | .39 | 4.90 | — | .36 | 3.20 | — | .30 | 6.94 | .20 | .42


tion. This generalization seems to be true 
regardless of whether judgments are made by 
Os, by group members, or by Ss themselves. 
On the other hand judgments concerning 
other personal and social characteristics such 
as yielded (No. 12), desired to do a good job (No. 
7), and satisfied with the group decision (No. 9) 
were not made reliably on a between-session 
basis. While these data demonstrate the 
feasibility of using direct-assessment tech- 
niques to judge certain personal and social 
characteristics, further experimental work is 
needed to determine what kinds of characteris- 
tics are most amenable to reliable observation. 


The Consistency of Judgments within Sessions 


The second approach concerns the degree of 
agreement between different judges looking at 
the same sample of behavior. One question 


here is concerned with the relationship 
between Os’ rankings and judgments of each 
individual by the other two group members. 
Another relates to rankings given by the 
group as a whole to each member as contrasted 
with the judgments given by the Os. The 
difference between these two comparisons is 
that in the latter case the S’s self-judgment is 
included. The same types of questions can be 
asked by comparing the self-judgments with 
the Os’ judgments and by comparing the 
self-judgments with the judgments given 
by the other two group members. The statisti- 
cal summaries for each of these comparisons 
are presented in Table 4. 

An examination of Table 4 indicates con- 
siderable agreement in the assessment of 
behavior on a within-session basis. This agree- 
ment appears when comparisons are made 




TABLE 5 
SUMMARY OF LEVELS OF SIGNIFICANCE OF ITEMS BETWEEN AND WITHIN SESSIONS 
Item | Between Sessions | Within Sessions
     | Observer-Observer | Member-Member | Self-Self | Group-Group | Observer-Member | Self-Observer | Self-Member | Observer-Group
     |  |  |  |  | Session 1 | Session 2 | Session 1 | Session 2 | Session 1 | Session 2 | Session 1 | Session 2
Contribution to group decision | II | I | III | II | I | I | I | I | I | I | I | I
Clerical work | II | III | I | I | I | I | I | I | I | I | I | I
Leadership | I | II | I | II | I | I | I | I | I | I | I | I
On ball | III | II | I | I | I | III | I | III | I | III | I | III
Agreeable and willing | II | III | III | II | III | III | I | III | II | I | I | I
Dominant | I | III | I | I | I | III | II | I | II | II | I | I
Good job | III | III | II | III | III | II | III | III | III | III | III | III
Cooperative | I | III | II | III | III | III | II | II | III | II | II | II
Satisfied with group decision | III | III | I | III | III | III | III | III | III | III | III | III
Frustrated | III | II | III | II | III | III | II | III | III | III | III | III
Interest | III | I | I | III | III | III | III | II | III | III | II | II
Yielded | III | III | III | III | II | I | II | III | III | II | III | II


between judgments given by Os and members, 
between members’ judgments and self- 
judgments, between Os’ judgments and 
self-judgments, and between judgments by 
Os and the group as a whole. 

The tabulation also indicates that consider- 
able agreement obtains between the judgments 
made by independent Os for a single interaction 
situation. Items that produce consistently
reliable judgments are those involving contribu-
tion to the group decision (No. 1), clerical work
(No. 2), and leadership (No. 3). On the other 
hand satisfied with group decision (No. 9) and 
frustrated (No. 10) were the least reliably 
judged of any in Table 4. The remaining 
items were judged consistently in some 
comparisons but not in others. 

Of particular interest is the extent of 
agreement between the self-judgments on the 
one hand and those by the Os and by other 
group members on the other. Self-judgments 
are usually not utilized in analytic situations 
of this type, but apparently they agree well 
with the ratings made by others in regard to 
certain aspects of behavior such as contribution
to the group decision (No. 1), clerical work
(No. 2), and leadership (No. 3). This statement
seems to hold for both task situations used
in the present study. 

In order to provide a comparative analysis 
of the value of items from both a between-
session and a within-session point of view, a


summary of the significance levels for the 
different comparisons on an item-by-item 
basis is presented in Table 5. The items are 
grouped into categories on the basis of their 
levels of significance. Category I contains all 
items significant beyond the 5 per cent level. 
Category II includes items significant between 
the 5 per cent and the 20 per cent levels. 
Category III contains the items that fail to 
achieve the 20 per cent level of significance. 
The items in the first category reach an 
acceptable standard for rejecting the null 
hypothesis. The second category contains 
items which, while not demonstrated to be 
significant within the limited sampling of the 
present study, merit further consideration 
because of their possible value for scaling 
behavior of the type under analysis. Items for 
which there is no demonstrated significance 
from one interaction period to another are 
included in the third category. 
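
The three-way grouping just described amounts to a simple threshold rule on the tabled significance level. The sketch below is illustrative only: the function name `significance_category` is hypothetical, a dash in Tables 3 and 4 is represented as `None`, and a tabled level of .05 is assumed to fall in Category I.

```python
def significance_category(p_level):
    """Map a tabled significance level to Category I, II, or III.

    p_level is the tabled level (e.g. 0.01, 0.05, 0.20), or None where the
    table shows a dash (not significant at the 20 per cent level).
    """
    if p_level is None:
        return "III"
    if p_level <= 0.05:   # assumption: a tabled .05 counts as Category I
        return "I"
    if p_level <= 0.20:
        return "II"
    return "III"

# Between-session levels for leadership (No. 3) as given in Table 3.
leadership = {"self-self": 0.01, "member-member": 0.10,
              "observer-observer": 0.05, "group-group": 0.10}
categories = {k: significance_category(p) for k, p in leadership.items()}
print(categories)
```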

An inspection of this table shows, for 
example, that the item dealing with contribu- 
tion to group decision (No. 1) produced reliable 
judgments on a within-session basis and that 
the judgments for this item are also reliably 
assessed on a member-member basis from a 
between-session standpoint. The same kind of 
statement can be made for the leadership 
(No. 3) and dominant (No. 6) items. On the 
other hand, items such as good job (No. 7) and 
satisfied with group decision (No. 9) do not 




seem to be judged consistently on a within- 
session basis, nor do they produce consistent 
judgments from a between-session standpoint. 
A more detailed examination of this table 
will identify the items likely to be useful in 
the evaluation of behavior samples on a 
short-term interaction basis for the types of 
situations used in this study. 


SUMMARY AND CONCLUSIONS 


This paper dealt with the reliability of 
social judgments based on direct assessment 
of Ss engaged in short-term interaction. 
In order to assess the reliability of these kinds 
of judgments an experimental design was 
devised in which the conditions in the two 
administrations were different.

The general plan called for the direct 
assessment and observer evaluation of the 

social and personal characteristics of the same
33 Ss in two completely independent inter- 
action situations. Changes between the two 
group situations included the task on which 
the members worked, the composition of the 
groups, and the identity of the Os. On comple- 
tion of each task, group members were asked 
to fill in a 12-item, direct-assessment question-
naire on which they judged their own behavior
and that of the other two group members on a
ranking basis. The Os judged the behavior




of the group members using the same question- 
naire. The results confirm the proposition 
that reliable judgments of short-term inter- 
action can be made even when the two 
situations are different. Items that permit 
consistent assessment from both the between- 
and within-sessions points of view include 
leadership (No. 3), contribution to group 
decision (No. 1), and dominance (No. 6). Also 
of interest is the finding that Ss’ judgments of 
their own behavior, a type of data frequently 
disregarded, are related significantly to the 
judgments made by Os and other group 
members and therefore merit further investiga- 
tion for possible utility. These results support 
the view that direct-assessment data can serve 
as one basis for evaluating personal and social 
characteristics that cannot be satisfactorily 
measured by other techniques. 


REFERENCES 


1. BARKER, R. G. The social interrelations of strangers
and acquaintances. Sociometry, 1942, 5, 169-179.

2. BELL, G. B., & FRENCH, R. L. Consistency of
individual leadership position in small groups of
varying membership. J. abnorm. soc. Psychol.,
1950, 45, 764-767.

3. HAYTHORN, W. The influence of individual members
on the characteristics of small groups. J. abnorm.
soc. Psychol., 1953, 48, 275-284.


Received March 17, 1954. Prior publication. 


CASE REPORT 


A BLIND ANALYSIS OF A CASE OF MULTIPLE PERSONALITY USING 
THE SEMANTIC DIFFERENTIAL 


CHARLES E. OSGOOD AND ZELLA LURIA
University of Illinois 


FOR MORE than a year the writers have
been collecting semantic data from pa-
tients undergoing psychotherapy. This
has been feasible through the cooperation of
psychotherapists in various parts of the
country. When the manuscript of “A Case of
Multiple Personality” (3) arrived, the editor
of this journal, without our knowledge, sug-
gested to Thigpen and Cleckley that it would
be interesting to have semantic data from
each of the “personalities” in their patient.
Thigpen and Cleckley accepted the suggestion
and administered a form of the differential we
have been using for this purpose to their pa-
tient in each of her three “personalities.” The
editor also suggested that we might see how
much these semantic data would allow us to
infer about the patient without our having
any knowledge of the case history, protocol,
or prognosis.¹ This appeared to be a useful
and rather intriguing way to estimate the
validity and sensitivity of this instrument.
If we could infer descriptions of the three
personalities which correspond with clinical
observations, and if we could make reasonably
accurate interpretations and predictions, we
would be encouraged to continue our efforts
to improve the semantic differential as a
clinical tool.


THE SEMANTIC DIFFERENTIAL 


The semantic differential is a combination 
of association and scaling procedures de- 
signed to give an objective measure of the 
connotative meaning of concepts. The under- 
lying logic (1) can be summarized as follows: 
(a) The process of description or judgment
can be conceived as the allocation of a con- 


¹ We wish to thank Dr. J. McV. Hunt, editor of this
Journal, for utilizing the opportunity presented by the
work of Thigpen and Cleckley to arrange for the data
in this study, for suggesting the blind analysis, and for
his general encouragement. We are also very grateful
to Drs. Thigpen and Cleckley for giving the semantic 
differential to their patient and for their interest in the 
instrument. 


cept to a set of experiential continua defined 
by pairs of polar terms. Thus the connotative 
meaning of a linguistically complex assertion, 
such as “My father has always been a rather 
submissive person," can be at least partially 
represented as 


MY FATHER   active  __:__:__:__:__:_X_:__  passive
MY FATHER   soft    __:_X_:__:__:__:__:__  hard


The greater the strength of association, e.g., 
“extremely submissive, a regular door- 
mat,” the more polarized, toward 1 or 7, the 
allocation (2). Since many scales of judgment 
are highly intercorrelated (e.g., good-bad, fair- 
unfair, honest-dishonest, kind-cruel, and so 
forth all reflect mainly the single “evaluative” 
factor in judgments), a limited number of 
such continua can be used to define a semantic 
space within which the connotative meaning 
of any concept can be specified. This clearly 
indicates some variant of factor analysis as 
the basic methodology in developing such an 
instrument. Two such analyses have been 
completed, both providing evidence for three 
general factors, “evaluation,” “potency,” and 
“activity,” and some unknown number of 
specific factors that are probably denotative 
in nature. 

The form of semantic differential we have 
been using in studying psychotherapy is 
based on this factor analytic work. In the 10 
scales used, it gives approximately equal 
weight to the first three factors isolated. These 
scales and their factor loadings are given in 
Table 1. The 15 concepts used in this form of 
the differential were selected after consulta- 
tion with clinicians and pretesting for their 
differentiating power. Ideally, they should 
sample the major persons and problems in- 
volved in therapy-in-general; we are not 
entirely satisfied with the present set, how-
ever, and more work should be done here. The
concepts used are also shown in Table 1. In 
the test form itself, concepts are rotated 
against scales in such a way that each con- 




cept appears once with each scale, but with a 
maximum interval between successive ap- 
pearances of both. The subject is instructed 
to do his checking rapidly, without struggling 
over particular items, to give his “immediate 
impressions.” A 150-item form such as this
usually takes less than 10 minutes to com- 
plete. 

Reordering the raw data for a single sub-
ject on a single testing yields a matrix of N
columns (here, 15 concepts) and k rows (here,
10 scales). The meaning of a particular con- 
cept to the subject, as defined by the opera- 
tions of measurement here, is the profile of 
numbers in its column (or, more efficiently, 

the position in the n-dimensional space de- 
fined by the projection of these numbers onto 
the factors). Difference in meaning for two 
concepts is defined by the distance between 
their positions in this space, as computed by 
the generalized distance formula, $D = \sqrt{\sum d^2}$,
in which d is the difference in allocation of the 
two concepts on a single scale (2). The more 
similar any two concepts are in connotative 
meaning, the smaller will be the value of D. 
Change in meaning (of the same concept at 
different times during therapy, or in different 


TABLE 1 
CONCEPTS AND SCALES USED IN THIS ANALYSIS

Concepts
LOVE      | MENTAL SICKNESS | SELF-CONTROL
CHILD     | MY MOTHER       | HATRED
MY DOCTOR | PEACE OF MIND   | MY FATHER
ME        | FRAUD           | CONFUSION
MY JOB    | MY SPOUSE       | SEX

Scales and Their Factor Loadings
Scales             | Evaluation | Activity | Potency
valuable-worthless | .79        | .13      | .04
clean-dirty        | .82        | .03      | -.05
tasty-distasteful  | .77        | -.11     | .05
fast-slow          | .01        | .70      | .00
active-passive     | .14        | .59      | .04
hot-cold           | -.04       | .46      | -.06
large-small        | .06        | .34      | .62
strong-weak        | .19        | .20      | .62
deep-shallow       | .27        | .14      | .46
tense-relaxed      | -.55       | .37      | -.12




"personalities") can be defined by the same 
operation, except that d here refers to the 
differences in allocation of the same concept 
on the same scale at different testings. The 
mathematical properties of this formula also 
allow us to represent the semantic structure of
an individual in a concise form; computation 
of the distance, D, of every concept from 
every other concept yields an N/N matrix 
(here, 15/15) of distances which have the 
property of plotting within a space having 
dimensionality equal to the number of factors.
To the extent that the individual subject
being studied uses the same three factors 
isolated in our general factor work, his data 
will plot accurately in three dimensions. 
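
To make the distance computation concrete, the following Python sketch applies the generalized distance formula to concept profiles and builds the full concept-by-concept matrix. It is only a sketch: the three profiles are hypothetical ratings on the 10 seven-step scales, not the patient's data, and the function names are illustrative.

```python
import math

def distance(profile_a, profile_b):
    """Generalized distance D: square root of the summed squared scale differences."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must use the same scales")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(profile_a, profile_b)))

def distance_matrix(profiles):
    """Return {(concept_i, concept_j): D} for every pair of concepts."""
    return {(i, j): distance(profiles[i], profiles[j])
            for i in profiles for j in profiles}

# Hypothetical ratings (1-7) of three concepts on the 10 scales.
profiles = {
    "ME":        [2, 2, 3, 4, 3, 4, 4, 3, 3, 5],
    "MY MOTHER": [1, 1, 2, 4, 4, 4, 3, 3, 2, 4],
    "HATRED":    [7, 6, 7, 3, 2, 5, 4, 5, 4, 7],
}
d = distance_matrix(profiles)
print(round(d[("ME", "MY MOTHER")], 2), round(d[("ME", "HATRED")], 2))
```

The same `distance` function measures change in meaning when the two profiles are the same concept rated at two testings, since only the source of the differences d changes.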


THE SEMANTIC DATA 


At this point we should state exactly what
information we have about this case. We know
that we are dealing with a case of triple
personality, and these have been labeled for
us (presumably by the therapists who col-
lected the semantic data) “Eve White,” “Eve
Black,” and “Jane.” We suppose that the
“White” and “Black” have some connotative
significance—certainly, as will be seen, the
quantitative semantic data distinguish sharply
between them. We also know, of course, that
the patient is a woman, presumably partici-
pating in some kind of therapy; we do not
know the stage of therapy or whether or not
she is hospitalized. We considered it also fair
to ask (from J. McV. Hunt) about the follow-
ing items of sociological status, because they
contribute to the meaningful interpretation of
certain concepts: Concept CHILD—does this
woman have a child? Yes, she does. Concept
SPOUSE—is this woman married? Yes, she is.
Concepts FATHER and MOTHER—are her
parents alive? The mother is, but Hunt
doesn’t know about the father. Concept MY
JOB—has this woman had a job outside of
homekeeping? Yes, she has. This is the sum
total of our external information about the
case.

The semantic differential was given to this
woman twice while “in” each of her three
personalities; a period of about 2 months
intervened between the two testings. The
raw semantic data for each of the three per-
sonalities are given in Tables 2-4. The roman
numerals I and II refer to first and second




testings respectively.² We take the space to
give these raw data in full so that anyone who 
is interested may study them and test any 
particular “hypotheses” about the case he 
wishes. Low values in these tables indicate 
judgments toward the polar term on the left 
and high values judgments toward the polar 
term on the right. The data in these tables 
show the semantic profiles, or meanings, of
each concept at each testing and while “in” 
each personality. Since the form given at each 
testing was actually a double form (each item 
repeated once), we were able to estimate the 
reliability of these data. The immediate test- 
retest reliability coefficients for each of the 
testings are as follows: Eve White I, .82; Eve 
White II, .90; Eve Black I, .65; Eve Black 
II, .89; Jane I, .89; Jane II, .94. These coeffi-
cients indicate (a) a generally satisfactory
level of reliability, (b) a consistent trend in 
all three personalities toward greater stability 
through time, and (c) that Jane is the most 
consistent or stable personality over short in- 
tervals of time and Eve Black is the least. 
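
As a rough illustration of how an immediate test-retest coefficient could be obtained from a double form, the sketch below correlates the first and second check-mark given to each repeated item. It assumes the coefficient is an ordinary product-moment correlation (the text does not name the statistic), and the ratings are made-up values, not the patient's data; the names `first_pass` and `second_pass` are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equal-length rating lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical 7-step check-marks: first and second appearance of each item.
first_pass = [1, 7, 4, 6, 2, 5, 3, 7]
second_pass = [2, 7, 4, 5, 1, 5, 4, 6]

reliability = pearson_r(first_pass, second_pass)
print(round(reliability, 2))
```

A coefficient near 1 means the two check-marks for the same item agree closely; a lower value, as reported for Eve Black I, means the repeated judgments scatter more.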

To obtain measures of semantic similarity 
and structure, we computed the matrices of D 
for each concept with every other concept, for 
each personality and testing. With an ordi- 
nary desk calculator and a table of square 
roots, these operations are very simple and 
rapid. In order to conserve space, the six 
matrices of D are not given here. These “dis-
tances” are based on application of the
formula given earlier across all 10 scales. For
convenience in plotting the models which
appear as Fig. 1-6, the data for scales con- 
tributing to each of the three factors were 
averaged and new D’s computed. This, in 
effect, forces those data into three dimen- 
sions and, hence, into solid models that have 
no error. The very slight amount of distortion, 
or loss of information, resulting from this 
averaging process and restriction to three 
dimensions can be seen from the following 
correlations between original (10 scale) and
“factor” D matrices (3 average scales): Eve
White I, .91; Eve White II, .93; Eve Black I,
.96; Eve Black II, .98; Jane I, .86; Jane II,
.92. In other words, nearly all of the variance
² These values are actually averages of two check-
marks, a double form of the semantic differential having
been used for reliability purposes. In this double form, 
each item appears twice. 




in this woman’s judgments can be accounted 
for in terms of three factors. Figures 1-6, then, 
provide quite accurate representations of the 
ways various concepts are related in each of 
the personalities; the smaller the distance 
between any two concepts the more similar in
connotative meaning they are. 
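
A minimal sketch of the computations just described may be useful. It assumes that D is the square root of the summed squared differences between two profiles, and it uses made-up ratings together with an assumed assignment of scale positions to the three factors; the names `profiles` and `factor_scales` are illustrative and do not come from the paper.

```python
from math import sqrt


def distance(p, q):
    """D between two profiles: square root of the summed squared differences."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def d_matrix(profiles):
    """D for every concept against every other concept, as nested dicts."""
    names = list(profiles)
    return {a: {b: distance(profiles[a], profiles[b]) for b in names} for a in names}


def factor_scores(profile, factor_scales):
    """Average the scales assigned to each factor: one score per factor."""
    return [sum(profile[i] for i in idx) / len(idx) for idx in factor_scales]


# Hypothetical 10-scale profiles (ratings 1-7); not the patient's data.
profiles = {
    "ME": [5, 5, 6, 4, 5, 6, 5, 4, 5, 6],
    "MY DOCTOR": [1, 2, 1, 2, 1, 2, 3, 2, 3, 2],
    "LOVE": [1, 1, 2, 1, 3, 3, 5, 5, 4, 4],
}
# Assumed grouping of scale positions into three factors (illustrative only).
factor_scales = [(0, 1, 2, 3), (4, 5, 6), (7, 8, 9)]

full_d = d_matrix(profiles)  # distances across all 10 scales
reduced = {c: factor_scores(p, factor_scales) for c, p in profiles.items()}
factor_d = d_matrix(reduced)  # distances in the three-factor space
print(round(full_d["ME"]["MY DOCTOR"], 2), round(factor_d["ME"]["MY DOCTOR"], 2))
```

Averaging within factors before computing D is what allows each concept to be placed as a point in a three-dimensional model.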


THE THREE PERSONALITIES AND THEIR 
CHANGES THROUGH TIME 


The general assumption we are following is 
that “mental illness” is essentially a dis- 
ordering of meanings or ways of perceiving 
from those characteristic of people judged 
“normal” in our society, and that the process 
of psychotherapy from the patient’s point of 
view is essentially a reordering and changing 
of these meanings. Within the limitations of 
our type of measurement and our sampling of 
concepts, the locations and relations among 
concepts shown in Fig. 1-6 can be thought of 
as pictures of how this woman perceives her-
self, the significant people about her, and
certain modes of action, when functioning
“in” her several personalities.³ We assume
that this woman is receiving some kind of 
treatment through the period covered by our 
two samplings, I and II, and therefore look 
particularly for the types of changes in mean- 
ing that are taking place in the three per- 
sonalities, as well as at the general nature of 
their organization. For purposes of ready 
comparison, all of the models are oriented in 
respect to the concept MY DOCTOR, which
stays practically constant in meaning (good,
strong, and quite active) through both time and
personalities; spatially, in the figures, good is 
up and bad down, active to the left and passive 
to the right, and strong is away from the 
viewer while weak is near to or toward the 
viewer; the solid ball represents the origin
of the space, i.e., a hypothetical “meaning-
less” concept that would result from checking
all 4's.
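
The role of the origin can be illustrated with a short sketch: a concept checked at 4 on every scale lies at the origin, and the nearer a profile is to that point the less connotative meaning it carries. The ratings below are invented for illustration, and the helper name `distance_from_origin` is hypothetical.

```python
from math import sqrt

NEUTRAL = 4  # midpoint of a 7-step scale; checking all 4's gives the origin


def distance_from_origin(profile):
    """Distance of a profile from the all-4's point of the semantic space."""
    return sqrt(sum((rating - NEUTRAL) ** 2 for rating in profile))


# Hypothetical 10-scale profiles; not the patient's data.
profiles = {
    "MY DOCTOR": [1, 1, 2, 1, 7, 7, 6, 2, 1, 2],
    "ME": [5, 6, 5, 6, 3, 2, 3, 5, 5, 4],
    "MY SPOUSE": [4, 4, 3.5, 4, 4.5, 4, 4, 4, 3.5, 4],
}

# Concepts ordered from least to most meaningful in this sense.
ranked = sorted(profiles, key=lambda c: distance_from_origin(profiles[c]))
print(ranked)
```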


Eve White 


Semantic structures for Eve White I and 
II are shown in Fig. 1 and 2. The most general
characterization would be that Eve White 
perceives “the world" in an essentially normal 
fashion, is well socialized, but has an unsatis- 


³ The authors wish to thank Professor Jozef Cohen
for his help in preparing these figures. 


CHARLES E. OSGOOD AND ZELLA LURIA 


TABLE 2
Raw Data for Eve White

[Ratings on each scale for each concept at testings I and II; cell values are illegible in this copy.]


TABLE 3
Raw Data for Eve Black

[Ratings on each scale for each concept at testings I and II; cell values are illegible in this copy.]




TABLE 4 
Raw Data For JANE 


[Ratings on each scale for each concept at testings I and II; cell values are illegible in this copy.]




Fig. 1. Eve White I


factory attitude toward herself. Here the usual 
societal “goods” are seen favorably: MY
DOCTOR, MY FATHER, LOVE, SELF-CONTROL,
PEACE OF MIND, and MY MOTHER are all good 
and strong whereas FRAUD, HATRED, and to 
some extent CONFUSION are bad. The chief 
evidence of disturbance in the personality is 
the fact that ME (the self concept) is con-
sidered a little bad, a little passive, and defi-
nitely weak. Substantiating evidence is the 
weakness of her CHILD, as she sees him (or 
her), and the essential meaninglessness to her 
of MY SPOUSE and SEX. Note also the wide
evaluative separation between LOVE and SEX. 
In the interval between testings I and II ME
and SEX become more bad and passive and
simultaneously become almost identical in
meaning to her, and note that her concep-
tions of LOVE (a good, strong thing) and SEX


Fig. 2. Eve White II


(a bad, weak thing like herself) have moved 
still further apart. 


Eve Black 


Semantic structures for Eve Black I and II 
are shown in Fig. 3 and 4. The most general 
characterization here would be that Eve Black 
has achieved a violent kind of adjustment in 
which she perceives herself as literally perfect, 
but, to accomplish this break, her way of per- 
ceiving “the world" becomes completely dis- 
oriented from the norm. The only exceptions to 
this dictum are MY DOCTOR and PEACE OF
MIND, which maintain their good and strong
characteristics, the latter, interestingly
enough, also becoming active on II. But if
Eve Black perceives herself as being good,
then she also has to accept HATRED and
FRAUD as positive values, since (we assume)
she has strong hatreds and is socially fraudu-
lent. So we find a tight, but very un-normal,
favorable cluster of ME, MY DOCTOR, PEACE
OF MIND, HATRED, and FRAUD. What are posi-
tive values for most people (CHILD, MY
SPOUSE, MY JOB, LOVE, and SEX) are com-
pletely rejected as bad and passive, and all of
these except CHILD are also weak (this may be
because CHILD was weak in Eve White and
much of the change here is a simple “flip-flop”
of meanings). Note that it is MOTHER in this
personality that becomes relatively meaning-
less; FATHER, on the other hand, stays good
but shifts completely from strong (in Eve
White) to weak; possible implications of
these familial identifications will be con-
sidered later. Note also that in this per-
sonality LOVE and SEX are closely identified,
both as bad, weak, passive things.


Fig. 3. Eve Black I


Fig. 4. Eve Black II


Jane


Semantic structures for Jane I and II are
shown in Fig. 5 and 6. The general charac-
terization is that Jane displays the most
“healthy” meaning pattern, in which she ac-
cepts the usual evaluations of concepts by her
society yet still maintains a satisfactory evalua-
tion of herself. MY FATHER, MY MOTHER, MY
CHILD, and MY DOCTOR (most of the signifi-
cant persons in her life) are seen as good,
strong, and active. The major modes of be-
havior, PEACE OF MIND, LOVE, SELF-CONTROL,
and MY JOB, are seen as equally good and
strong, but somewhat passive, as if these ways
of behaving and thinking were simply ac-
cepted without stress. The two socially
agreed-upon evils, HATRED and FRAUD, are
put in their proper places. The most signifi-
cant characteristics of Jane’s meaning system,
however, are these: The self concept, ME,
while still not strong (but not weak, either) is
nearer the good and active directions of the
semantic space; note also the close identifica-
tion of ME and MENTAL SICKNESS, which here
is not an unfavorable concept to her. Her
attitude toward her husband, MY SPOUSE, is
for the first time meaningful (unlike Eve
White) and tending toward the good, strong,
active directions, like the other significant
persons (unlike Eve Black). And LOVE and
SEX (quite unlike Eve White) are both favor-
able and quite closely identified. The changes
from testings I to II are simply such as to
strengthen the “healthy” pattern evident in
the first view. ME becomes considerably more
good and active; MY SPOUSE for the first time
becomes completely identified connotatively
with MY DOCTOR and MY FATHER (and loses its
tie with CONFUSION); and LOVE and SEX be-
come intimately identified with each other
and close in meaning to SELF-CONTROL and
PEACE OF MIND.


Fig. 5. Jane I

The thumbnail semantic sketches of each 
personality just given make it intuitively 
evident that the semantic differential does 
draw sharp distinctions between the three 
personalities inhabiting one nervous system. 
It is possible to demonstrate these distinctions 
quantitatively by intercorrelating D matrices 
between personalities and over time. If two of 
our models are generally similar in structure, 
such that large and small distances between 
concepts in one are reflected also in the other, 
then the r will be high. Table 5 gives these 
correlations. The first thing to note is that 
the correlation of each personality with itself 
(i.e., testings I and II) is regularly much
higher than the correlation of that personality 
with any other personality (with the single 
exception of Eve White I and Jane I). This 




TABLE 5

CORRELATIONS OF D-MATRICES BETWEEN PERSONALITIES AND OVER TIME

             White I     White II   Black I   Black II   Jane I   Jane II
White I        --
White II       .73         --
Black I       -.06         --         --
Black II       --         -.02        .86       --
Jane I     [illegible]     --        -.26       --         --
Jane II        --          .53        --       -.08        .92      --


is quantitative justification for the statement 
that the semantic differential does differ- 
entiate between the several personalities of 
this woman. Whether it differentiates in a 
valid way is a matter that can be judged only 
by relating our analysis to the detailed case 
history material available elsewhere (3). 
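
The intercorrelation of two D matrices can be sketched as follows. The sketch assumes that each matrix is reduced to its list of concept-pair distances, taken in the same order, and that the two lists are then correlated by the product-moment formula; the four concepts and all distance values are invented for illustration.

```python
from itertools import combinations
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


concepts = ["ME", "LOVE", "SEX", "FRAUD"]
pairs = list(combinations(concepts, 2))  # the 6 distinct concept pairs

# Hypothetical D values per concept pair for three structures.
d_first = dict(zip(pairs, [1.0, 1.2, 2.5, 0.4, 2.0, 2.2]))   # personality A, testing I
d_second = dict(zip(pairs, [1.1, 1.0, 2.6, 0.5, 2.1, 2.4]))  # personality A, testing II
d_other = dict(zip(pairs, [2.3, 0.5, 0.6, 2.0, 1.0, 0.7]))   # personality B, testing I

r_same = pearson_r([d_first[p] for p in pairs], [d_second[p] for p in pairs])
r_other = pearson_r([d_first[p] for p in pairs], [d_other[p] for p in pairs])
print(round(r_same, 2), round(r_other, 2))
```

A high r means that concept pairs lying far apart in one structure also lie far apart in the other; a low or negative r means the two structures are organized differently.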
Another important thing to note about

these correlations is that Eve White and Jane 
(the two “socialized” personalities) are fairly 
highly correlated whereas the correlations of 
Eve Black with the other two are definitely 
low, even negative. In other words, Eve 
Black is clearly the most deviant and dis- 
ordered personality. Finally, it should be 
noted that these three personalities differ 
somewhat in their stability, as indexed by the 
I/II correlations, Eve White being the least
stable and Jane the most. 


CHANGES IN MEANING OF SPECIFIC CONCEPTS 


As noted earlier, the meaning of a specific 
concept is operationally defined as its profile
against the differential (i.e., its position in
semantic space). Change in meaning between
testings I and II can be measured directly by
the D between I and II profiles for the same
concept (i.e., between the paired columns in
Tables 2-4). These D values are given under
“Within Personalities, Between Testings” in
Table 6. Changes in meaning between per-
sonalities for the same concepts can be meas-
ured directly by the D between profiles for
the same concept but as judged in two differ-
ent personalities; these D values are given
under “Between Personalities, Within Test-
ings” in Table 6.
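
A small sketch of the within-personality measure follows: for each concept, D is taken between its profile at testing I and its profile at testing II, and the concepts with the largest D are the most labile. The three-factor scores below are invented, so the resulting values are not comparable with those in Table 6, and the names `testing_1` and `testing_2` are hypothetical.

```python
from math import sqrt


def distance(p, q):
    """D between two profiles of the same length."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


# Hypothetical factor scores for one personality at the two testings.
testing_1 = {"MY DOCTOR": [1.5, 2.0, 2.5], "ME": [4.5, 5.5, 4.5], "SEX": [4.0, 4.0, 4.0]}
testing_2 = {"MY DOCTOR": [1.5, 2.0, 2.5], "ME": [5.0, 5.5, 5.0], "SEX": [5.5, 5.5, 5.0]}

# Change in meaning of each concept between testings I and II.
change = {c: distance(testing_1[c], testing_2[c]) for c in testing_1}
most_labile = max(change, key=change.get)
print(most_labile, round(change[most_labile], 2))
```

The between-personality measure is the same calculation with the two profiles taken from different personalities at the same testing.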


Semantic Stability Through Time


In general, although the differences are not
great, Eve Black is the least stable personality 




TABLE 6
CHANGES IN MEANING OF SPECIFIC CONCEPTS

                   Within Personalities,      Between Personalities, Within Testings
                   Between Testings
                   White   Black   Jane       White-Black     White-Jane      Black-Jane
CONCEPT                                        I      II       I      II       I      II
LOVE                .42     .67     .23       1.58   1.44      .35    .57     1.62   1.81
CHILD               .71     .96     .37       1.65   1.40      .68    .54     1.47   1.41
MY DOCTOR           .15     .05     .27        .23    .23      .28    .25      .30    .12
ME                  .36     .32     .42       1.21   1.40      .60    .88      .83    .77
MY JOB              .27     .62     .42       1.30   1.43      .49    .43     1.19   1.54
MENTAL SICKNESS     .45     .45     .19       1.24   1.38      .40    .32     1.30   1.47
MY MOTHER           .54     .78     .46        .71    .78      .66    .23     1.02    .68
PEACE OF MIND       .40     .35     .41        .86    .66      .21    .28      .81    .61
FRAUD               .64     .12     .40       1.46   1.35      .73    .34     1.29   1.22
MY SPOUSE           .30     .62     .47        .67    .96      .61    .89     1.04   1.75
SELF-CONTROL        .32     .25     .24        .78    .80      .34    .39      .92   1.01
HATRED              .51     .49     .44       1.54   1.31      .51    .23     1.37   1.19
MY FATHER           .53     .71     .09       1.06    .60      .25    .43     1.06    .43
CONFUSION           .64     .67     .44        .86    .96      .71    .42      .98    .88
SEX                 .62     .34     .47       1.10    .62      .63   1.20     1.45   1.76


Note.—Numbers in italics indicate concepts that serve best to
characterize differences between Eve White and Jane.


through time and Jane the most stable (cf.
columns 2-4 in Table 6). The concept-by-
concept data thus confirm the stability of
Jane as shown in the structural data given
above. For Eve White the most unstable or
labile concepts are CHILD, MY MOTHER, FRAUD,
HATRED, MY FATHER, CONFUSION, and SEX. For
Eve Black the most labile concepts are LOVE,
CHILD, MY JOB, MY MOTHER, MY SPOUSE, MY
FATHER, and CONFUSION. For Jane the most
unstable notions are ME, MY JOB, MY MOTHER,
MY SPOUSE, HATRED, CONFUSION, and SEX. We
note that the family constellation (mother,
father, spouse, child) tends to be more sus-
ceptible to change through time within these
personalities, but that the self concept stays
relatively constant within personalities (i.e.,
the location of the self concept, in a sense,
defines these three personalities). HATRED,
SEX, and CONFUSION also seem to be points of
stress.


Semantic Stability Between Personalities 


The data given in columns 5-10 in Table 6 
make it clear that concept meanings in general 
shift more between Eve Black and the other
two than between Jane and Eve White, this 
again substantiating the over-all correlations 
between total structures. The only concept
that remains strictly constant in meaning
through the personality changes in this woman
is MY DOCTOR, although PEACE OF MIND and
CONFUSION show greater stability than most 
others. It is interesting to note which con- 
cepts serve best to characterize the differences 
between Eve White and Jane (Eve Black
shows gross differences on almost all con-
cepts). These two “socialized” personalities
differ from one another chiefly on ME, MY
SPOUSE, and SEX, and these differences are
increasing in magnitude through time. This
clearly suggests this woman’s sexual life as a
core problem, Eve White being highly critical
of all three concepts and Jane accepting them
as positive values. It is also interesting to note
in this connection that semantic differences
between Eve White and Jane on CHILD, MY
MOTHER, FRAUD, HATRED, and CONFUSION are 
decreasing through time. 


INTERPRETATIONS OF THESE SEMANTIC DATA 


The analyses of these personalities and their 
changes given so far have been descriptive 
rather than interpretive for the most part. In 
a sense, we have merely put into words what 
this woman herself, in her several personali- 
ties, has indicated by her check marks. The
treatment of semantic differential data, from 
the patterns of check marks to construction 
of the models shown in Fig. 1-6, is completely 
objective, and any investigator starting from 
the same data and following the rules would 
have to end up with the same pictures we 
have. 

Making interpretations and predictions 
about this case on a “blind” basis is another 
matter entirely. In this section we go far 
beyond the objective data, and we are conse- 
quently much less confident about our state- 
ments. For one thing, neither of the writers is 
an experienced clinician—certainly not ex- 
perienced with respect to the dynamics and 
characteristics of multiple personality. For 
another thing, we do not know at what stage 
in therapy our two testings were made, and 
interpretation would certainly vary greatly in 
terms of such information. It should also be 
pointed out that in the ordinary use of the 
semantic differential as a clinical tool (as
compared with a blind analysis) many other
sources of information would be available to
support certain alternative interpretations 
and render others farcical. Let it be under- 
stood, then, that what follows is a flight into 




conjecture, in contrast with the preceding, 
factual reporting of semantic data. 


Interpretive Descriptions of the Three Per- 
sonalities 

Eve White is the woman who is simultane- 
ously most in contact with social reality and 
under the greatest emotional stress. She is 
aware of both the demands of society and her 
own inadequacies in meeting them. She sees 
herself as a passive weakling and is also con- 
sciously aware of the discord in her sexual 
life, drawing increasingly sharp distinctions 
between LOVE as an idealized notion and SEX 
as a crude reality. She maintains the greatest 
diversity among the meanings of various 
concepts. She is concerned and ambivalent 
about her CHILD, but apparently is not aware
of her own ambivalent attitudes toward her
MOTHER, and seems to become more resistant
to this by testing II. Those psychoanalytically
inclined may wish to identify Eve White with 
dominance of the superego: certainly, the 
superego seems to view the world through the 
eyes of Eve White, accepting the mores or 
values of others (particularly her mother) but 
continuously criticizing and punishing her- 
self. If this case came to the psychotherapists
with a voluntary, self-initiated plea for help, 
then it seems likely that Eve White was 
dominant at the time. 

Eve Black is clearly the most out of contact 
with social reality and simultaneously the 
most self-assured. To rhapsodize, Eve Black 
finds PEACE OF MIND through close identifica-
tion with a God-like therapist (MY DOCTOR,
probably a father symbol for her), accepting 
her HATRED and FRAUD as perfectly legitimate 
aspects of the God-like role. Naturally, she 
sees herself as a dominant, active wonder- 
woman and is in no way self-critical. She is 
probably unaware of her family situation. 
Those psychoanalytically inclined could say
that the id looks out at the world through the
eyes of Eve Black. Like a completely selfish
infant, this personality is entirely oriented 
around the assumption of its own perfection. 
Actually, Eve Black seems to be more har- 
monious with the Adlerian than with the 
Freudian model, since personal perfection is 
apparently the demand acceded to rather 
than sexuality. If the case was committed to 
an institution, it seems likely that this per- 
sonality was the reason for commitment. 




Jane is the most puzzling of the three per-
sonalities, and our interpretation will have to
depend upon assumptions about the stage of 
treatment (see below). Superficially, Jane is 
a very healthy personality: “all’s well with the 
world, and day by day I’m getting better and 
better.” Thus we find all the people in her 
life perceived as good and strong and active 
and all the socially approved modes of action 
perceived as good and strong and passive; 
SEX is LOVE-ly, her SPOUSE is becoming more
like the noble DOCTOR all the time, and she is
coming to perceive herself, even, as a pleasant
and reasonably active (if somewhat weak and 
submissive) person. But all this is a little too 
rosy, a little too pat. We note that Jane is 
becoming more and more “simple-minded”
(all of her judgments tending to fall along a
single factor of good-strong vs. bad-weak),
which makes the Jane II model the most
restricted and undiversified of all. Those 
psychoanalytically inclined may wish to view 
this personality as representing dominance of 
a self-deceptive ego which has woven a web of 
repression as to the state of reality; or, they
may wish to view Jane as an essentially 
strong, healthy, and improving ego-dominated 
personality. In any case, we doubt if Jane 
would have either come for therapy or have 
been institutionalized—as such. 


Possible Dynamisms Operating in the Case 


Identification mechanisms. We say the pa-
tient “identifies” with some other person when
her meaning of herself, ME, is semantically
close to her meaning of the other person; e.g.,
if she sees her father as a kind, active, relaxed,
etc. person and describes herself in the same
terms, we infer identification. However, the
pattern of identifications displayed by this
patient seems unusual. Only in Eve Black, the
obviously disoriented personality in terms of
her values, is there clear differential identifi-
cation, with her FATHER (and this may re-
flect the semantic tie-up between FATHER and
MY DOCTOR). Jane shows some slight tend-
ency toward closer identification with MOTHER,
but it is not close. Eve White shows none
with either parent. The fact that identifica-
tion with FATHER (and MY DOCTOR) in Eve
Black is accompanied by rejection of MOTHER
to meaninglessness is suggestive of an under-
lying conflict in identifications. Note also, in
this connection, that in Eve Black the




ascendancy of ME to the good, strong, active 
position is accompanied by making FATHER 
weak—as if she were taking over her father’s 
role and putting her mother in her own previ- 
ous place. And, interestingly enough, the 
concept SELF-CONTROL suffers the same fate 
as MOTHER. This picture of Eve Black is 
certainly suggestive of an Electra complex as 
the underlying dynamism. In “real” life, her 
MOTHER is or was the dominant, threatening 
figure—moralizing, demanding standards and 
SELF-CONTROL—and in Eve Black this woman 
escapes the pressure by rendering both 
MOTHER and SELF-CONTROL meaningless and 
simultaneously identifying with and taking 
her FATHER'S place (via the therapist). Sug-
gestive evidence may be found in Tables 2-4:
MOTHER is consistently colder than FATHER and 
usually more tense and fast (e.g., Factor 3). 
Identification of the self with the therapist 
in Eve Black is perfect, of course. The con- 
cept MY DOCTOR is the only personal concept 
to show perfect stability both between per- 
sonalities and through time. The patient thus 
displays what might be called maximal positive 
transference in all three personalities; there is 
no sign of any negative transference at either 
testing, which may be indicative of the stage 
of therapy (e.g., early). 

Significance of the patient’s sexual life.
Although Jane shows a rosy acceptance of
normal sexual patterning, with LOVE and
SEX linked and passively favorable, Eve
White clearly displays awareness of a basic
conflict in this area: SEX is early somewhat
more distasteful than LOVE and becomes
distinctly distasteful and dirty by testing II.
In Eve White also we find ME and MY SPOUSE
becoming linked with SEX in this unpleasant
location. Eve Black, on the other hand, re-
jects both SEX and LOVE, but closely links
them in her thinking. If we were to relate
these facts with the Electra situation de-
scribed above, the interpretation would be
that her persisting conflict with her mother
and attempts to identify with her father make
it impossible for her to experience normal
sexual satisfactions with her husband and to
carry out the normal mother-wife-home role.
Eve White is aware of this, in a sense, but
Jane clearly is not. The concept MY JOB is
interesting in this connection: its persistent
linkages with LOVE, PEACE OF MIND, and
SELF-CONTROL in the two “socialized” per-
sonalities, coupled with its linkage with SEX




and MY SPOUSE in Eve Black, clearly suggests 
to us that this woman is interpreting MY JOB
in the sense of “my job as a mother, wife, and
homebuilder” rather than in terms of her 
outside work (which we understand she has). 
In any case, there is clear evidence of involve- 
ment with her sex life as a major problem, 
and this may have been the presenting prob- 
lem when she began therapy. 

Repression and amnesia. Knowing that we 
are dealing with a case of multiple person- 
ality—usually characterized by complete 
dissociation between states—it is interesting 
to speculate on what meanings are repressed 
in the several personalities. It will be recalled 
that, operationally, meaninglessness of a con-
cept is defined by its closeness to the origin
(the solid balls in the models). This is prob-
ably to be interpreted as “connotative dead-
ness” or “damping of affect” with respect to
the concept involved. Within the matrix of 
our instrument, however, there is another 
way in which repression or amnesia may 
show up, and that is via a complete shift in 
meaning of the concept being judged (e.g., 
CHILD may shift from the personal reference 
of “my own youngster” to “children-in- 
general”). Looking back at the semantic data 
with these points in mind, we hazard the 
following guesses: Eve White probably has 
the best contact with reality and may not be 
amnesic at all (except for the other person- 
alities); Eve Black may be amnesic for her 
mother and her own role as a mother and 
wife; Jane in Pollyanna fashion may be 
amnesic for her own problems, e.g., MENTAL 
SICKNESS and CONFUSION, and the indis- 
criminate way in which she lumps all socially 
favorable concepts at least suggests that she 
is judging CHILD, MOTHER, etc. in the abstract 
rather than as MY CHILD, MY MOTHER, and so 
forth. 


Interpretation I—Assumed Early Stage of 

Therapy 

The “original” personality, in the sense of 
being most characteristic of the woman her 
friends and relations knew, was Jane. The first 
testing of this personality shows a relatively 
weak ME that is associated with MENTAL 
SICKNESS; i.e., she was dimly aware of her
own inadequacies but was striving to main- 
tain a rigid acceptance of the real world and 
maintain an adequate home life. The people 
about her, with the exception of her husband, 




were seen as strong and active (perhaps 
threatening) in relation to herself, and her 
love life was regarded as a sort of deliberate, 
controlled duty. She was completely un- 
aware of her (repressed) emotional ambiva- 
lence toward her mother, husband, and child. 
The things being hidden in this personality, 
and providing the force behind the eventual 
split, were (a) her Electra complex, (b) her 
repugnance for sexual relations with her 
husband, and (c) her ambivalent attitude 
toward herself. We suspect that she had a
position in society that demanded “good
front.”

We must assume strong and about equal 
pressures toward solving the Electra complex, 
(a) by identifying with FATHER and asserting 
the self (id?), and (b) toward solving it by 
identifying with MOTHER and devaluing her-
self (superego?). This produces a two-way
split away from the Jane pattern, one into 
Eve Black where selfish needs for superiority 
and playing the father role are achieved and 
another into Eve White where societal needs 
for submission and playing the mother role 
are achieved. This split, and the subsequent 
availability of the other roles, allows Jane to 
shift toward the “sweetness and light” view 
of the world, and this is clearly demonstrated 
by the changes between Jane I and Jane II. 
Eve White continues to become more simply 
and rigidly self-critical and Eve Black con- 
tinues to become more simply and rigidly 
self-satisfied. 

Assuming successful therapy is possible— 
which seems questionable—it will involve less 
and less time being spent in being Jane and 
Eve Black and a consequent shift into Eve 
White, where better contact and differentia- 
tion seems to be maintained. But here it will 
be necessary to bring Eve White to under- 
stand the reason for her depression, the role 
of her ambivalence toward her mother in her 
problem—which shows no signs of happening 
yet—and thence a gradual restructuring in 
which ME becomes more favorable, along with 
SPOUSE and SEX, and identifications with
FATHER and MOTHER are reassigned. This will
probably involve a period of negative trans-
ference, with MY DOCTOR and MOTHER becom-
ing closely identified and being temporarily
shifted to bad, strong, and active directions of
the semantic space. In other words, successful 
treatment will mean increasing time spent in, 


CHARLES E. OSGOOD AND ZELLA LURIA 


and a gradual restructuring of, Eve White
to the point where it incorporates what is now
Jane, but with a realignment of significant
persons. On the other hand, if this woman is
in a mental institution and remains there, it 
seems likely that Eve Black will become the 
dominant house she lives in. In either case, it 
is probable that Jane will appear less and less. 


Interpretation II—Assumed Late Stage of 
Therapy 


If we assume that we are seeing the ter-
minal stages of therapy with a case of this
sort, then a quite different interpretation is
necessary. The difference in interpretations
hinges upon Jane, either as a deceptive and 
vanishing original personality (interpretation 
I) or as an increasingly healthy and aug- 
menting personality (interpretation II). In 
the latter case we would assume that Eve
White had been the “original” personality as
people knew her—a socially acceptable wife 
and mother, but one laden with conflicts, 
anxieties, and self-criticism. The split in this 
case—into one personality in which the self- 
criticism completely disappears via irra- 
tionality (Eve Black) and another in which 
self-criticism vanishes via rationality (Jane)— 
seems less sensible, however. Jane seems un-
necessary at this stage and really should have
developed out of Eve White rather than being
contemporaneous. If we assume there was a
split in any case, Jane is clearly the most 
healthy personality, since LOVE and SEX are
identified, the world is viewed in acceptable 
fashion, and the self concept is becoming 
more favorable all the time. The prediction 
here would be increasing time spent in Jane 
and less in the others. The stumbling blocks 
in the way of this interpretation are (a) the 
lack of any realignment of the system of 
parental identifications, and (b) the fact that 
Jane is becoming less diversified semantically
(more “simple-minded”) rather than the
reverse. This second interpretation was ac-
tually the one we first adopted—because of
the superficial “healthiness” of Jane—but
consideration of all the evidence seems to
favor the first interpretation. However, it
should be noted that if this case is near the
end of successful therapy, Jane is the only
personality that combines both a normal view
of the world and reasonable (increasing) ac-
ceptance of the self.




BLIND ANALYSIS WITH SEMANTIC DIFFERENTIAL 


Interpretation III—Combination of I and II 


It is possible to combine interpretations I 
and II by assuming that Jane is both the 
original personality which broke apart and 
the terminal personality which is being de- 
veloped out of therapy. In this case, the early 
development of the case, probably in child- 
hood, would be the same as that given under 
interpretation I—the conflicting parental 
identifications (id and superego determined 
respectively) were of about equal strength and 
finally became too intense to be contained 
within the self-deceptive personality organiza- 
tion of Jane. During the middle course of the 
case, when therapy was undertaken, we thus 
find all three personalities oscillating, tem- 
porary dominance of the mother and wife 
role being represented by Eve White, tem-
porary dominance of the self-gratifying father
role being represented by Eve Black, and
temporary dominance of the face-saving, 
problem-solving ego being represented by 
Jane. Intentionally or unintentionally, the 

effect of therapy may be to strengthen the
self-deceptive organization of Jane without 
resolving the underlying conflicts dramatized 
by Eve White and Eve Black. The over- 
simplified, Pollyanna-like ways of perceiving 
herself as good along with all the other signifi- 
cant persons in her life yields a superficially 
happy person who views the world in an ac- 
ceptable, if rigidly stereotyped, fashion. If 
the present combined interpretation approxi- 
mates the actual situation, then we feel 
compelled to predict another breakdown at
some later period in this person’s life. In
other words, the effect of therapy (whatever 
type it may have been) seems to have been 
further to strengthen the  self-deceptive 
original organization of Jane, while making 
this personality even more rigid and insensi-
tive to subtle differences in meaning and
without resolving the underlying conflicts 
which created the original disturbance. 


What Price Therapy? 


It is impossible to tell from our semantic 
data whether the increasing simplification in
structure characteristic of all three person-
alities is due to therapy itself or is happening
despite therapy. However, a number of spe-
cialists in psychotherapy have from time to
time expressed concern over the “hidden” 
effects of therapy even in so-called successful 




cases—particularly reduction in initiative, 
creativeness, and flexibility of the patient. 
Certainly in the present case we are wit- 
nessing an over-all reduction in differentiation 
of meanings. If overt behavior is in consider- 
able part determined by meanings, as we be- 
lieve it is, then we must expect Jane (if she is 
the terminal personality) to be now even less 
capable of behaving differentially to her 
mother, father, spouse, and child—they are 
all essentially undifferentiated “strong-active- 
goodness” to her. This would also be true, 
but to a lesser extent, of Eve White, although 
here we would assume an earlier stage of 
therapy and hence a possibility of secondary 
elaboration of semantic diversity under sensi-
tive therapy.

Is rigidity of this sort a necessary price of 
therapy? In striving to achieve the goals of 
societal acceptability and individual happi-
ness, does the therapist have to sacrifice the
richness, individuality, and subtler adjustive- 
ness of the patient? These are serious ques- 
tions raised by the data of this single case— 
but not answered by them, of course. From 
the larger sample of cases we are presently 
working on, better answers may be forth- 
coming, but the cases are generally less severe. 
One other interesting phenomenon in the 
present cases should be mentioned: Despite 
the gross changes in meaning of concepts in 
the several personalities, and the over-all 
reduction in diversity, the semantic judg- 
mental frame of reference remains constant. 
In other words, all three of the personalities 
in this woman utilize semantic scales in the 
same ways—the correlations between scales 
are the same for all three personalities and 
reduction in diversity in all of them is accom- 
plished by a coalescence of good, strong, and 
active into a single evaluative dimension. Thus 
it would appear that the level of scale mean- 
ings is below that at which concepts vary, and 
common to all three personalities. 


REFERENCES 


1. OSGOOD, C. E. The nature and measurement of
meaning. Psychol. Bull., 1952, 49, 192-237.

2. OSGOOD, C. E., & SUCI, G. J. A measure of relation
determined by both mean difference and profile
information. Psychol. Bull., 1952, 49, 251-262.

3. THIGPEN, C. H., & CLECKLEY, H. A case of multiple
personality. J. abnorm. soc. Psychol., 1954, 49,
135-151.


Received December 1, 1953. 


CRITIQUE AND NOTES 


EFFECT OF INCREASING THE NUMBER OF RORSCHACH RESPONSES 
ON SUM C AND M: A NOTE ON FISKE AND BAUGHMAN'S STUDY 


CONAN KORNETSKY AND DONALD L. GERARD
Laboratory for Socio Environmental Studies, National Institute of Mental Health, Bethesda, Maryland 


FISKE AND BAUGHMAN in their recently pub-
lished study inquired “...to what extent are
the frequencies of responses in the several scoring 
categories related to the total number of re- 
sponses?” (2, p. 25). Their data indicated that 
there was a positive relationship between the num- 
ber of responses (R) and a number of clinically im- 
portant Rorschach determinants. They state, 
“While it is clear that frequencies in most categor-
ies are related to the total number of responses, we
cannot state, on the basis of the present data, 
which influences the other” (2, p. 30). This ques- 
tion which Fiske and Baughman ask may be par- 
tially answered as follows. If additional responses 
are elicited from a group of subjects (Ss) who 
spontaneously produced comparatively few re- 
sponses on the Rorschach so that their total num- 
ber of responses will equal those of a group who 
initially produced a larger number of responses, 
the influence of R on various scoring categories may 
be determined. Thus, it would be reasonable to 
expect that if the sum of the color responses (Sum 
C) and the human fantasy responses (M)¹ are a
function of R, there would be no statistically sig-
nificant difference between the amount of Sum C
and M in these two groups when the number of
responses is equated.


METHOD 


Data which could be utilized for this purpose were 
obtained in a current study of teen-age opiate addicts. 
A procedure similar to that used by Jernigan (3) was 
employed. The Rorschach was first given in the usual 
manner. After this, but before the inquiry, the cards 
were again presented to the S with the instruction to
tell again everything that he sees, but not to give any 
responses that he had given in the previous adminis- 
tration. 

Twenty-two Ss were studied in this manner. They 
were divided into two groups: Group A consisted of Ss 
who had 18 or more responses on their first Rorschach 
administration; group B consisted of Ss who had less 
than 18 responses on their first Rorschach (mean:
15.3, range: 13-16) but who were able to add suf- 
ficient new responses on their second Rorschach so that 


¹ In the present study Sum C and M were selected
because of: (a) their clinical importance, (b) their 
importance in a study now in preparation by the 
senior author, and (c) they appeared frequently 
enough in the small sample used so that statistical 
comparisons could be made. 


the total number of responses for Rorschach 1 and 2 
was over 18.² For group B the determinants for
Rorschach 1 and 2 were pooled, i.e., Sum C consisted
of Sum C on Rorschach 1 plus Sum C on Rorschach 2.
The distributions of the total number of responses for
group A and group B were not significantly different
(group A, mean: 25.0, range: 20-31; group B, mean:
23.4, range: 19-30). 

The Ss were dichotomized at the medians of the 
total distribution of Sum C (median: 1.0) and M 
(median: 1.3). Fisher's “exact” test of the probability 
of obtaining a set of frequencies in a 2 X 2 table was 
used (1, p. 84) to evaluate the significance of the dif- 
ference between Ss above and below the dichotomy for 
the two groups. 


RESULTS


The data for Sum C and M are summarized in
Tables 1 and 2 respectively. The probabilities that
these obtained sets of frequencies would occur by
chance are < .03 for Sum C and < .06 for M.


TABLE 1
AMOUNT OF SUM C DICHOTOMIZED AT 1.0 FOR GROUPS A AND B

Group    Sum C ≥ 1.0    Sum C < 1.0
A             8              3
B             1              6

TABLE 2
AMOUNT OF M DICHOTOMIZED AT 1.3 FOR GROUPS A AND B

Group    M > 1.3    M < 1.3
A           7          4
B           1          6
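
The probabilities reported for Tables 1 and 2 can be checked with a short sketch of Fisher's exact test for a 2 X 2 table. This is an illustration only, not the authors' own computation: the function name `fisher_one_tailed` is hypothetical, and a one-tailed probability (group A showing the observed count or more in the upper category, with all marginal totals fixed) is assumed.

```python
from math import comb


def fisher_one_tailed(a, b, c, d):
    """One-tailed Fisher exact probability for the 2 x 2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of every table, with the same
    marginal totals, whose upper-left cell is a or larger.
    """
    row1, row2 = a + b, c + d      # group sizes (assumed fixed)
    col1 = a + c                   # number of Ss in the upper category
    n = row1 + row2
    total = comb(n, col1)          # all ways to choose the upper category
    upper = min(row1, col1)        # largest possible upper-left cell
    favorable = sum(
        comb(row1, k) * comb(row2, col1 - k) for k in range(a, upper + 1)
    )
    return favorable / total


# Frequencies as read from Tables 1 and 2 (group A first row, group B second).
p_sum_c = fisher_one_tailed(8, 3, 1, 6)
p_m = fisher_one_tailed(7, 4, 1, 6)
print(round(p_sum_c, 3), round(p_m, 3))
```

Under these assumptions the sketch gives values of roughly .025 for Sum C and .057 for M, in line with the reported < .03 and < .06.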
DISCUSSION 
These results indicate that equating the number 
of responses will not equate the frequency of color 
or human movement-determined responses. In 
terms of the problem raised by Fiske and Baugh-
man, the results suggest that the frequency of color
and human movement-determined responses is


² Four Ss were discarded because their total number
of responses for Rorschach 1 and 2 was not greater than
18.




probably not due to the influence of the number of 
responses as an independent variable. 

The Ss who initially proffered the smaller num- 
ber of responses were a constricted nonproductive 
group. The failure of this group to increase the 
number of affective responses so that it would 
equal a more productive group when additional 
percepts were forced may be the consequence of 
their personality structure. Thus a study that used 
proportions in comparing the amount of affective 
responses between a constricted nonproductive 
group and a nonconstricted productive group may 
negate clinically meaningful differences. On the 
other hand, when nonconstricted productive indi- 
viduals add additional percepts, they may con- 
tinue to use affective responses in the same 




proportion as they previously did, so that in a 
comparison between two productive groups the use 
of proportions may be justified. 


REFERENCES 


1. EDWARDS, A. L. Experimental design in psycho-
logical research. New York: Rinehart, 1950.

2. FISKE, D. W., & BAUGHMAN, E. E. Relationship
between Rorschach scoring categories and the
total number of responses. J. abnorm. soc. Psychol.,
1953, 48, 25-32.

3. JERNIGAN, J. A. A Rorschach study of normal and
psychotic subjects in a situation of stress. Un-
published doctor’s dissertation, Univer. of Ken-
tucky, 1951.


Received July 17, 1953. 


ADEQUACY OF THE REPORTING-TEST MEASURES FOR DETECTING 
REACTIONS TO FRUSTRATION¹


W. BRUCE DOUGLASS 
University of California, Los Angeles 


AND KENNETH B. BROWN 
University of Missouri 


McKinney et al. (1) reported on a
paper and pencil test designed to measure 
changes in performance due to frustration in a 
group situation. This device, called the Reporting 
Test, consists of three similar series of diagrams 
having varying numbers of three types of symbols 
which the subject (S) is required to count. In the 
first series, Condition I (C I), S is allowed to work 
at his own pace for eight minutes. In Condition II 
(C II), the frustrating situation, he is given an im-
possible task of completing a designated number of
diagrams every 30 seconds, for two eight-minute
periods. To emphasize his failure, S is required to
mark an “F” for every problem not completed in
each 30-second period. The final condition (C III)
is an eight-minute postfrustration period in which
S again works at his own speed.

Scores for each of the conditions on the Report-
ing Test can be in terms of problems attempted,
net correct, errors, and percentage correct. The
differences in scores between the nonfrustrating
condition, C I, and the frustrating condition, C II,
presumably are indicative of proneness to frustra-
tion. McKinney et al. (1) present evidence for a
group of 196 Navy radar trainees that shows that


¹ This article is based upon the senior author's
master’s thesis (2) done under the junior author's 
direction, and submitted to the department of psy- 
chology, University of Missouri. 


under C II as compared with C I Ss made signifi- 
cantly more errors and attempted more problems, 
but had fewer net correct and a smaller percentage 
correct on the average. “However, not all individu- 
als reacted in the same way to experimental frus- 
tration: the majority speeded up performance at 
the expense of accuracy, but a smaller group 
showed stable performance without increase of 
speed or errors” (1, p. 322). Similar results were 
found when the test was repeated on 98 college 
students. 

While the investigators do not indicate what 
they consider the most appropriate method of 
scoring, they do state that the percentage correct 
score “... appears to be the most promising meas-
ure of efficiency ...” (1, p. 320).

The present study was designed to investigate 
the reliability and, indirectly, the validity of the 
four scoring methods for relatively unselected col- 
lege students. It is based on the assumption, sup- 
ported by the results cited, that the frustrating 
situation of C II should produce different changes 
in performance scores for different individuals from 
the scores earned under C I. That is, a frustrating 
situation would be expected to have more effect on 
some Ss than on others. For an adequate measure, 
one would then expect (a) high correlations among 
the various parts of the test when the conditions 
are fairly constant and (b) relatively low correla-
tions when the frustrating condition is introduced.


594 


The above reasoning provided a basis for a fairly 
simple method of investigating the four scoring tech- 
niques used on the Reporting Test. This was done by 
setting up a nonfrustrated (NF) group, which was ad- 
ministered the three parts of the test without the 
frustrating condition. Another group of Ss, the frus- 
trated (F) group, was given the test according to the 
standard procedure. Following this, intercorrelations
were obtained among the three parts of the test for 
each group on each measure (problems attempted, net 
correct, errors, and percentage correct). The NF-group 
correlations would provide measures of reliability. Then 
a comparison of the F-group correlations with the 
paired NF-group correlations would give an indication 
of which types of scores significantly reflected the dif- 
ferential changes that the frustrating condition was ex- 
pected to produce. 

The NF group consisted of 101 university students 
(74 males and 27 females) from two general psychology 
classes. The F group also had 101 Ss (65 males and 36
females) from two other general psychology classes. 

The NF group was given the same instructions at 
the beginning as were the Ss of the F group. However, 
after the first eight-minute period (C I), NF-group Ss 
were stopped and told that E was interested in deter- 
mining the relative difficulty of the different parts of 
the test. They were then instructed to begin working 
on the C II portion of the test. This was done also for 
the second eight-minute section of C II and for C III. 


A total of 28 product-moment correlations was 
calculated for each group, and the significance of 
differences between paired F- and NF-group r’s
was tested by the z transformation technique. 
However, only a summary of the results will be 
given here. 

All the obtained r’s for the two groups were sig- 
nificantly positive at beyond the .01 level of sig- 
nificance (r of .25 required for 99 df). The correla- 
tions for the NF group may be considered measures 
of reliability of the four types of scores used. How- 
ever, few of the r’s obtained for the NF group were 
high. For both error and percentage correct scores, 
none of the intercorrelations among various parts 
of the test was above .68. For problems attempted 
and net correct the r’s for C I and C II or C III 
fared no better (highest r was .61). However, when
the two sections of C II (independently and com- 
bined) were correlated with C III, the r’s for both 
measures were appreciably higher, ranging between 
.18 and .87. 

These results for the NF group suggest that for
all four measures scores obtained on the prefrustra- 
tion period may be too unreliable to be used in 
individual analysis. That is, Ss of the NF group did 
not reach fairly stable relative performance levels, 
as shown by problems attempted and net correct, 
until after the first eight-minute period. This fur-
ther suggests that an unscored “warm-up” period
would be beneficial, so that an individual’s per- 
formance under frustration could be compared 

with his stabilized performance without frustration. 




When the r’s for the F group were compared with 
the corresponding r’s for the NF group, few of the 
F-group r’s were found to be significantly lower.
None would be expected to be significantly higher,
and this was found to be the case. When the group
differences in correlations between C I and C II
were investigated, only those for the percentage
correct scores were found to be significantly differ-
ent in the expected direction (NF r = .66, F r =
.46; p = .05). However, this measure fared poorly
in other comparisons (C II vs. C III and C I vs.
C III) as indicated below. Since the results given
previously suggest that it is relatively unreliable,
the over-all results do not appear to indicate that
the percentage correct score is a dependable
measure of individual performance changes under
frustration.

When the differences between the paired F-group
and NF-group correlations for the C II and C III
periods were treated statistically, those for the F
group were found to be significantly lower (p <
.01) for both problems attempted and net correct
scores. The respective r’s obtained are as follows:
For problems attempted, NF r = .87 and F r =
.60; for net correct, NF r = .86 and F r = .70.
Further study is needed to clarify the significance 
of changes in performance from frustrating to post- 
frustration conditions. Casual inspection of the 
test data suggests that some Ss showed a carry- 
over of frustration reactions, others recovered 
quickly, others had little change from C I under 
either condition, while still others evidenced little 
change under C II but manifested it during the 
following period. While our results seem to indicate 
that dependable differential changes occur, their
nature and meaning in terms of personality varia-
bles need further investigation.
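
The z transformation comparison of paired correlations can be sketched as follows. This is a minimal illustration rather than the authors' worksheet: the function name `z_difference` is hypothetical, the two groups are treated as independent samples of 101 Ss each, and a one-tailed normal probability is assumed because F-group correlations were expected to be lower.

```python
from math import atanh, erfc, sqrt


def z_difference(r_nf, n_nf, r_f, n_f):
    """Compare two independent correlations with Fisher's z transformation.

    Returns the normal deviate for (r_nf - r_f) and its one-tailed
    probability under the hypothesis of equal population correlations.
    """
    # atanh(r) is Fisher's z; its sampling variance is 1 / (n - 3).
    standard_error = sqrt(1.0 / (n_nf - 3) + 1.0 / (n_f - 3))
    z = (atanh(r_nf) - atanh(r_f)) / standard_error
    p_one_tailed = 0.5 * erfc(z / sqrt(2.0))
    return z, p_one_tailed


# C II vs. C III correlations quoted in the text (N = 101 per group).
z_attempted, p_attempted = z_difference(0.87, 101, 0.60, 101)
z_net, p_net = z_difference(0.86, 101, 0.70, 101)
print(round(z_attempted, 2), round(z_net, 2))
```

With these inputs both deviates exceed the one-tailed .01 criterion, which agrees with the significance level reported for problems attempted and net correct.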

Perhaps in part because of the low reliability of 
the error scores, no significant differences in the 
correlations of the F and NF groups were found on
this measure. There were as many F-group r’s that
were higher as were lower than the comparable
NF-group values. These results, along with the pre-
viously indicated low reliability of this measure,
suggest that errors as well as percentage correct are
not an adequate indicator of reactions to frustra-
tion, at least as the test is presently designed.

While there is considerable indication of face 
validity of the Reporting Test in observations of
Ss taking it, in their introspective comments on it,
as well as in the design of the frustrating situation
itself, the present study suggests that refinement
in scoring and construction would improve its use-
fulness for determining frustration tolerance.


REFERENCES 


1. McKINNEY, F., STROTHER, G. B., HINES, RUTH R.,
& ALLEE, RUTH A. Experimental frustration in a
group test situation. J. abnorm. soc. Psychol.,
1951, 46, 316-323.

2. DOUGLASS, W. B. An investigation of various scor-
ing methods for the McKinney Reporting Test.
Unpublished master's thesis, Univer. of Mis-
souri, 1952.


Received June 22, 1953. 


THE STABILITY OF AUTOKINETIC JUDGMENTS! 


J. H. ROHRER, S. H. BARON, E. L. HOFFMAN, AND D. V. SWANDER
Urban Life Research Institute, Tulane University 


A MAJOR criterion used in a series of studies 
concerned with problems of personality 
structure and interpersonal interactions, carried 
out by the writers, has been rate of convergence 
to a common social norm established to perceived 
autokinetic light movement. Generalizations de- 
veloped from this experimental model of a social 
situation are limited by the stability, in time, of 
the norms so established. Sherif (8) in his classic 
experiment demonstrated that such norms were 
stable when tested after a period of one week. 
Bovard (2) reported stability of norms after 28 
days. This report deals with the stability of 
socially affected judgments, established to an 
autokinetic producing light S, after an elapsed time 
interval of one year. 


METHOD 


Subjects. The stability-of-norms test was carried out
with 46 observers (Os) who had established social norms 
one year prior. All Os were undergraduate male students 
at Tulane and all were enrolled in either Army, Navy, 
or Air Force ROTC training units. None of the Os was 
aware of the purpose of the study and none had expe- 
rienced the autokinetic-producing situation in the in- 
terval between test-retest. 

The experimental situation and room were identical
for the norm formation and retesting sessions. The
experimental situation has been described in detail else-
where (4). The apparatus also has been described else-
where (7).

Experimental sessions. Three sessions were utilized:
“T” an individual training session; “X” a social norm
formation session; and, “Y” an individual test of norm
stability. The method of training utilized in session T
has been described elsewhere (4). Briefly, Os were 


¹ This study is one of a series carried out under ONR
Contract Nonr-475(01) with Tulane University. That
contract provides for a systematic investigation of the
relationships between personality, learning, and social
variables. The project, under the direction of J. H.
Rohrer, is a part of the research program of the Neuro-
psychiatry Branch, Bureau of Medicine and Surgery,
U. S. Department of Navy. While this paper constitutes
a technical report to ONR, the interpretations pre-
sented are those of the authors and do not repre-
sent, necessarily, those of the sponsoring agency, the
Department of Navy.


trained in individual sessions using actual movement of 
a pinpoint of light within a circular luminous frame- 
work. The Os reported their judgments of the extent of 
movement after each exposure. Fifty exposures were 
made in this session. The actual distance that the light 
moved was determined by the experimenter (E) at the
outset of the session and remained constant throughout
the session, although the direction of the movement was
systematically varied on a plane perpendicular to the O 
as permitted by the apparatus (7). 

About one hour following the training session, the 
norm formation session occurred. In this session the lu- 
minous framework was absent, the pinpoint of light re- 
mained stationary, and the auditory cues from the 
apparatus were present. In session X the Os were tested 
in pairs, one member had been trained in session T at a 
distance of 2 in., the other member had been tested at a 
distance of 8 in. During session X a verbal report of the 
perceived extent of light movement was obtained from 
the Os after each exposure. Fifty presentations were also 
made in this session. 

In session Y, which took place approximately 12 
months following session X, each O was tested individu- 
ally and judgments were given following each exposure. 
Fifty exposures were given in this session also. Thus, in 
session Y the conditions were identical with those used 
in session X, with the exception that each O was tested 
individually. 

Scoring. The scores were in terms of inches of move- 
ment reported by the O. The median of each O's 50 
judgments, for each session, was computed. It is this 
median that constitutes the O's norm or score for each 
session. References to “mean score" indicate the mean 
of medians. 
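
The scoring rule above can be sketched in a few lines. The data are invented for illustration (five judgments per observer instead of fifty), and the names `session_norm` and `mean_score` are hypothetical labels, not terms from the study.

```python
from statistics import mean, median


def session_norm(judgments):
    """An observer's norm for one session: the median of his judgments (in.)."""
    return median(judgments)


def mean_score(judgments_by_observer):
    """The group's "mean score": the mean of the observers' medians."""
    return mean(session_norm(j) for j in judgments_by_observer.values())


# Hypothetical judgments in inches; a real session would have 50 per observer.
example = {
    "O1": [2, 3, 3, 4, 10],
    "O2": [5, 6, 6, 7, 7],
    "O3": [1, 8, 8, 9, 9],
}
norms = {observer: session_norm(j) for observer, j in example.items()}
group_mean = mean_score(example)
print(norms, round(group_mean, 2))
```

Using the median for each observer keeps a single extreme judgment (such as the 10 in. report for O1) from shifting that observer's norm.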


RESULTS 


The data for the Os trained with a 2-in. light 
movement were analyzed separately from the data 
obtained from the 8-in. training group. This was 
done because the means of the T scores for the 
groups differ significantly—which was to be ex- 
pected—and the groups were heterogeneous with 
respect to variances. The results of an F test pro- 
vide evidence for rejecting the hypothesis of 
homogeneous variance at the .01 level of confi-
dence. Furthermore, it was not feasible to adjust
the autokinetic X scores to account for the differ- 
ences in T scores because an analysis of the re- 
gression data revealed that one regression line 


596 


could not be used for all observations. Although 
the slopes of the regression lines within the two 
groups were not significantly different and the 
regression line for the two means was linear, the 
regression coefficient for the data within the 
groups differed significantly from the regression 
coefficient for the group means.

Table 1 presents the mean and variance for the 
T scores for Os trained at 2 in., and 8 in. respec- 
tively. When Os who had been trained with a light 
that moved 2 in. were paired with Os trained with 
a light that moved 8 in., and these pairs were 
tested together for their judgments of autokinetic 
movements, the judgments converged to produce 
the X means also presented in Table 1. The 
variances for these scores are also presented. It 
may be noted that the variability of the judgments 
in the autokinetic session is considerably larger
than the judgments in the more structured training 
session. 

The group means for the autokinetic Y scores 
obtained one year after the X scores are also pre- 
sented in Table 1 together with their variances. 
A reduction in the group means may be noted for 
both T groups. This reduction is statistically 
significant (p < .01) for the 8-in. training group 
but not for the 2-in. training group. Differences 
between sessions have been evaluated and are 
presented in Table 2. 

In spite of the reduction in group means when 
retested after an interval of one year, there is still 
a high positive relationship between the norms 
established in sessions X and Y. This relationship 
is shown by the correlation coefficients in Table 3.
The multiple-correlation coefficient, R_Y.XT, is
approximately the same as the simple coefficient
of correlation between first and second autokinetic
scores.

In order to evaluate the relative contributions 
of X and T to the prediction of Y, the standard 
partial-regression coefficients were computed. The 
standard regression of Y on X, independent of T, 
is .739 and .600 for the 2-in. and 8-in. training
groups respectively, while the standard regression
of Y on T, independent of X, is .264 and -.008
for the 2-in. and 8-in. groups respectively. It is


TABLE 1
MEANS AND VARIANCES
(N = 23 in both training sessions)

              2-in. Training        8-in. Training
Session      Mean    Variance      Mean    Variance
T            3.08      6.08        8.74      .79
X            6.29     24.95        6.41     4.24
Y            5.50     12.54        5.09     5.06




TABLE 2
OBTAINED DIFFERENCES BETWEEN SESSIONS
(N = 23 in both training sessions)

               2-in. Training        8-in. Training
Statistic       T-X       X-Y         T-X       X-Y
Mean diff.     3.21      0.79        2.33      1.32
t              2.675     0.6076      4.8724    2.0301
p              <.001     >.50        <.01      <.05
TABLE 3
INTERCORRELATION COEFFICIENTS
(N = 23 in both training sessions)

Coefficient    2-in. Training    8-in. Training
r_XY               .892              .595
r_YT               .693              .341
r_XT               .581              .582
R_Y.XT             .918              .598


evident from a comparison of these coefficients 
that the contribution of T to the prediction of Y 


is negligible. 
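
The standard partial-regression coefficients quoted above can be recovered from the intercorrelations in Table 3. The sketch below assumes the usual two-predictor formulas and reads the three simple coefficients in Table 3 as r_XY, r_YT, and r_XT; the function name `standard_partial_regression` is a hypothetical label.

```python
from math import sqrt


def standard_partial_regression(r_xy, r_yt, r_xt):
    """Standard partial-regression weights for predicting Y from X and T.

    Returns (beta for X holding T constant, beta for T holding X constant,
    multiple correlation of Y with X and T).
    """
    denom = 1.0 - r_xt ** 2
    beta_x = (r_xy - r_yt * r_xt) / denom
    beta_t = (r_yt - r_xy * r_xt) / denom
    multiple_r = sqrt(beta_x * r_xy + beta_t * r_yt)
    return beta_x, beta_t, multiple_r


# Intercorrelations as read from Table 3.
two_inch = standard_partial_regression(r_xy=0.892, r_yt=0.693, r_xt=0.581)
eight_inch = standard_partial_regression(r_xy=0.595, r_yt=0.341, r_xt=0.582)
print([round(v, 3) for v in two_inch])
print([round(v, 3) for v in eight_inch[:2]])
```

Under these assumptions the weights come out at about .739 and .264 for the 2-in. group and about .600 and -.008 for the 8-in. group, matching the values in the text; the near-zero weight for T is what makes its contribution negligible once X is known.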
DISCUSSION


In a previous study (4) it was demonstrated 
that the effects of training Os to judge the ac- 
tual distance moved by a light S, generalized to an 
autokinetic light movement situation and thus 
influenced the magnitude of the perceived auto-
kinetic movement. Stated in more formalistic
terms, the habit strength of an R tendency built
up to an S of real movement, generalized to an S
situation in which the perceived movement was
illusionary. The data presented in this study 
demonstrate that judgmental norms of perceived
autokinetic movement, modified in the direction of
conformance with judgments of a second person
in an interpersonal (social) situation, tend to negate 
the effects of the earlier individual training. More- 
over, this negation of training effects by social 
interaction persists in time, the effects of the
social interaction manifesting themselves in a “non-
social” test situation after a period of one year.
Again in more formalistic terms, the modification 
due to social interaction was in the intervening 
habit variable. 

Kelman (5) has demonstrated that felt “suc-
cess” leads to stability of norms established to
perceived autokinetic movement and felt “failure”
leads to norm modification. The terms “success”
and “failure” are most meaningful when used
within a theoretical framework employing the
concept of reinforcement. Such a framework re-
quires the identification of “motives” or “drives”
that, when reduced, strengthen the tendency to
respond. The change made in our S situation,
which produced the altered behavior, was the 
introduction of a second O who began giving 
reports of perceived movement that were at 
variance with the judgments given by the other 
member of the pair. The altered behavior that 
resulted brought into conformance the judgments 
of perceived movement reported by both members 
of the pair. It is hypothesized that this shift was 
activated by an acquired motive for social conformance.*
The data argue for the shift being due
to a drive variable, rather than due to a situa- 
tional, or S, variable because the altered R per- 
sisted when the interpersonal interaction was not 
operating (session Y). Thus a stable R tendency 
was established to the light S in the 50-trial X ses- 
sion by the reduction of the conformity drive, and 
this R tendency was manifested in the subsequent 
individual Y session. 

An attempt was made to explain the data by use 
of Bateson’s (1) and Harlow’s (3) notions of 
learning sets. The rather rapid rate of convergence 
noted in session X suggests that a set for con- 
formance produced by the interpersonal cues 
could be involved. What is known about human 
socialization practices also suggests that sets to
conform are one of the most likely by-products of
socialization. Yet the results obtained in session Y
suggest that the notion of learning sets is not
sufficient to explain the results; the specificity and 
stability of the acquired R tendency suggests the 
theoretical need for an additional notion similar
to what Harlow (3) has referred to as “Thorndikian
bondage” resulting from drive reduction.

* Within the theoretical framework of Parsons et al.
(6), this would be equivalent to their “need-dispositions
to conform to expectations” that was felt by “ego.” In
the case of our Os, there was no evidence of overt conflict
involved in the conforming behavior. They were
not aware that they were conforming, yet all converged
very rapidly, despite the presence of acquired R tendencies,
established in the T session, that would cause them
to tend not to conform.


SUMMARY 


1. The stability of judgmental norms, estab- 
lished as a result of interpersonal interaction, was 
evaluated one year after they were established. 
The evaluation was made by testing the subjects 
individually. 

2. It was shown that the norms, so established, 
were stable. 

3. Data were presented which showed that 
social interaction resulted in a rapid modification 
of response tendencies established in a previous 
individual training session. 

4. The results obtained were interpreted within
a reinforcement theory framework.


REFERENCES 


1. Bateson, G. Social planning and the concept of
“deuterolearning.” In T. M. Newcomb & E. L.
Hartley (Eds.), Readings in social psychology. New
York: Holt, 1947. Pp. 121-128.

2. Bovard, E. W., Jr. Social norms and the individual.
J. abnorm. soc. Psychol., 1948, 43, 62-69.

3. Harlow, H. F. The formation of learning sets. Psychol.
Rev., 1949, 56, 51-65.

4. Hoffman, E. L., Swander, D. V., Baron, S. H., &
Rohrer, J. H. Generalization and exposure time
as related to autokinetic movement. J. exp. Psychol.,
1953, 46, 171-177.

5. Kelman, H. C. Effects of success and failure on “suggestibility”
in the autokinetic situation. J. abnorm.
soc. Psychol., 1950, 45, 267-285.

6. Parsons, T., & Shils, E. A. (Eds.) Toward a general
theory of action. Cambridge: Harvard Univer.
Press, 1951. Pp. 159-189.

7. Rohrer, J. H., & Hoffman, E. L. An apparatus for
studying the perception and judgment of light
movement. Amer. J. Psychol., 1954, 67, 143-146.

8. Sherif, M. A study of some social factors in perception.
Arch. Psychol., New York, 1935, 27, No.
187.


Received June 12, 1954. Prior publication. 


REVIEWS 


STUDIES IN THE SCOPE AND METHOD OF “THE
AUTHORITARIAN PERSONALITY.” Edited by
Richard Christie and Marie Jahoda. Glencoe,
Illinois: The Free Press, 1954. Pp. 279. Price
$4.50.

This is the second in a series of which Studies in 
the Scope and Method of “The American Soldier” 
was the first. Three of the book’s six chapters are 
brief ones. Jahoda, in an editorial introduction, 
reconstructs some of the historical setting, both 
theoretical and methodological, in which the authors
of The Authoritarian Personality began and
continued their work. One of its unique features 
which she notes is its use of the methods of clinical 
psychology in conjunction with psychoanalytic 
theory “for other than clinical purposes, that is, 
for the exploration of political ideologies [as re- 
lated to] personality structure.” She also notes 
that none of the original authors was invited to 
reply to the criticisms (some of which, as will be 
shown, are rather harsh) in the volume under 
review here.

The other two short chapters, by E. A. Shils
and H. D. Lasswell, respectively, present an interesting
contrast. The latter refers hardly at all to
the volume by Adorno et al. but, like it, is prima- 
rily concerned with the scientific problem of the 
contribution of personality variables to behavior 
in the public arena, specifically in their contribution
to political participation on the part of “the
power-centered person.” Shils's chapter, with its
single-minded devotion to the charge that Adorno
et al. were insufficiently aware of authoritarianism
on the political “left,” keeps its eye on the authors
but not on their scientific problem. Lasswell’s
chapter, by contrast, is studded with relevant
propositions, always interesting and generally
testable.

The final chapter, by Frenkel-Brunswik, is ap- 
propriately titled “Further Explorations...” 
Though it contains an occasional explanation or
defense of something in the original volume, it is 
in no sense an apologia. The largest portion of it is 
devoted to her own post-1950 investigations of 
phenomena in children similar to those originally 
reported for older subjects. For example, prelimi- 
nary results from a set of experiments in perception 
"support the conjecture that, by and large, such 
tendencies as the quest for unqualified certainty, 
the rigid adherence to the given—be this an au- 
thority or a perceptual stimulus . . . tend to occur 
again and again within an individual in contexts 
seemingly far removed from each other." 

In a final passage she raises the question whether
or not the value biases of the authors of The
Authoritarian Personality invalidated its conclusions.
Among several “arguments” offered in defense
of a negative answer is this interesting one:
“From this angle we must welcome the fact that 
E. R. Jaensch has given us articulate description of 
the ideal personality type from the standpoint of 
Nazi ideology. The studies of Jaensch, reported in 
some detail above...have thus certainly been 
conducted under a bias opposite to ours, yet they 
were found to lead to the recognition of similar 
patterns albeit with a different evaluative slant.” 

The meat of the book is to be found in two long 
and critical chapters of which the first, by H. H. 
Hyman and P. B. Sheatsley, is a methodological 
one. Their conclusions, in the main convincing to 
this reviewer, are worth quoting at some length: 


The findings rest on an admittedly unrepresentative 
sample, from which large generalizations are incautiously
drawn . . . . Weaknesses in the authors’ handling
of the quantitative data invariably work in favor of the
assumptions, and the procedures are often inadmissible.
. . . The questionnaire data are not validated save by
their correspondence with the results of personal interviews
with extreme groups and . . . the criterion for
views with extreme groups and... the criterion for 
validation was not independently derived . . . no con- 
trol at all was exercised over the important variable of 
formal education .. . Statistical tables are presented, 
the relationships are noted, and they are then “ex- 
plained” in terms of the authors’ theory. But there is no 
safeguard on the interpretations, and they become mere 
statements of opinion....Much of the qualitative 
analysis of respondents’ ideology is almost completely 
unrestrained by any attachment to scientific method. 
The authors’ propensity for psychodynamic explana- 
tions leads them to ignore the widespread distribution 
of certain sentiments in the American population . . . 
and to attribute them instead to psychodynamic 
processes unique in the ethnocentric individual. 


These harsh judgments of the methodological 
trees do not, however, prevent the critics from 
taking an appreciative view of the research forest: 
"One feels intuitively that [the authors] have 
wisdom in their views and soundness in their con- 
clusions, and it is sad that the acumen which in- 
spired the project, the energy in executing a 
research task unparalleled in scope, and the intui- 
tive power devoted to the appraisal of the results, 
were not matched by equal methodological skill." 

In the other long chapter, “Authoritarianism
Re-examined,” R. Christie tackles five problems
raised by the research reported in the original volume
or by later researches inspired thereby. (a)
The F-scale items “did not appear to capture
authoritarianism per se... the postulated personality
variables underlying their acceptance had
tenuous but provocative support.” (b) The F scale
“did discriminate between individuals who displayed
behavior which was authoritarian in interpersonal
relations, rather than politically”; its
discriminating power is “consistent” but not “exceptionally
high.” (c) The authors' contention that
ethnocentrism is rather highly generalized is pretty 
well supported, and its relationship to "certain 
aspects of personality appears to be well estab- 
lished”; e.g., “those persons who view the world as 
threatening and unpredictable tend to be less 
tolerant of minority groups." (d) Various measures 
related to socioeconomic status show “a sizeable 
negative correlation" with F-scale scores; "these 
findings are interpreted as running counter to the 
general argument in The Authoritarian Personality
which emphasized purely personality determinants
and discounted contemporary social influences.”
(e) With regard to “the hypothesized
greater importance of early environment in contrast
to adult experiences in the genesis of potentially
fascistic personality . . . neither the original
data nor subsequent findings are sufficient to prove
or disprove the hypothesis.”

Were it the present reviewer’s function to offer 
a summary evaluation of The Authoritarian Per- 
sonality, he could hardly do it more sagely than has 
Christie: “Both [its] strength and weakness lie in 
its basic assumptions which are rooted in psychoanalytic
theory. Such an orientation has led to the
uncovering of a host of data which in all likelihood
would not have been discovered by investigators
with differing theoretical viewpoints. Despite some
methodological weaknesses in the original research,
subsequent findings have been predominantly
confirmatory.”

It is this reviewer’s function, however, to evaluate
this volume of evaluations. Was it worth while
to publish what might be termed a 275-page book
review? The present verdict is a thrice-qualified
affirmative. Qualification 1: Some chapters are
carelessly edited and/or proofread. Qualification
2: All but the two long chapters might have found
other publication outlets. Qualification 3: Only a
work as important as The Authoritarian Personality
could justify such an outsize review. In summary,
the justification of the book under review
lies in the importance of the book which it reviews,
together with the carefully documented criticism
of its two major chapters. Readers who enjoy fair,
astute, detailed, and documented criticism of important
psychological contributions will wish that
there were more frequent opportunities for critiques
like these.

THEODORE M. NEWCOMB
University of Michigan


THE STRUCTURE OF PERSONALITY. By H. J.
Eysenck. London: Methuen & Co. and New
York: Wiley, 1953. Pp. 348. Price $5.75.

Some fifty years of endeavor on the part of
several hundred psychologists are dissected and
reported upon in this book. The literature was 
combed for empirical studies bearing on theories of 
personality structure, the proper statistical model 
for which, according to Dr. Eysenck, is R factor 
analysis, and more especially analysis into first- 
and second-order factors along Thurstone’s well- 
known lines. The theory favored by Eysenck is 
a hierarchical construction of his own, according 
to which habitual modes of response become or- 
ganized into traits, which in turn are arranged into 
a few all-embracing types. Traits and types cor- 
respond to first- and second-order factors, respec- 
tively. Traits are of the kind “shyness,” “per- 
sistence,” “rigidity”; types are groups of correlated 
traits, there being three such clusters, called introversion-extroversion,
neuroticism, and psychoticism.
The book is a disquisition on facts to
support this hierarchical construction, drawn vicariously
from studies on the analysis of questionnaires,
objective behavior tests, physique,
physiological measures, interests and attitudes, 
correlation between persons, and "trait" measure- 
ments. 

Such a construction lends itself very well to 
Eysenck's stature in this field, and we may be sure 
that some penetrating analyses are undertaken 
in these pages, many sound conclusions as to fact 
drawn, and many interesting lines pursued. 
Eysenck takes a firm grip on his comb, pulling
it where it will. Psychologists who have to deal
with the practical problems of society, with the 
gross diagnosis of mental disorders, or with the 
selection of personnel in group testing situations 
and the like, will find a great deal indeed to please 
them in these chapters. The present reviewer does 
not feel competent to assay some of these facts, 
but where he is able to judge them in detail, as, 
for example, in the treatment of important studies 
by Jenkins and his collaborators (3), he is of the 
opinion that Eysenck's judgment is sound and 
constructive. The types themselves are perhaps
badly named—introversion-extroversion in Ey- 
senck's system is scarcely what Jung had in mind, 
and neuroticism ordinarily connotes a more specific 
set of conditions than Eysenck's second-order 
factor embraces. The chapter on correlation of 
persons is no worse for its inadequacies than others 
written by those who adopt the primacy of R 
methodology. One is sure, with such exceptions, 
that Eysenck has dealt fairly with facts, and that 
there is much substance to his construction. 

Even so, a construction of this kind is not a 
theory in a genuine sense, and Eysenck's comb 
seems to have many broken teeth, so that much is 
left unteased on Psyche's shaggy brow. To take 
the lesser matter first; one may wonder whether 
or not important papers in the area, which happen 
to be of a theoretical nature, have been unwisely
passed over on the mere ground that they are not
factual. Thus, it may be argued that a theoreti- 
cal paper that relates perseveration to Pavlov's 
theories of inhibition (4), that is, substantially to 
conditions of true experimental neurosis, is worth 
more attention than most so-called factual work 
on the subject of perseveration. It is clear, also, 
that the armory of factor analysis does not end 
with first- and second-order factors, and that third 
and higher orders can be expected, or have been 
unearthed: what “theory,” one wonders, will 
Eysenck offer for these? Since increase of com- 
plexity is embraced in the hierarchy, no doubt 
archetypes will correspond to third-order factors. 
Or, a third-order bipolar may perhaps correspond 
to the sexes, and fourth orders to the different 
races of mankind? These questions, it seems to this 
reviewer, lay bare the weakness of Eysenck’s
construction: clearly it is not a theory in any hypo- 
thetico-deductive sense, as Albino (1) has already 
indicated. Eysenck's construction is purely ad 
hoc, explaining nothing, and permitting of no 
pertinent hypotheses or deductions. 
Here, it seems, is the essential difficulty. Nothing 
genuinely theoretical about personality, or even 
about scientific method, is to be found anywhere 
in these pages. This is the more astonishing be- 
cause Eysenck professes above all else to be 
thoroughly scientific, and to be devoted to the 
hypothetico-deductive method. A theoretician, 
however, would have felt it essential to offer a 
theory for the types outlined in this book: several 
are possible, one, in particular, in relation to ego 
theory and defenses. Eysenck offers no such theory, 
of course, and would regard all of them, inevitably, 
as so much speculation. This is because as a 
factorist, he has grasped, above all else, the im- 
portance of operations in scientific work: for 
Eysenck a theory without actual tests and proofs
is anathema. Most theories in personality study, 
Eysenck correctly sees, are without the essential 
operations. It may be suggested, however, that 
not all such theories are purely speculative. A 
little perspective might lead one to conclude that 
many are perhaps on the right lines, and that the 
factorist would do a really important job if he 
could provide the missing operations. The factor 
methods that Eysenck espouses so cleverly can 
in fact be placed at the service of the very theories 
that he rejects. Thus one must agree with Eysenck 
that matters of scientific proof are essential; one 
must agree, too, that factor methods can provide 
really cogent operations; but it is as well to re- 
member that operations without a genuine theory 
in attendance are apt to become the worst kind of 
scientism (2), and it is from such a charge that the 
present reviewer would like to save Eysenck. 
However, it will avail little to discuss this 
methodological difficulty. What is required from
those of us who are factorists is considerable
tolerance for all who have theoretical interests in 
personality studies. The necessary operations are 
not far away. And about the merits of our own 
efforts we might profit by remembering Queen 
Elizabeth I's outburst at Cambridge University 
nearly four centuries ago when her incomparable 
virtues were being extolled by the dons of her day: 
“Non est veritas," she blurted, “et utinam. ..." 
So, too, for Eysenck and factor analysis. The time 
has surely come for a great deal of circumspection 
from all who are trying so hard to comb Psyche’s 
hair. Otherwise we shall never find out whether, 
after all, she only wears a wig. 


REFERENCES 

1. Albino, R. C. Some criticism of the application of
factor analysis to the study of personality.
Brit. J. Psychol. (Gen.), 1953, 44, 164-168.

2. Hayek, F. A. The counter revolution of science:
studies in the abuse of reason. Glencoe, Ill.: Free
Press, 1952.

3. Lorr, M., & Jenkins, R. L. Patterns of maladjustment
in children. J. clin. Psychol., 1953, 9, 16-19.

4. Stephenson, W. Studies in experimental psychiatry.
J. ment. Sci., 1932, 78, 315-330.


WILLIAM STEPHENSON
University of Chicago 


THE MAKING OF A SCIENTIST. By Anne Roe. New
York: Dodd, Mead, 1953. Pp. ix + 244.
Price $3.75.

This book by a psychologist is addressed pri- 
marily to nonpsychologists. Since the researches 
reported here are already available to psychologists 
in their professional journals, the interest of the 
volume for them will lie largely in the fact that 
it is an example of popular writing. The planning,
the execution, the results, and the implications of 
the research are all here. 


Most readers of this journal, doubtless, have
a speaking acquaintance with Dr. Roe's project.
She spent nearly four years making “clinical
studies of vocational choice and performance in
terms of life history and character structure.”
Sixty-four distinguished American-born research
scientists in the age range 30 to 60 were her subjects.
They were drawn in about equal numbers
from the ranks of biologists, physicists, and social
scientists (psychologists and anthropologists). A
control group of competent but less distinguished
scientists in the same fields was also studied by
group methods. A control group of similarly distinguished
nonscientists was not used. The results
indicate that there are patterns in the life histories, 
intellectual abilities, and personality structure 
that are “more characteristic of scientists than 
they are of people at large, and some which are 
more characteristic of special kinds of scientists
than of other scientists.” No differences were found
between the more eminent and the less eminent 
physicists, but in the other two groups the more 
eminent “tended to have extremer degrees of the 
factors that characterized the men in the field.” 
The writing is clear and animated. The exposi- 
tory sections, especially those dealing with method- 
ological problems and technical information, are 
models of simple, economical writing. The human 
interest value of the subject investigated is aug- 
mented by the chatty, autobiographical cast of 
the presentation itself, as well as by the inclusion 
of many personal experiences that were incidental 
to the conduct of the research. This added human 
interest, however, is had at a price: often the 
author, as a person, stands in front of the subject 
matter of the book, and thus distractingly com- 
petes for the reader’s attention. 
Dr. Roe tells us the several purposes she had
in writing this volume. They were (a) to “enable
me to review the experience as a whole,” (b) “to
give my subjects a more coherent and comprehensive
account than the technical reports can
give anyone but an expert in my field,” (c) to
“make it possible for others to share in a profoundly
moving experience, and (d) to examine with
me the meaning and value of science for the man
who does it and for society” (p. 17). It is doubtful
that writing this sort of book is essential to achieving
the first purpose, and it is not likely that the
author seriously believed it to be so. As to the
second purpose, fully one fourth of the content of
the book is not essential to achieving it; and much
of this fourth, the reviewer believes, would annoy
her distinguished subjects by its irrelevance. The
third purpose, that of sharing a profoundly moving
experience, is realized with a high degree of success.
The fourth is really two purposes. One of
these, the examination of the meaning of science
for the man who does it, is achieved with distinction.
The data are carefully evaluated, and the
generalizations are circumspect. The other, the
examination of the value and the meaning of
science for society, is not attained. Here the responsible
scientist yields to the beliefs and hopes
of the socially conscious citizen-scientist. Properly
speaking, Dr. Roe does not examine the meaning
and value of science for society. Rather she testifies
to a religious faith: “. . . the things that make
good scientists are also the things that make good
citizens” (p. 240). What is good for General
Motors is good for the nation! According to the
findings of her study, the things that make good
scientists are experiences in the course of development
which foster and make rewarding (a) independence
in individual thinking, and (b) independent
empirical investigation as a means of
answering questions. Her data, however, do not
indicate clearly or reliably that these two abilities
and dispositions, which she found were characteristic
of her subjects in the fields in which they
were distinguished, are characteristic of them in
other fields, nor do her data bear on the question
of whether or not these characteristics are sufficient
to make good citizens.

With her faith we cannot quarrel; indeed, many 
of us subscribe to it. But let us not embrace science 
and condemn dogma while at the same time we 
promulgate dogma in the name of science. It was 
not a scientist nor a psychoanalyst but a jurist, 
Mr. Justice Holmes, who advanced the opinion 
that the great difficulty with all thought is the 
inarticulate major premise. 


STANLEY G. ESTES
Harvard University


RACIAL AND CULTURAL MINORITIES: AN ANALYSIS
OF PREJUDICE AND DISCRIMINATION. By
George Eaton Simpson and J. Milton Yinger.
New York: Harper and Brothers, 1953. Pp.
ix + 773. Price $6.00.

In a few prefatory remarks the authors tell us
why they wrote the book: the prejudice-discrimination
melon, they say in effect, has been sliced a
great many times and from a great many angles.
This is a good thing. But the very proliferation of
specialized studies may obscure the important
truth that the melon is more than the sum of its
slices; certainly it makes the task of synthesis
increasingly difficult. Accordingly, it is time to re-examine
the whole in the light of our accumulated
knowledge about its component wedges. In the
course of this re-examination, or in consequence of
it, the authors hope to furnish the reader with
guiding concepts which will make him “. . . less
likely to adopt an oversimplified, one-factor view,
and less likely to study one situation in isolation
from the total society of which it is a part.” They
propose to do this in two ways. First, by “. . . relating
the various analyses to each other and to a
systematic group of principles that underlie them
all,” and secondly, by writing from the point of
view that “. . . every proposition concerning intergroup
relations should be . . . a part of the general
principles being developed in social science.” As it
turns out (perhaps atypically) these objectives
have a subtle but profound influence on the substance
they preface.

There are three major sections. Part I, The
Causes and Consequences of Prejudice and Discrimination
(ten chapters, 315 pages), includes
analyses of various types of majority-minority
situations and of a number of “approaches” to the
concept of race, a discussion of the personality and
group-conflict functions of prejudice and of assorted
consequences of prejudice and of discrimination,
and a two-chapter “case study” of the
sociology and social psychology of anti-Semitism.
Part II, Minorities in the Social Structure: The
Institutional Patterns of Intergroup Relations (ten
chapters, 324 pages), provides a one-chapter review
of social class and caste notions and presents data 
on economic, political, legal, familial, religious, 
educational, and artistic characteristics of promi- 
nent minority groups in the United States. Part 
III, Prejudice Discrimination and Democratic 
Values (two chapters, 71 pages), comprises a pro- 
gram of strategies with which to reduce prejudice 
and discrimination by, respectively, changing the
prejudiced person and changing situations conducive
to prejudice. Remaining materials include
a tabular appendix on the size of various minority 
groups in the United States, Brazil, and the Union 
of South Africa, a bibliography of 498 entries, 
and separate subject and name indexes. 

In essence the book is a comprehensive review
of historical and sociological fact and fancy regarding
prejudice and discrimination and, as such, is
potentially valuable for reference purposes. The
authors’ knowledge of their subject is impressive,
their approach scholarly, even polymathic, and
their prose learned but easy. For psychologists the
second section particularly is likely to prove useful.
It includes a wealth of factual material, admirably
organized about specific behavioral areas, which is
not otherwise available in a single work. For example,
the four chapters in this section which are
devoted to minorities and the American political
and legal processes and to the education of minori- 
ties in the United States provide both a fascinating 
and a compelling setting for the recent Supreme 
Court decision on segregated schools and a remark- 
able demonstration of the institutionalization of 
attitudes toward minority groups. Unfortunately, 
the book’s usefulness is impaired, perhaps vitiated, 
by several pervasive inadequacies. In the opinion 
of this reviewer, all are attributable to the familiar 
holistic bias implicit in the intent to "integrate" 
and to present a “total context.” Two are illus- 
trative. 

First, the attempt to provide what one must 
suppose is a “multifactored” view and to consider 
the concepts and methods of a variety of disci- 
plines leads to the inclusion of material which is of 
questionable relevance and/or dubious validity, 
and results in a diffuse and uneven treatment. 
What purports to be a critical evaluation turns out 
to be an uncritical admixture: demographic fact 
(Census data on the distribution of farms of non- 
white operators) and secondhand anecdote ("a 
Negro actress who travelled . . . through the South 
reported . . .”); hardheaded appraisal (“We have 
a measure of only the changes in responses to verbal 
tests, not of other behavioral changes"), and facile 
speculation (“for the minority artist, the creative
process provides emotional release... [and]...
social recognition”); historical erudition (“there
were three major dispersals of Jews during antiquity”),
and psychological naïveté (“... prejudice
. . . seized upon by an insecure person may
result in . . . a rigidity of mind and compulsiveness
in adjustment”); devotion to science (“to fail to
take into account methodological problems is to
give an impression of finality that no science ...
can accept”) and methodological confusion (“In
defining relations... with so many variables involved,
one is bound to run into disagreement over
what is more and what is less”) are inextricably
compounded. And while it may be true, for example,
that “Reproduced, spurious, and bastard
dialects tend to reinforce unfavorable racial stereotypes,
[while] dialect poetry of the Sterling Brown
type provides valuable insights into folk Negro
life” (the reviewer and two colleagues were unable,
in three tries, to separate the bastard and legitimate
dialects), it is difficult to tell just where this obser- 
vation fits. The task of sorting out the tangential 
and the suppositious falls to the reader. 

Secondly, since the “general principles” which 
purportedly underlie the varied phenomena of 
minority-majority relations simply do not exist— 
the authors do not, at any rate, get around to 
stating them—the desired “synthesis” is accom- 
plished by a kind of verbal stickum. Sometimes the 
stickum is a persuasive quotation from a gifted
writer, for example, Lillian Smith or Richard 
Wright, and sometimes it is an equally persuasive 
passage by the gifted authors. In both cases, plau- 
sibility and fluency are apparently accepted as 
criteria of truth. In a discussion of the use of anti-prejudice
propaganda, for example, the reader is
admonished to heed the interpretive limits imposed 
by the available experimental evidence. These 
limits are restrictive; the experimental findings 
cannot easily be extrapolated to the pressing social 
problem of reducing prejudice. Accordingly, the 
discussion concludes with the quotation of a num- 
ber of quite reasonable “principles that help one to 
understand the effectiveness of anti-prejudice 
propaganda.” These principles have wide applicability
and are wholly uncorroborated.

It is difficult to strike a balance between the 
book’s principal asset—its truly exhaustive cov- 
erage of sociological and historical data and opin- 
ion—and its principal liability—its lack of selec- 
tivity. The answer undoubtedly depends in part 
on the level of sophistication of its potential read- 
ers. The reviewer is somewhat hesitant about 
recommending its use as a text, for example, for 
fear that undergraduates will be too easily taken 
in by its facile summaries and pseudosystematic 
bias. Those better qualified to pick its meat, how-
ever, will probably find it a useful refresher and
reference source. 


JOSEPH G. SMITH


Commission on Human Resources 
and Advanced Training 




REVIEWS 


SOCIAL PSYCHOLOGY AND INDIVIDUAL VALUES.
By D. W. Harding. New York and London:
Longmans, Green (Hutchinson's University
Library), 1953. Pp. vii + 184. Price $2.40
(Text $1.80).

This book is concerned with what seems to be 
the central theme of social psychology, viz., the 
behavior of the individual in the group, or, as its 
author explicitly states it, with "the development 
of the social individual." 

While he is fully aware of the opposing belief, 
Harding begins his study of the social individual 
with the widely held biological view that all ani- 
mals, including man, are inherently social. Man's 
sociability is not, however, an instinct. Harding 
rejects the "herd instinct" of Trotter, the “gregari- 
ous propensity" of McDougall, and other similar 
modes of explanation. The error in the older views 
of the social nature of man, according to Harding, 
was their attribution of specificity to human socia- 
bility. What man wants most of the time, Harding 
believes, is “an awareness of friendly companion- 
ship in other activities, not specifically social ones” 
(p. 15), a knowledge that his activities are taking 
place in a congenial social context which stimulates 
a sense of being reciprocally sanctioned. 

Although he prefers the biological explanation 
of an innate social disposition, the argument and 
the evidence which Harding submits are not cru- 
cial. As a matter of fact they might be used to sup- 
port the contrary position equally well. To say, 
moreover, that this innate disposition requires a 
social context for its exercise can only mean opera-
tionally that it is an acquired habit in the first
place. In the face of his own analysis the author
would have been on firmer ground had he left the
question of priority an open one. 

Any attempt to account for the natural socia- 
bility of man inevitably runs into a collision with 
the stubborn fact of human pugnacity and hos- 
tility. Harding is as conscious of this problem as 
anyone, but his "solution" of it is unconvincing. 
In “integrative” behavior, he holds, conflict of 
aim is eliminated through the realization by the 
disputants that each looks upon the conflict from 
an exclusive perspective. By means of a special
kind of cooperation, characterized by mutual in-
fluence in the use of means to the achievement of
an end, they resolve their common difficulty. This
is the wish, no doubt, of all reasonable men; but
the realities of human conflict force one to describe 
its consummation as unduly optimistic. 

Although man is naturally a social animal, his 
sociability, it has been said, is developed only in a
social context. An explanation of his sociability is
found in his early social development. The process
of socialization, while long and varied, is summed
up in the infant's attachment to his mother, who 
ministers to his needs and provides him with his 




basic satisfactions. Out of this attachment to the 
mother the infant develops a rudimentary senti-
ment, and, as his circle of interactions widens, he
forms sentiments with respect to other individuals 
and their behavior, the final product of which is 
his own personality. 

Unlike many psychologists, Harding is not 
afraid to tackle the problem of the moral develop-
ment of man. In a chapter on “The Individual's
Morality" he deals with moral standards without 
moralizing. Being social beings, men place a high 
value on consideration for others. Whence comes 
this need to do right even when detection of doing 
wrong is unlikely? There are two reasons for this, 
one individual, the other social. “A breach of the 
moral code that we share with other people," 
Harding writes, “automatically deprives us of our 
sense of having group support in what we do: we 
know that our group now sanctions us not for 
what we are but for what we pretend to be” (p. 
54). 

The individual explanation is based on the 
holistic character of man. This consists in “the 
central effort of a living being to remain an inte- 
grated whole and avoid tensions that result from 
actions conflicting with the stable features of his 
personality, such as his sentiments and the estab- 
lished hierarchy of his values” (p. 54). Coherent 
moral behavior is not derived from membership 
in a social group but is a characteristic of man’s 
biological make-up. 

In a period of the development of social psy- 
chology when the Group, like the King of old, can 
do no wrong, it is gratifying to find in this book an 
examination of the group’s fitness for its members. 
Adjustment and conformity are here put in a bal- 
anced perspective. An important psychological 
problem is to examine the extent to which the 
group permits and to what extent it stifles the 
individual’s satisfaction of private needs and de- 
sires and his ability to adhere to values that other 
members of the group will not support. A group 
whose culture fails to bring out an individual’s 
possibilities, or one which, through rejection or iso- 
lation, handicaps its gifted members, is inadequate 
to their needs. In this connection the author rightly 
criticizes the trend of much social psychology and 
psychotherapy toward a passive and uncritical 
acceptance of the standards of the dominant social
group.

In the chapter on “The Social Meaning of Nor-
mality” we find familiar material, but it bears
repetition. Like most psychologists, Harding re- 
spects the statistical conception of normality, but 
he is not beguiled by its quantitative formulation. 
Neither does he go the whole way with those cul- 
tural relativists who would define normality and 
abnormality wholly in terms of cultural norms. A 
view that will do justice to the richness and com- 




plexity of human behavior, he shows, must also 
consider the degree of integration or wholeness of 
the individual. Much ill health, he says, expresses 
"the mind's failure to achieve or maintain integra-
tion"; and an individual is "maladjusted" to the
extent that his mind is "divided against itself and
failing to maintain its wholeness" (p. 118). 

There are other important topics in this book 
that deserve consideration but which must be 
omitted in this brief review. Harding's analysis of 
leadership, a topic yet to be adequately treated by 
social psychologists, deserves the reader's careful 
attention. The term “leader,” he suggests, “might 
profitably be kept for one who guides and coordi- 
nates the efforts of a number of people (or at least 
one other person) while they perform some activ- 
ity" (p. 107). A chapter each on “Innovators” and 



“The Average and the Excellent” round out the 
book and suggest a measure of the sensitivity of 
the author to human problems. 

The American reader of this British book who 
gauges its adequacy by textbook standards will miss 
a number of conventional topics. On occasion he 
may even become impatient with some of its philo- 
sophical overtones. These are on close examination, 
however, the astute observations by a sensitive 
mind of some of the broad implications of empirical 
research in social psychology. While the author is 
bound by the facts, he is not their slave. The result 
is a fine combination of scientific truth, practical 
wisdom, and literary excellence. 

HUBERT BONNER 

Columbia University. 


NEW BOOKS RECEIVED 


ABRAMSON, H. A. (Ed.) Problems of consciousness.
New York: Josiah Macy, 1954. Pp. 177. Price 
$3.25. 

ARNOLD, MAGDA B., & GASSON, J. A. The human
person. New York: Ronald, 1954. Pp. x + 593. 
Price $5.75. 

BAKER, JAMES, JR. The exteriorization of the mental 
body. New York: William-Frederick Press, 
1954. Pp. 32. Price $1.50.

BARRELL, JOSEPH. A philosophical study of the hu-
man mind. New York: Philosophical Library,
1954. Pp. xi + 575. Price $6.00. 

BOSSARD, J. H. S. The sociology of child development.
(Rev. Ed.). New York: Harpers, 1954. Pp. xi 
+ 788. Price $6.00. 

BRAATØY, TRYGVE. Fundamentals of psychoanalytic
technique. New York: Wiley, 1954. Pp. xi + 
404. Price $6.00. 

BRAY, D. W. Issues in the study of talent. New
York: King’s Crown Press, 1954. Pp. xi + 65. 
Price $2.00. 

BURMEISTER, EVA. Roofs for the family. New York:
Columbia Univer. Press, 1954. Pp. 203. Price

125; 

BURT, SIR CYRIL. The causes and treatment of back-
wardness. New York: Philosophical Library,
1954. Pp. 128. Price $3.75. 

CAPLOW, THEODORE. The sociology of work. Min-
neapolis: Univer. of Minnesota Press, 1954.
Pp. viii + 330. Price $5.00. 

CHINOY, ELY. Sociological perspective: basic con-
cepts and their application. New York: Double-
day, 1954. Pp. vi + 58. Price $.85.

COMMINS, W. D., & FAGIN, BARRY. Principles of
educational psychology. (2nd Ed.) New York:
Ronald, 1954. Pp. xvi + 795. Price $5.75.

CRONBACH, LEE J. Educational psychology. New
York: Harcourt, Brace, 1954. Pp. xxvii + 
628. Price $7.50. 

DONIGER, SIMON. (Ed.) Religion and human beha-
vior. New York: Association Press, 1954. Pp.
xxii + 233. Price $3.00. 

ENGLISH, O. S., & FINCH, S. M. Introduction to
psychiatry. New York: Norton, 1954. Pp. viii 
+ 621. Price $7.00. 

FABIAN, ROBERT. London after dark. New York:
British Book Centre, 1954. Pp. 237. Price

FINK, H. K. Long journey. New York: Julian Press,
1954. Pp. xviii + 298. 

Fink, H. K. Mind and performance. New York: 
Vantage Press, 1954. Pp. xi + 113. Price $3.00. 

FRASER, D. C. A psychological glossary. Cambridge,
England: Heffer & Sons, 1954. No pages 
numbered. Price 3s6d. 

FROSCH, JOHN. (Ed.) The annual survey of psycho-
analysis. Vol. II. New York: International
Universities Press, 1954. Pp. xiv + 724.
Price $10.00.

GALDSTON, IAGO. (Ed.) Beyond the germ theory.
New York: Health Education Council, 1954. 
Pp. viii + 182. Price $4.00. 

GEDDES, D. P. (Ed.) An analysis of the Kinsey re- 
ports. New York: Dutton, 1954. Pp. x + 319. 
Price $3.50. 

GILL, Merton, NEWMAN, RICHARD, & REDLICH, 
F. C. The initial interview in psychiatric prac- 
tice. New York: International Universities 
Press, 1954. Pp. 423. Price $6.00, plus phono-
graph records $4.60.

GORER, GEOFFREY. The life and ideas of the Mar-
quis de Sade. New York: British Book Centre,
1954. Pp. 244. Price $3.50.

GUTKIND, E. A. Community and environment. New
York: Philosophical Library, 1954. Pp. xiii + 
81. Price $3.75. 

HADFIELD, J. A. Dreams and nightmares. Baltimore: 
Penguin Books, 1954. Pp. x + 243. Price
$.65.

HEATH, R. G., et al. Studies in schizophrenia.
Cambridge: Harvard Univer. Press, 1954. Pp. 
xiv + 619. Price $8.50. 

HODGSON, K. W. The deaf and their problems. New
York: Philosophical Library, 1954. Pp. xx + 
348. Price $6.00. 

HOFSTÄTTER, P. R. Einführung in die Sozialpsy-
chologie. Stuttgart-Wien: Humboldt-Verlag,
1954. Pp. 535. Price: DM 12.50. 

ITTELSON, W. H., & CANTRIL, HADLEY. Perception:
a transactional approach. New York: Double- 
day, 1954. Pp. ix + 33. Price $ .85. 

JERSILD, A. T. Child psychology. (4th Ed.) New
York: Prentice-Hall, 1954. Pp. xii + 676.
Price $6.00.

JUNG, C. G. (Trans. by R. F. C. Hull.) The practice
of psychotherapy. Vol. XVI of Collected Works 
of Jung. New York: Pantheon (Bollingen 
Series XX), 1954. Pp. xi + 377. Price $4.50. 

KATZ, DANIEL. (Ed.) Public opinion and propa-
ganda. New York: Dryden, 1954. Pp. xx +
779. Price $6.25. 

KLINEBERG, OTTO. Social psychology. (Rev. Ed.)
New York: Holt, 1954. Pp. xiii + 578. Price 
$5.25. 

KOHLER, CLAUDE. Les déficiences intellectuelles chez
l'enfant. Paris: Presses Universitaires de
France, 1954. Pp. 220. Price not supplied. 

LANDER, BERNARD. Towards an understanding of 
juvenile delinquency. New York: Columbia
Univer. Press, 1954. Pp. xv + 143. Price $3.00.

Learning theory, personality theory and clinical re-
search. The Kentucky Symposium. New York:
Wiley, 1954. Pp. ix + 164. Price $3.50.

LEWIS, N. D. C., & ENGLE, BERNICE. (Eds.)
Wartime psychiatry: a compendium of inter- 
national literature. New York: Oxford Univer. 
Press, 1954. Pp. vi + 952. Price $15.00. 

LiNDGREN, H. C. Mental health in education. New 
York: Holt, 1954. Pp. xiii + 561. Price $4.75. 

McGILL, V. J. Emotions and reason. Springfield,
Ill.: Charles C Thomas, 1954. Pp. xiii + 122. 
Price $3.25. 

MARTIN, J. B. Break down the walls. New York: 
Ballantine Books, 1954. Pp. 310. 

MYKLEBUST, H. R. Auditory disorders in children.
New York: Grune & Stratton, 1954. Pp. xiii
+ 367. Price $6.00.

NEUMANN, ERICH. (Trans. by R. F. C. Hull.) The
origins and history of consciousness. New York:
Pantheon (Bollingen Series XLII), 1954. Pp.
xxiv + 493. Price $5.00.

ODLUM, DORIS M. Psychology, the nurse, and the
patient. New York: Philosophical Library,
1954. Pp. 168. Price $4.75.

Parsons, TALCOTT. Essays in sociological theory. 
(Rev. Ed.) Glencoe, Ill.: Free Press, 1954. Pp. 
459. Price $6.00. 

PEARSON, G. H. J. Psychoanalysis and the education
of the child. New York: Norton, 1954. Pp. x + 
357. Price $5.00. 

PENNINGTON, L. A., & BERG, I. A. An introduction 
to clinical psychology. (2nd Ed.) New York: 
Ronald, 1954. Pp. vii + 709. Price $6.50. 

PEPINSKY, H. B., & PEPINSKY, PAULINE N. Coun-
seling: theory and practice. New York: Ronald,
1954. Pp. viii + 307. Price $4.50. 

REMMERS, H. H. Introduction to opinion and atti-
tude measurement. New York: Harpers, 1954.
Pp. viii + 437. Price $5.00.

RIESMAN, DAVID. Individualism reconsidered.
Glencoe, Ill.: Free Press, 1954. Pp. 529. Price 
$6.00. 

Runes, D. D. Letters to my daughter. New York: 




Philosophical Library, 1954. Pp. 131. Price 
$2.50. 

RYAN, T. A., & SMITH, PATRICIA C. Principles of
industrial psychology. New York: Ronald, 
1954. Pp. xiv + 534. Price $5.50. 

Sarason, S. B. The clinical interaction. New York: 
Harpers, 1954. Pp. x + 425. Price $5.00. 

SCHANCK, R. L. The permanent revolution in science.
New York: Philosophical Library, 1954. Pp.
xvi + 112. Price $3.00.

Selected writings of de Sade. (Selected and trans- 
lated by Leonard de Saint-Yves) New York: 
British Book Centre, 1954. Pp. 306. Price 
$6.75. 

Suara, J. A. Alcoholism. Philadelphia: Lippincott, 
1954. Pp. vii + 72. Price $3.00. 

SOLOMON, J. C. A synthesis of human behavior. New
York: Grune & Stratton, 1954. Pp. xii + 265.
Price $5.50.

SULLIVAN, H. S. The psychiatric interview. New
York: Norton, 1954. Pp. xxiii + 246. Price 
$4.50. 

THE TULANE DEPARTMENT OF PSYCHIATRY AND 
NEUROLOGY. Studies in schizophrenia: a
multi-disciplinary approach to mind-brain 
relationships. Cambridge: Harvard Univer. 
Press, 1954. Pp. xiv + 619. Price $8.50. 

THORPE, L. P., & SCHMULLER, A. M. Contemporary
theories of learning. New York: Ronald, 1954. 
Pp. viii + 480. Price $5.50. 

WOLBERG, L. R. Technique of psychotherapy. New
York: Grune & Stratton, 1954. Pp. xiv + 
869. Price $14.75. 

WOLSTEIN, BENJAMIN. Transference: its meaning
and function. New York: Grune & Stratton, 
1954. Pp. xiii + 206. Price $5.00. 

WULFECK, J. W., & BENNETT, E. M. The language
of dynamic psychology as related to motivation 
research. New York: McGraw-Hill, 1954. Pp. 
111. Price $4.00. 

ZILBOORG, GREGORY. The psychology of the crimi-
nal act and punishment. New York: Harcourt,
Brace, 1954. Pp. xi + 141. Price $3.50. 


AUTHOR INDEX


(The names of authors of original articles are printed in CAPITALS; of authors
of books reviewed, in Roman; and of reviewers in Italics.)


PAGE 
PAcE 423, 554 
427 Gmcmusr, J. C. 325 
Apaus, J. 246 — GorpsEme, S. C. 443 
ADELSON, J. 219 Goonsrem, L. D. 475 
Auuison, H. W. 219° Gorlow, L. P 
Aruson, S. G. 224 Greer, F. L. 341 
ApPrEzwEIG, D. G. 477 Grossack, M. M. "a 
Arxinson, J. W. 405  Harrwzm, D. : 156 
Atkinson, J. W. 445 Hau, H. E., Jr. 19 
AXELROD, H. 474 Harn, A. W. 99 
Baker, A. 595  HANFMANN, E. 603 
Baron, S. H. 419 Harding, D. W. 362 
Becker, W. C. 178 HARRINGTON, R. 478 
Ber, E. G. 156 Hartley, E. L. 129 
BELL, G, B. 463 Hastorr, A. H. | 405 
Bennett, E. M. 543 Hews, R. W. 475 
BerkeLEY, M. H. 251 Hoch, E. L. 595 
Brnkowrrz, M. 256  HorrwAN, E. L. ; 365 
Bpra, D. / 29  HOLLANDER, E. P. 319 
Biro, C. 311, 573 Holt, R. R. 485 
Brake, R. R. 94  HorrzwaN, W. H. 201 
Bru, G. S. 603 Horwitz, M. 399 
Bonner, H. 311 Howarp, A. R. 513 
Bremm, J. W. 427 Horr, H. A. 87 
Brown, D. R. 593 Irvine, R. P. 330 
Brown, K. B. 454 Janona, G. 598 
Brown, R. W. 314 Jahoda, M. 211 
Bunpocx, E. I. 129 Janis, IL. 196 
Canrri, H. . : A71 Jessor, R. 463. 
Canscu, H. 478  JOHANNSEN, D. E. 319 
Cartwright, D. 508 Jones, E. 474 
Cxoporkorr, B. 598 Jones, M. 51 
Christie, R. i 417 Jourarp, S. M. 111 
Clark, R. A. 135 Joyce, J-N. 115 
Ctecktey, H. 84 Karz, D. 533 
Conen, L. D. 290 Kezem, J. D. 399 
Conn, B. N. 471 Ketty, G. A. 211 
Coun, T. S. 7 Kine, B.T. 158 
Cores, M. R. 178 Kinsey, A. C. 592 
Cowen, E. L. 371 KorNeTSKY, C. 173 
Cox, F. N. 355  KuRLAND, S. H. 251 
Curtiss, J. 99  LarraL, J. 529 
Dorris, R. J. ? 593  LANDFIELD, A. W. 445 
Dovorass, W. B. 251 LANGHAM, P. 445 
Dnzvpanr, J. 241 — LANZETZA, J. T. 396 
Ecrasn, A. 362 Lazarus, R. S. 201 
Eurmann, J. C. 491 Le, F.J. 454 
Ersen, N. H. 45,561  LENNEBERG, E. H. 251 
Eriksen, C, W. "600 Levine, J. 99 
Estes, S. G. 476 Levinson, D. J. 472 
Evans, J. 509 Levitt, E. E. 251 
Eysenck, H. J. 513 LINDEMANN, J. 75, 464 
Fesrincrr, L. 381 Layton, H. B. 477 
Fizpier, F, E. 503 Lowell, E. L. "i 423 
Frsuxn, S, 111 Lupemay, J. F. 579 
Firzceratp, BJ. ^ 474 LURIA, Z. 423 
Freeman, T. 466 Lysax, W. * an 
FRENKEL-BRUNSWIK, E. 573 McArthur, C. C. : 36 
Frucuter, B. 41  McBam, W.N. 477 
GALANTER, E. H. 158 McClelland, D. C. 109 
Gebhard, P. H. 592 McCoNNELL, J. 
Gerar, D. L. 




'.McGavcnnaN, L. S, * 
MoKzacur, W. J. - 
MacRar, D., Jr. - 
Martin, C. E. . 

_ Mastine, J. M. 

` MAUSNEZ, B. 
MELIKIAN, L. 
Mensu, I. N. 
MeRrESKO, R. 
Merry, J. 

* Monsm, S. M. 

* Monacuest, E. D. 

` Moreno, J. L. 
Morrow, W. R. 
Movron, J. S. 
Munnay, E. J. 
Mossen, P. H. 
NAvron, H. K. 
Newcomb, T. M. 
Nort, P. G. 
O’Rertty, C. T. 
O'Reny, E. J. 

a Oscoon, C. E. 
Peterson, D. R. 
PrrcnEs, B. 
Pomeroy, W. B. 
Pomryn, B. A. 

... Proruro, E. T. 

c 7 Ramsey, G. V. 
Reece, M. M. 
RoniNs, E. 

* Roe, A. 

Ronnzm, J. H. 

~ ROSENBERG, S. 


PAGE 
183 
282 

14 

158 
316 
65, 557 
59 

t 435 
89 
474 
152 
29 
478 
89 
573 
305 
235 
235 
598 




Sarason, S.B. . 
SARNOFF, L 
Scen, E: H., | ^* 

. Serrs, S. B. Wo 
Sgaw, M. E.. TN 
Suontz, F, Cy e N 
SmvERMAN,R. E+ 
Simpson, G. E. 

© Singer, J. L. 
Smith, J. G. 
SorowoN, R. L. 
Sommer, R. 
SoskiN, W. F. * 
Spence, K. W. 
Sracey, C. L. 
SrEINER, I. D. 
Stephenson, W. 
Suxtrvan, P. L. 
SwaNDER, D. V. 
Tart, R. 
TALLAND, G. A. 
TAYLOR, J. A. 
Telschow, E. F. 
TurGPEN, C. H. 
Tuxford, J. 

U. S. Veterans Administration 
Verorr, J. 

WALKER, L. C. 

Wess, S. C. 

Wenar, C. 

WnrTEMAN, M. 

Winnie, C. 

Wisrf, L. G. 

Wood, A. B. 

Waicat, G. O. 


FOREWORD

By John W. Gardner

The revolution in modern physics has forced us to re-examine fundamental
assumptions both in science and in our everyday thinking. No man can predict
the ultimate consequences of this re-examination, but nothing seems more
certain than that it will lead to a more intensive study of the psychology of
perception and the psychology of language. For one of the most significant yields
of the recent developments in physics has been a renewed awareness of the role
of the observer.

The intimate relationship between the observer and the observed is, of course, 
a very, very old story. Parmenides and Democritus were aware of it. Philosophers 
through the centuries have commented on it and some have built their philoso- 
phies upon it. The recent work in physics has simply pointed up explicitly and 
with considerable poignancy certain possible limitations on man’s capacity to 
perceive and conceptualize. 

Any concern with intrinsic limitations upon man’s capacity to conceptualize,
or limitations inherent in his mold of thought, must lead inevitably to a concern
for the psychology of language. P. W. Bridgman* made the point vigorously in
a recent paper: “We cast the world into the mold of our perceptions. The fact
that the world I construct is so much like the world you construct is evidence of
the similarity of our nervous systems, something which any physiologist could
demonstrate for you more directly. We all of us perceive the world in terms of
space and time. An interesting question is how inevitably we are forced to this
perception by the common properties of our nervous systems, or to what extent
it is adventitious, depending on universal features in early experience and in
particular on necessities incident to the use of language. This question is possibly
capable of some sort of experimental attack, but I think in any event we are here
perilously close to the verge of meaning itself. Some answer may eventually be
found to the meaningful aspects of the question.”

The renewed interest in language growing out of the perplexities of modern
science is only one—and by no means the most important—of the influences which
have produced intensified work on the psychology of language. Descriptive
linguists came out of the war immensely stimulated by the heavy demand which
had been placed on their skills during the emergency. Starting from a wholly
different vantage point, communications engineers have carried through an
enormously productive series of studies in acoustics, auditory perception, and
the intelligibility of speech sounds. Out of these studies has developed a theory
of communication which has proved of great interest to psychologists and philos-
ophers as well as to mathematicians and physical scientists.


Through these and other developments, psychologists, anthropologists, philoso-
phers and others who had always exhibited some interest in language developed a
renewed concern for the subject. But their various lines of approach to the problem
of language were in some respects remarkably disparate. The descriptive linguists
discussing phonemes, the communications engineers discussing binary digits,
and the psychologist discussing linguistic responses seemed most of the time to
be engaged in wholly unrelated conversations. Here and there one could find
individuals whose training was sufficiently broad to participate in all three
conversations, but the overlap was tenuous.

* P. W. Bridgman, The task before us. Proceedings of the American Academy of Arts and
Sciences, 88: 3. 104.

PSYCHOLINGUISTICS: A SURVEY OF THEORY AND RESEARCH PROBLEMS


It was in this context that the Social Science Research Council set up a Com-
mittee on Linguistics and Psychology in October, 1952. The purpose of the Com-
mittee was to bring together men trained in the various fields relating to the
study of language with a view to planning and developing research on language
behavior.

The initial membership of the Committee was as follows: Charles E. Osgood
(psychologist, University of Illinois), chairman; John B. Carroll (psychologist,
Harvard); Floyd G. Lounsbury (linguist, Yale); George A. Miller (psychologist,
Massachusetts Institute of Technology); and Thomas A. Sebeok (linguist,
Indiana University). Joseph B. Casagrande (anthropologist) of the Social
Science Research Council served as staff for the Committee. Mr. Miller resigned
after serving on the Committee for one year, while Joseph H. Greenberg (linguist,
Columbia) and James J. Jenkins (psychologist, University of Minnesota) were
added to the Committee in the autumn of 1953.

One of the early steps taken by the Committee was to plan and sponsor a
research seminar in psycholinguistics. This seminar was held in conjunction with
the Linguistic Institute at Indiana University during the 1953 summer session.
The seminar first set itself the task of examining three differing approaches to
the language process: (1) the linguist’s conception of language as a structure of
systematically interrelated units, (2) the learning theorist’s conception of lan-
guage as a system of habits relating signs to behavior, and (3) the information
theorist’s conception of language as a means of transmitting information. These
various points of view were explored in order to appraise their utility for han-
dling different problems and to discover in what respects they could be brought
into a common conceptual framework. The second task which the seminar set
itself was to examine a variety of research problems in psycholinguistics with a
view to developing possible experimental approaches to them.

This monograph is one result of the seminar. It is a collaborative product of
the entire group of seminar participants, each of whom is author of one or more
sections.

The authors of the monograph, and particularly the two editors, Charles E.
Osgood and Thomas A. Sebeok, are to be congratulated upon having carried
through an exceedingly arduous assignment. Those who have been familiar with
one or another of these fields (and the monograph is written precisely for them)
will recognize how difficult it was to bring into a common framework theoretical
models of the language process which had evolved independently out of differing
kinds of data and differing approaches to these data. The authors would be the
first to recognize the extent to which they have fallen short of their goal. Yet it
seemed important to them—and this feeling must surely be widely shared—that
someone undertake the difficult task of bringing together these vital
lines of research.

Research workers in the special fields involved have reason to be grateful to
the authors of this monograph who took time out from their own active research
interests to undertake this difficult exploratory task.


May 12, 1954


PREFACE
The Summer Seminar on Psycholinguistics was sponsored by the Social
Science Research Council with funds provided by the Carnegie Corporation of
New York and held at Indiana University in 1953. It was part of the program of
study being developed by the Council's Committee on Linguistics and Psy-
chology, most of whose members were participants. It was also in part a con-
tinuation of the interuniversity summer research seminar in psychology and
linguistics held at Cornell University, June 18—August 10, 1951. The general 
purpose of this Committee is to stimulate research in the field of language be- 
havior, by conducting a survey of on-going and contemplated research, by 
organizing where feasible small-scale work-conferences on special problems, 
by serving as a communication channel among people working in this area, 
and by discussing and evaluating the present status of the field. It was felt that 
a summer seminar would provide an unusual opportunity for the members of 
this Committee to work together intensively over an eight-week period and 
thereby develop a more intimate understanding of their mutual problems in 
the language area, as well as placing them in a better position to organize effec- 
tive work-conferences and study programs. 

In the course of the seminar’s activities, it was planned to examine three of 
the theoretical models of the language process which have been developing 
rather independently; the membership in the seminar included persons with 
training in each of these areas. Another purpose of the seminar was to study 
intensively a number of basic research problems, combining the training and 
research experiences of the participants in analysing the theoretical backgrounds 
of these problems and in formulating possible experimental approaches to them. 
In rough accord with these plans, approximately the first half of the eight-week 
period of the seminar was spent in the presentation and discussion of the various
psycholinguistic problems as approached from these theoretical positions; during 
the second half of the seminar, the participants worked informally in over- 
lapping groups on particular problems in psycholinguistics that were felt to be 
of major significance. 

Participants in the Summer Seminar on Psycholinguistics, and hence joint 
authors of this report, included, in addition to the senior staff members, Green- 
berg, Jenkins, Lounsbury, Osgood, and Sebeok, the following graduate student 
members: Susan Ervin (psychologist, Bureau of Social Science Research, Washing-
ton, D.C.), Leonard D. Newmark (linguist, Indiana University), Sol Saporta (lin-
guist, University of Illinois), Donald E. Walker (psychologist, The Rice Institute),
and Kellogg Wilson (psychologist, University of Illinois). It is fair to say that our 
graduate students contributed on equal terms with the senior staff both in dis- 
cussion of psycholinguistic problems and in the writing of this report; it also can 
be fairly said that they profited greatly from the summer's experience. The 
development of any new interdisciplinary field must ultimately depend on 
young scholars who maintain in a single nervous system the habits of both 



sciences. Three others were able to participate only through two-week periods 
of the seminar—John B. Carroll (psychologist, Harvard University), Eric 
Lenneberg (linguist, Massachusetts Institute of Technology), and Joseph B.
Casagrande (staff representative for the Social Science Research Council)—but 
they also joined significantly in the work of the seminar and have contributed 
to the content of this report. In addition, the seminar enjoyed visits from a 
number of scholars interested in the same general area: Grant Fairbanks (psy- 
chologist, University of Illinois) demonstrated his speech compression and 
expansion techniques and also discussed delayed auditory feedback phenomena 
and the theoretical and practical implications of this work; E. M. Uhlenbeck 
(linguist, University of Leyden) sat in on our discussion of entropy profiles in 
sequential speech and played tape recordings made of conversational Javanese; 
John Lotz (linguist, Columbia University) participated in discussions on the 
problem of meaning and Werner F. Leopold (linguist, Northwestern University) 
in discussions on the development of language behavior in children.

We decided to hold our seminar on the campus of Indiana University in 
conjunction with the Linguistic Institute. The members of the seminar were 
welcomed at the daily luncheons of the Institute and were thus able, informally, 
to meet and discuss many matters with the staff of the Linguistic Institute. 
Our graduate student participants typically carried two courses offered by the 
Institute and usually sat in on others. Most of the senior staff also took advan- 
tage of this opportunity and sat in on one or more of the courses being offered. 
While these “extra-curricular” activities certainly reduced the time we could 
devote to the seminar, they contributed to our understanding of psycholinguistic 
problems. The members of the Summer Seminar on Psycholinguistics thank 
both the Linguistic Institute, particularly its Director, C. F. Voegelin, and the 
administration of Indiana University, particularly Vice-President John W. 
Ashton, for making our summer visit both enjoyable and profitable. We also
wish to express our gratitude here to the Social Science Research Council and to 
the Carnegie Corporation of New York for their continued support of this and 
other interdisciplinary studies in the area of language behavior. 

A final word is in order concerning the preparation of this report and its nature. 
During the latter portion of the seminar, each of the informal work-groups had 
a chairman whose responsibility it was to organize study of a particular problem 
and its presentation to the seminar as a whole. When it was later decided to 
prepare a report for possible publication, it became each chairman’s responsibility 
to collate materials from the members of his group and write an initial draft. 
Although specific sections of this report were thus written by individuals (as 
indicated by footnotes throughout), the actual thought and discussion of each 
topic was so thoroughly shared within the seminar that it would be difficult if 
not impossible to properly assign either credit or responsibility as the case might 
be. Therefore, we wish the reader to view this report as truly a joint product. We 
also hope the reader will keep in mind that this represents the result of only eight 
weeks’ work. It is an exploratory survey of an interdisciplinary area, not a


PSYCHOLINGUISTICS: A SURVEY OF THEORY AND RESEARCH PROBLEMS ix 


scholarly exposition of well-mapped territory; it formulates many problems and 
suggests possible attacks on them, but it does not present the results of research. 
So, it is with some trepidation that we offer this rather crude map of what is 
becoming an important research area—psycholinguistics. 
Charles E. Osgood, Editor
University of Illinois 
Thomas A. Sebeok, Associate Editor 
Indiana University
December 1, 1953 




1. INTRODUCTION
1.1. Models of the Communication Process


In the most general sense, we have communication whenever one system, a
source, influences another system, a destination, by manipulation of the
alternative signals which can be carried in the channel connecting them. The
information source is conceived as producing one or more messages which must be
transformed by a transmitter into signals which the channel can carry; these
signals must then be transformed by a receiver back into messages which can be
accepted at the destination. This minimal system, borrowed from Shannon’s
discussion of the theory of information¹ and diagrammed in Figure 1, has been
applied, with


Figure 1
great generality, to information transmission in electrical, biological,
psychological and social systems as well as language communication in the strict
sense. In a telephone communication system, for example, the messages produced
by a speaker are in the form of variable sound pressures and frequencies which
must be transformed into proportional electrical signals by the transmitter;
these signals are carried over wire (channel) to a receiver which transforms them
back into the variable sound pressures and frequencies which constitute the
message to be utilized by the listener. The activity of the transmitter is usually
referred to as encoding and that of the receiver as decoding. Anything that
produces variability at the destination which is unpredictable from variability
introduced at the source is called noise.
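The chain of source, transmitter, channel, receiver, and destination described above can be sketched in a few lines of Python. This is only an illustrative toy, not part of Shannon's formulation: the signal alphabet, the function names, and the `noise_rate` parameter are all assumptions introduced here. Noise is modeled as random replacement of a signal in the channel, so that what arrives at the destination cannot be fully predicted from what the source sent.

```python
import random

# Hypothetical signal alphabet: each character is one alternative signal.
ALPHABET = "abcdefghijklmnopqrstuvwxyz "

def encode(message):
    """Transmitter: transform a message into signals the channel can carry."""
    return [ALPHABET.index(ch) for ch in message]

def decode(signals):
    """Receiver: transform carried signals back into a message."""
    return "".join(ALPHABET[s] for s in signals)

def channel(signals, noise_rate=0.0, rng=None):
    """Carry signals; with probability noise_rate a signal is replaced at random."""
    rng = rng or random.Random(0)
    carried = []
    for s in signals:
        if rng.random() < noise_rate:
            carried.append(rng.randrange(len(ALPHABET)))  # noise
        else:
            carried.append(s)
    return carried

def communicate(message, noise_rate=0.0, rng=None):
    """Source -> transmitter -> channel -> receiver -> destination."""
    return decode(channel(encode(message), noise_rate, rng))

print(communicate("hello world"))                  # noiseless channel
print(communicate("hello world", noise_rate=0.3))  # some signals corrupted
```

With `noise_rate` left at zero the destination receives exactly the message the source produced; raising it introduces variability at the destination that does not derive from the source.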

This model of the communication process, developed in connection with
engineering problems, was not intended to provide a satisfactory picture of
human communication. For one thing, it implies a necessary separation of source
and destination, of transmitter and receiver, which is usually true of mechanical
communication systems but not of human ones. The individual human functions
more or less simultaneously as a source and destination and as a transmitter and
receiver of messages; indeed, he is regularly a decoder of the messages he himself
encodes through various feedback mechanisms. Each individual in a speech
community may be conceived as a more or less self-contained communicating
system, encompassing in his nervous apparatus, from receptors to effectors, all of
the components shown in Figure 1. If we rearrange the components in Shannon’s
model in the fashion shown in Figure 2, what might be called a communication
unit is described, equipped to both receive and send messages. In the process of


1 Shannon and Weaver, The mathematical theory of communication (University of Illinois
Press, 1949). Mathematical aspects of Shannon's theory of signal transmission are discussed
in section 2.3 of this report.


human decoding, input of some form of physical energy, linguistically or otherwise
coded, is first recoded into sensory neural impulses, operated upon by receiving
apparatus, and finally ‘interpreted’ at the destination (presumably as some
pattern of activity in the higher centers). In the process of human encoding, an
‘intention’ of the source (presumably some pattern of activity in the same centers)
is operated upon by transmitting apparatus in the motor areas, is recoded into
physical movements, and becomes the output of this unit. Translating into
traditional psychological language, input becomes equivalent to ‘stimulus,’
receiver becomes ‘reception’ and ‘perception,’ destination and source become
‘cognition’ (meaning, attitude, and the like), transmitter becomes ‘motor
organization and sequencing,’ and output becomes ‘response.’

Figure 2. Communication Unit (RECEIVER, DESTINATION: decoding; SOURCE, TRANSMITTER: encoding)

Another insufficiency of engineering models for human communication purposes
is that they are not designed to take into account the meaning of signals,
e.g., their significance when viewed from the decoding side and their intention
when viewed from the encoding side. The research generated by such models
has dealt almost exclusively with relations between transmitter and receiver, or
with the individual as a single system intervening between input and output
signals. This has not been because of lack of awareness of the problem of meaning
or its importance, but rather because it is admittedly difficult to be rigorous,
objective, and quantitative at this level. Nevertheless, one of the central
problems in psycholinguistics is to make as explicit as possible relations between
message events and cognitive events, both on decoding and encoding sides of the
equation.

Human communication is chiefly a social affair. Any adequate model must
therefore include at least two communicating units, a source unit (speaker) and
a destination unit (hearer). Between any two such units, connecting them into a
single system, is what we may call the message. For purposes of this report, we
will define message as that part of the total output (responses) of a source unit
which simultaneously may be a part of the total input (stimuli) to a destination
unit. When individual A talks to individual B, for example, his postures, gestures,
facial expressions and even manipulations with objects (e.g., laying down a
playing card, pushing a bowl of food within reach) may all be part of the message,
as of course are events in the sound wave channel. But other parts of A’s total
behavior (e.g., breathing, toe-wiggling, thinking) may not affect B at all and other
parts of the total stimulation to B (e.g., sensations from B’s own posture, cues
from the remainder of the environment) do not derive from A’s behavior; these
events are not part of the message as we use the term. These R-S message
events (reactions of one individual that produce stimuli for another) may be
either immediate or mediate: ordinary face-to-face conversation illustrates the
former and written communication (along with musical recordings, art objects,
and so forth) illustrates the latter.

Figure 3. Source Unit and Destination Unit, each RECEIVER -> MEDIATOR -> TRANSMITTER (decoding, encoding), with the provinces of the disciplines (Social Sciences, Communications, and others) marked

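The definition of message given above, as that part of the output of a source unit which is simultaneously part of the input to a destination unit, amounts to an overlap between two sets of events. A minimal sketch follows, assuming that events can be named by simple labels; the labels are taken from the examples in the text except for "room noise", which is a hypothetical stand-in for cues from the remainder of the environment.

```python
def message(source_output, destination_input):
    """Events that are both responses of the source and stimuli to the destination."""
    return set(source_output) & set(destination_input)

# Total output (responses) of individual A.
a_output = {
    "speech sounds", "gesture", "laying down a playing card",
    "breathing", "toe-wiggling",
}

# Total input (stimuli) to individual B.
b_input = {
    "speech sounds", "gesture", "laying down a playing card",
    "B's own posture", "room noise",  # "room noise" is a hypothetical label
}

print(sorted(message(a_output, b_input)))
```

Breathing and toe-wiggling fall outside the message because they do not reach B, and B's own posture falls outside it because it does not derive from A.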
Figure 3 presents a model of the essential communication act, encoding of a
message by a source unit and decoding of that message by a destination unit.
Since the distinction between source and destination within the same
communicator (as shown in Figure 2) seems relevant only with respect to the
direction of information exchange (e.g., whether the communicator is decoding or
encoding), we substitute the single term mediator for that system which intervenes
between receiving and transmitting operations. The ways in which the various
sciences concerned with human communication impinge upon and divide up the
total process can be shown in relation to this figure.


1.2. Disciplines Concerned with Human Communication 


Microlinguistics (or linguistics proper) deals with the structure of messages, the
signals in the channel connecting communicators, as events independent of the
characteristics of either speakers or hearers. Once messages have been encoded
and are “on the air,” so to speak, they can be described as objective, natural
science events in their own right. In an even stricter sense, the linguist is
concerned with determining the code of a given signal system, the sets of
distinctions which are significant in differentiating alternative messages. The term
exolinguistics (sometimes called metalinguistics) has been used rather loosely by
linguists to cover all those other aspects of language study which concern
relations between the characteristics of messages and the characteristics of
individuals who produce and receive them, including both their behavior and
culture. Whether or not the grammatical structure of a language influences the
thinking of those who speak it is thus an exolinguistic problem. The social
sciences in general, and psychology, sociology, and anthropology in particular,
are concerned with the characteristics of human organisms and societies which
influence the selection and interpretation of messages: attitudes, meanings,
social roles, values,




and so forth. The rather new discipline coming to be known as psycholinguistics
(paralleling the closely related discipline termed ethnolinguistics) is concerned in
the broadest sense with relations between messages and the characteristics of
human individuals who select and interpret them. In a narrower sense,
psycholinguistics studies those processes whereby the intentions of speakers are
transformed into signals in the culturally accepted code and whereby these signals
are transformed into the interpretations of hearers. In other words,
psycholinguistics deals directly with the processes of encoding and decoding as
they relate states of messages to states of communicators. The terminal aspect of
human speech encoding, production of speech sounds, is the special province of
phonetics. Similarly, the initial aspect of human speech decoding, whereby sound
pressures and frequencies are transformed into impulses in auditory nerve fibers
and relayed to the cortex, is a special field of psychoacoustics. Finally, the science
of human communication would be concerned with relations between sources who
select messages and destinations who interpret and are affected by them. In the
broadest sense, therefore, human communications as a science includes the other
disciplines that have been mentioned; in a narrower sense, and one more in
keeping with contemporary activities, students of communications research
have usually worked at grosser levels of analysis, concerning themselves with
sources such as radio and the newspaper and destinations such as the mass
audience, members of another culture, and so on.


1.3. Plan of This Report 


Psycholinguistics is that one of the disciplines studying human communication
which is most directly concerned with the processes of decoding and encoding.
What are the major divisions within psycholinguistics itself? Mapping of this
area was one of the tasks of the seminar, but it was done in a casual manner and
appears as a spontaneous clustering of the research problems the participants
found significant. In other words, the organization of the field of psycholinguistics
followed here is one that the members of this seminar found fruitful.

Section 2 of this report provides brief and non-technical orientations to the
three approaches to language study, linguistics, learning theory, and information
theory, in which we were particularly interested. The members of the seminar
spent the first few weeks in such orientation as a means of providing themselves
with a more homogeneous background, and most readers of this report are
probably in the same position we were in, e.g., perhaps trained in linguistics but not
psychology and only remotely conversant with information theory, or possibly
familiar with both learning and information theory but entirely vague about
linguistics. During the course of this orientational work, discussion by the
seminar repeatedly devolved upon the problem of psycholinguistic units: the
need for clearly defined units in quantitative research, the relevance of available
linguistic units, and so on. Although we have been able to do no more than set up
the problem and suggest possible ways of attacking it, the prior importance of
this issue justifies a separate treatment, given in section 3.
The body of this report deals with theoretical analyses and suggested research
within specific areas. At the time of presentation of these research problems for
preliminary discussion by the seminar, it became clear that we could not organize
this field in terms of the three methodological approaches, linguistics,
information theory, and learning theory, since each problem seemed to require
combinations of techniques drawn from all three approaches. Rather, the various
problems suggested by members of the seminar seemed to fall quite naturally into
clusters based on similarity of content and underlying theory. During roughly the
last half of the seminar period, its members worked in overlapping groups of about
three or four people on such clusters of related problems, reporting back to the
seminar as a whole for general discussion. These work-group reports, as written
up by the chairman of each group, form the basis for the remainder of this
published report.

The organization of content in psycholinguistics developed by the seminar can 
perhaps best be seen by reference to Figure 4. The temporal dimension runs, as 
usual, from left to right. Brief sequences of time are indicated by the banded 
arrows. Periods A and B may refer to either two different stages within the
development of an individual speaker or two different stages in the development of
a language within a speech community. The upper half of the figure represents
diagrammatically the interacting levels of behavioral organization within the
individual; this is the special province of psychology and, more remotely, of the
other social sciences. The lower half of the figure represents the various levels or
bands of the message; this is the special province of linguistics and,
programmatically, kinesics (study of facial and bodily gestures) and, more
remotely, all disciplines concerned with media (content analysis, aesthetics, etc.).


Figure 4. Period A: cognitive states; anticipational and dispositional sets; linguistic and kinesic bands




The levels within the communicator are here labeled cognitive states, motive 
states, anticipational and dispositional states (or sets), and sensory and motor 
skills—these labels are intended to be suggestive, not limiting. Synchronic 
Psychology would deal with organization both within these levels and between 
them in decoding and encoding. The various synchronous bands which comprise 
messages are here labelled linguistic, kinesic, situational (e.g., manipulation of
significant objects, arrangement of the social or physical situation in which 
communication takes place) and ‘other’ (e.g., odor, warmth, touch, and other
modalities which may contribute to communication)—signals in any of these 
bands may be either naturally or arbitrarily coded. Synchronic Linguistics in the 
broad sense would deal with both organization within these bands (e.g., de- 
scriptive linguistics deals specifically with the structure of linguistically coded 
stimuli) and between these bands (e.g., relations between linguistic, kinesic 
codes and the like). Synchronic Psycholinguistics deals with relations between 
momentary psychological states of communicators and momentary states of 
messages. Since a large number of problems fall in this area, the seminar divided 
them into two groups: Synchronic Psycholinguistics I.: Microstructure (relations 
of phonemic units of messages to perceptual and motor discrimination in com- 
municators, for example) is discussed in section 4; Synchronic Psycholinguistics 
II.: Macrostructure (problems of meaning, of relations of language to thought 
and culture, for example) is discussed in section 7. This distinction between 
microstructure and macrostructure is probably not a happy one, but it seemed 
to serve our purposes.

Over short periods of time, at least, events at any psychological level are to 
some degree predictable from antecedent events at either the same level or other 
levels. Principles of association, for example, are concerned with the dependence 
of one cognitive state upon another. Similarly, enforced regularities in either 
input or output events (e.g., grammatical regularities) may give rise to sequential 
neural organization. Study of problems of this order could be called Sequential 
Psychology. On the message side, likewise, events at one point in time can be 
shown to be dependent to varying degrees upon antecedent message events— 
presumably such phenomena could be studied within kinesic or other bands as 
well as the linguistic. Such study could be called Sequential Linguistics. The
relations between transitional sequences in messages and transitional sequencing
mechanisms in the communicator are the field of Sequential Psycholinguistics, and
problems in this area are discussed in section 5.

When the psychologist deals with changes in organization, either through 
maturation or through learning, he makes comparisons between two stages of 
performance in time (e.g., pre-training and post-training) and this might well 
be termed Diachronic Psychology. The same term could be applied to differences 
in organization between two stages in culture, e.g., comparison of the habits or 
associations between S and R for two sets of individuals at two discrete times. 
When the linguist compares the structures of messages produced by members of 
the same speech community at two discrete periods in time, this is called
Diachronic Linguistics. In this report, we are extending the same term to cover the
linguist’s comparisons between the structures of messages produced by the same 
individual at two discrete periods in time—that is, to the study of first language 
learning, second language learning and bilingualism. Diachronic Psycholinguistics 
would be concerned with relations between the changing behavioral organiza- 
tions of either the individual or the group and the changing structures of mes- 
sages they produce, particularly the application of learning principles to these 
problems. This area is discussed in section 6. 


2. THREE APPROACHES TO LANGUAGE BEHAVIOR 


By way of orientation to the theoretical analyses and research proposals which 
form the body of the report, this section provides a brief summary of each of the 
three major approaches to language study under investigation, linguistics, learn- 
ing theory, and information theory. These introductions are aimed, as it were, 
at those in other disciplines—the linguistics summary is written for non-linguists, 
the learning theory summary for non-psychologists, and so forth. This means 
that the specialist in each area may find much to take exception to, much that 
he would present differently, and this is to be expected. These summaries are also 
intended to be non-controversial, but our need for conceptions adequate to 
handle psycholinguistic problems has undoubtedly influenced the emphases given, 
particularly in learning theory and linguistics sections. For the reader who wishes 
to go further into the details of each field, a limited annotated bibliography ac- 
companies each summary. 


2.1. The Linguistic Approach²

As distinct from psychology, which is concerned with verbal behavior in the 
context of events occurring within the organism, and from the other social sci- 
ences, which analyze the contents of verbal behavior insofar as it consists of 
shared cultural beliefs and actions (e.g., religion, philosophy, economic and
political norms), linguistic science has as its traditional subject matter the signal 
system as such. Its orientation tends to be social rather than individual, since 
the use of speech in communication presupposes a group of intercommunicating 
people, a speech community. In general, therefore, it has dealt with the speech of 
individuals merely as representative of the speech of a community. The interest 
in an individual's speech as such, his idiolect, in relation to his personality struc- 
ture constitutes a relatively new, marginal, and little explored area. The distinc- 
tion between language as a system and its actual employment has been variously 
phrased as langue vs. parole (de Saussure), syntactic vs. pragmatic (Morris) or 
code vs. message (information theory). However stated, it marks in general the 
boundary between what has traditionally been considered the province of lin- 
guistic science and what lies outside it. 


2.1.1. The Field of Linguistics 


The primary subject matter of the linguist is spoken language. Writing and 
other systems partly or wholly isomorphic with speech are viewed by most 
linguists as secondary systems. Speech has both ontogenetic and phylogenetic 
priority. There are even now peoples with spoken but not written languages
(so-called primitives), but the reverse situation has never obtained. Moreover,
written systems are relatively stable while spoken language, by and large, changes 
more rapidly. It is always the written language which must make the readapta- 
tion, when it is made, by way of a new orthography. The effect of, say, alphabetic 




2 Joseph H. Greenberg. 




writing on speech, in the form of spelling pronunciations, is a real but quite 
minor factor in the change of spoken language. The linguist views writing, then, 
as a derivative system whose symbols stand for units of the spoken language. 

Linguistic science is divided into two main branches, the descriptive and the 
historical. Historical interests presided at the inception of modern linguistic 
science (ca. 1800) and have predominated until fairly recently. Within the last 
few decades the focal point of linguistics has shifted to problems of description. 
These two chief areas of study complement each other. The degree of success of 
historical inquiry is largely dependent on the adequacy of descriptive data. On 
the other hand any particular stage of a language, while it can be completely 
described without reference to its past, can be more fully understood if the time 
axis is also taken into account. A cardinal and generally accepted methodological 
principle, however, is the clear distinction between synchronic and diachronic 
investigations. In particular, descriptive grammars were, and sometimes are, 
so replete with historical interpretations, that the locus in time of individual 
linguistic facts is obscured and observed phenomena are not distinguished from 
inferences, so that no clear picture of the structure of the language at any one 
time emerges. 

The aim of a scientific language description is to state as accurately, exhaus- 
tively, concisely, and elegantly as possible, the facts concerning a particular 
language at a particular time. It is assumed that the changes which are inevitably 
proceeding during the period in which the linguistic informant’s speech is being 
studied are negligible and can be safely disregarded. It is also assumed that the 
speech of the informant is an adequate sample of some speech community. This 
concept is applied rather vaguely to any group within which linguistic commu- 
nication takes place regularly. Minor cleavages within a group of mutually intelli- 
gible speech forms are called dialects. The maximal mutually intelligible group is 
a language community, as defined by scientific linguistics, but the term is often 
loosely applied on a political basis. Thus Norwegian is usually called a language 
although it is mutually intelligible with Danish, while Low German is considered 
a form of German, although objectively the difference between Low and High 
German is greater than that between Danish and Norwegian. The phrase ‘mutu- 
ally intelligible’ is itself vague. 

The speech of an informant is normally characteristic of that of a dialect 
community along with some idiosyncrasies. Language is so standardized an 
aspect of culture, particularly in regard to those structural aspects which are of 
chief concern to the linguist, that a very small number of informants usually 
proves to be adequate. If necessary, the linguist will even be satisfied with a
single informant in the belief that systematic divergences from the shared habits
of the community as a whole are likely to be of minimal significance. However,
the sampling problem must eventually be faced in a less makeshift manner. The
systematic mapping of speech differences on a geographic basis, through sampling 
at selected points, is known as linguistic geography and is a well-established sub- 
discipline of linguistics. Far more remains to be done with non-geographic factors 




of cleavage within the language community, on sex, occupational and class lines.
Such study is a prerequisite for adequate sampling. 


2.1.2. Units of Linguistic Analysis 


Linguistic description is carried out in terms of certain fundamental units 
which can be isolated by analytic procedures. The two key units are the phoneme 
and the morpheme, of which the phoneme has a somewhat more assured status. 
The phoneme is the unit of description of the sound system (phonology) of a 
language. Many widely differing definitions have been offered, some of which are 
objects of doctrinal differences between various linguistic ‘schools.’ Fortunately, 
the actual results in practice of the applications of these divergent approaches 
are surprisingly similar. 

The phoneme was foreshadowed by the pre-scientific invention of alphabetic 
writing. An adequate orthography of this kind disregards differences in sound 
which have no potential for the discrimination of meaning. Moreover, unlike 
syllabic writing, alphabetic writing selects the minimal unit capable of such 
differential contrast. The naive speaker is generally unaware of sound variations 
which do not carry this function of distinguishing different forms. For example, 
speakers of English have usually never noticed that the sound spelled t in ‘stop’
is unaspirated as contrasted with the aspirated t of ‘top.’ Yet this difference is
sufficient to differentiate forms in Chinese, Hindustani, and many other languages.
Phonemic theory is necessary because if we approach other languages naively 
we will only respond to those cues as different which are significant in our own 
language. On the other hand, we will attribute significance, and consider as 
indicative of separate elements, those differences which have a function in our 
own language, although they may not have such a function in the language we are 
describing. 


For example, in Algonquian languages distinctions of voicing are not significant. A 
naive observer with an English linguistic background will carefully mark all p’s as different 
from b's. The reaction of an Algonquian would be similar to that of an English speaker if 
he were presented with an orthography devised by a Hindu in which the t of ‘top’ was
represented by a different symbol from the t of ‘stop.’ The arbitrariness of such a procedure 
comes out when we realize that an untrained Frenchman would describe the sound system 
of a particular language in different terms than a naive Englishman or German. As a matter 
of fact, this has often occurred. Equally unsatisfactory results are obtained by a
phonetically trained observer, unaware of the phonemic principle, who indicates all
kinds of non-essential variants because his training permits him to distinguish them.
Here also there is a certain arbitrariness based on the particular phonetic training of
the observer. The logical outcome of such a phonetic approach would be to carry
discriminations even further by instrumental means, and the result would be that every
utterance of a language would be completely unique, for no two utterances of the
‘same’ sequence of phonemes are ever acoustically identical.


The procedure of the descriptive linguist, then, is a process of discovering the 
basic contrasts which are significant in a language. Since he cannot know a
priori which particular features of an utterance will prove to be significant, he
must be prepared to indicate them all at the beginning by a phonetic
transcription. Instrumental aids, though useful, are not essential to the
preliminary research. The linguist gradually eliminates those sound differences
from his transcription which prove to be non-significant so that the phonetic
transcription becomes a phonemic one. In doing this, he makes use of the two
principles of conditioned and non-conditioned variation. If the occurrence of one
or another of a
set of sounds may be predicted in terms of other sounds in the environment, this 
variation is said to be conditioned. If either of two sounds may be used for the 
other and still produce a meaningful utterance, the variation is called free, or 
non-conditioned. Such variant sounds grouped within the same phoneme are 
called allophones. In English, k̟, a front velar sound, is found before i, ɪ, e, ɛ and
other front vowels (e.g., the initial sound of ‘key’). A sound different enough to be
a separate phoneme in many languages, k̠, a back velar sound, is found before
u, ʊ, o, ɔ and other back vowels (e.g., the initial sound of ‘coat’). Since the
particular variant can be predicted by reference to the following vowel sound, k̟ and
k̠ are in conditioned allophonic variation and are members of the same English
/k/ phoneme.
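Conditioned variation of this kind can be stated as a simple rule: given the environment, the variant follows. The sketch below illustrates the English /k/ case under stated assumptions. The vowel symbols are assumed modern phonetic values for the front and back vowels mentioned in the text, and the function name and return labels are hypothetical conveniences.

```python
# Assumed symbols for the vowel classes; the sets are illustrative, not exhaustive.
FRONT_VOWELS = {"i", "ɪ", "e", "ɛ"}
BACK_VOWELS = {"u", "ʊ", "o", "ɔ"}

def k_allophone(following_vowel):
    """Predict the variant of English /k/ from the vowel that follows it."""
    if following_vowel in FRONT_VOWELS:
        return "front velar"
    if following_vowel in BACK_VOWELS:
        return "back velar"
    return "unspecified"  # environments not covered by this sketch

print(k_allophone("i"))  # as in 'key'
print(k_allophone("o"))  # as in 'coat'
```

Because the choice of variant is fully determined by the following vowel, a phonemic transcription need record only /k/; the difference between the two sounds carries no information of its own.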

The number of potential phones (sounds) in a language approaches infinity. 
The great virtue of the phonemic principle is that it enables the linguist to effect 
a powerful reduction from this complexity to a limited number of signals that 
constitute the code, and this represents a great economy in description. For 
languages so far investigated, the number of phonemes runs about 25 to 30 (the 
English system tending toward the higher figure). It is possible to effect a still 
greater economy in description. This is achieved by the analysis of phonemes 
into concurrent sets of distinctive features. Since the features which distinguish 
certain pairs of phonemes are found to be identical with the features which dis- 
tinguish certain other pairs, the number of entities necessary to describe the 
significant aspects of the sound matter is thus further reduced. For example, in 
English /p/ is distinguished from /b/, /t/ from /d/, /k/ from /g/, and /s/ from 
/z/ on the basis of the same feature, the former being unvoiced and the latter
voiced. Other distinctive features, such as tongue position or nasalization, pro- 
duce other sets of contrasts. By contrasting every phoneme in the language with 
every other phoneme, each phoneme comes to be uniquely identified in terms of 
the set of contrasts into which it enters, this ‘bundle of distinctive features’ being 
the definition of that phoneme. The distinctive oppositions that occur in languages 
studied so far run about 6 to 8. These are perhaps the minimal discriminanda in 
language codes. 
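
The economy gained by feature analysis can be checked with a little arithmetic. The sketch below assumes, purely for illustration, that every feature is a two-valued (yes/no) opposition, which is a stronger assumption than the text makes at this point; the function names are hypothetical.

```python
import math


def min_binary_features(n_phonemes: int) -> int:
    """Smallest number of yes/no features that can give each phoneme a distinct bundle."""
    return math.ceil(math.log2(n_phonemes))


def max_phonemes(n_features: int) -> int:
    """Largest inventory that n binary features could keep distinct."""
    return 2 ** n_features
```

Under that assumption an inventory of 25 to 30 phonemes needs at least five features, so the 6 to 8 oppositions actually observed are more than the theoretical minimum, since not every combination of features occurs in a given language.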

Analysis into distinctive features is a development within the past two decades, 
associated with the Prague School but not universally accepted. Jakobson and his 
associates (cf. 9, 11) go one step further still, by imposing upon the entire pho-
nemic material binary opposition as a consistent patterning principle, but this
needs much further exploration. Whereas American linguists usually say that 
sounds must be phonetically similar to be classed as members of the same pho- 
neme, members of the Prague School state that members of the same phoneme 
class must share the same set of distinctive features. These criteria will generally 
lead to the same classificatory structure. 




For example, k̟ and k̠ would be said by members of the Prague School to share the fol-
lowing features in common: velar articulation, non-nasality and lack of voicing. These
would be the relevant features shared by all varieties of the /k/ phoneme while, in this
instance, back or forward articulation is irrelevant. The /g/ phoneme shares velarity and
non-nasality with /k/ but not lack of voicing. The /ŋ/ phoneme (as in ‘sing’) shares velar
articulation but not non-nasality or lack of voicing. The /t/ phoneme shares non-nasality
and lack of voicing with /k/ but not velar articulation. Thus /k/ is uniquely determined 
by these three relevant features. Certain recent American analyses employ a methodology 
nearly identical with that just described. 
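
The claim that /k/ is uniquely determined by three features can be verified mechanically. The sketch below encodes only the four phonemes and three features named in the example; the dictionary layout and function names are assumptions made for illustration, and "ŋ" stands for the final sound of ‘sing’.

```python
# Feature bundles for four English phonemes, using only the three
# features discussed in the text (True = feature present).
FEATURES = {
    "k": {"velar": True, "nasal": False, "voiced": False},
    "g": {"velar": True, "nasal": False, "voiced": True},
    "ŋ": {"velar": True, "nasal": True, "voiced": True},
    "t": {"velar": False, "nasal": False, "voiced": False},
}


def phonemes_matching(bundle):
    """Return the phonemes whose feature bundle equals `bundle`."""
    return sorted(p for p, f in FEATURES.items() if f == bundle)


def differing_features(a, b):
    """Return the features on which phonemes a and b contrast."""
    return sorted(k for k in FEATURES[a] if FEATURES[a][k] != FEATURES[b][k])
```

Asking for the bundle velar, non-nasal, unvoiced returns /k/ alone, and each of the other three phonemes differs from /k/ in at least one feature, as the example states.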


Phonemes are sometimes distinguished as being either segmental or prosodic.
The former proceed in one dimensional time succession without gap. The latter 
are intermittent and necessarily simultaneous with segmental phonemes or suc- 
cessions of segmental phonemes. Examples of prosodic phonemes are phonemes 
of tone (sometimes called tonemes), stress, etc. In principle, we should sharply 
distinguish prosodic phonemes simultaneous with a single segmental phoneme 
from those which are distributed over a grammatically defined unit such as a 
phrase or sentence. The former can always be dispensed with in analysis, though 
they often prove convenient. For example, in a language with three vowel 
phonemes /a, i, u/ and two tone levels high /´/ and low /`/ we might analyze
/á/, /à/, /í/, /ì/, /ú/ and /ù/ as six separate segmental phonemes or we might
make /a/, /i/ and /u/ segmental and /´/ and /`/ prosodic. This particular
analysis has no doubt been largely determined by our traditional orthography 
which uses separate marks for pitch. The carrying through of this procedure to its 
logical conclusion is called componential analysis and results in the resolution of 
each phoneme into a set of simultaneous elements equivalent to the distinctive 
features mentioned above. The other type of prosodic element is illustrated by 
question or statement intonation in English. Unlike the elements just discussed, 
it cannot be dispensed with. 


Still another type of phoneme is the juncture or significant boundary, whose status is 
much disputed in contemporary linguistics. The conditioning factor for phonemic varia- 
tion is sometimes found to be the initial or final position in some grammatical unit such as 
a word, rather than a neighboring sound. For example, unreleased stops p, t, k are found 
in English in final morpheme or word position. Unless we indicate the boundary in some 
fashion we must nearly double the number of phonemes in English. Spaces, hyphens and 
other devices are employed to indicate the presence of these modifications. For example, 
the n of ‘syntax’ is shorter than the n in ‘sin-tax.’ Either we posit two different n phonemes
or we describe the longer n as n plus juncture, transcribing /sintaks/ and /sin-taks/ re-
spectively (or we deny the existence of the phenomenon altogether).3 The agreement as
to the boundaries of grammatical elements is almost never perfect, and some linguists 
assume that if such boundary modifications exist in some cases they must exist in all, even 
though they have not actually been observed to occur. 


3 Actually there is also a louder stress on the second syllable of ‘sin-tax’ and some would
maintain that it is merely the stress difference which is phonemic. Even if this is true for
English, the question arises in other languages.


In addition to the enumeration of phonemes and their allophonic variants, the
phonological section of a description usually contains a set of statements regarding
permitted and non-permitted sequences of phonemes, frequently in terms of
the structure of the syllable. In this as in other aspects of linguistic description 
it is not usual to give text or lexicon frequencies. Statements are limited to those 
of simple occurrence or non-occurrence. Only such quantifiers as some, none and 
all occur in most linguistic description. 

Corresponding to the minimal unit of phonology, the phoneme, we have a 
unit of somewhat less certain status, the morpheme, which is basic for grammatical 
description. Bloomfield (2) states as the fundamental assumption of linguistic 
science that in a given speech community some utterances show partial formal- 
semantic similarity. For example, in the English-speaking community the utter- 
ances ‘the dog is eating meat’ and ‘the dog is eating biscuits’ are partially similar 
in their sequence of phonemes and refer to partially similar situations. The 
linguist, through the analysis of these partial similarities, arrives at the division 
of utterances into meaningful parts. The analytical procedure as applied to 
individual utterances must eventually reach a point beyond which analysis be- 
comes arbitrary and futile. The minimum sequence of phonemes thus isolated, 
which has a meaning, is called a morpheme. The morpheme is a smaller unit 
than the word. Some words are monomorphemic, e.g., ‘house.’ Others are multi-
morphemic, e.g., ‘un-child-like.’ There is some uncertainty as to the point up
to which such divisions are justified and the rules of procedure may be stated in 
several alternate ways. Thus all would concur in analyzing ‘singing’ as having 
two morphemes ‘sing-’ and ‘-ing’ and there would likewise be general agreement 
that to analyze ‘chair’ as containing two morphemes, say ‘ch-’ meaning ‘wooden 
object’ and ‘-air’ meaning ‘something to sit on’ is not acceptable. But there is an 
intermediate area in which opinions differ. For example, ‘deceive’ contains two 
morphemes ‘de’ and ‘ceive’ according to some but not according to others. In
such borderline cases it becomes impossible to specify the meaning of each mor- 
pheme without some arbitrariness. 


2.1.3. Morphology and Syntax 


The work of the descriptive linguists in this area is not exhausted by the ana- 
lytic task just described. Having arrived at his units he must describe the rules 
according to which they are synthesized into words, phrases, and sentences. In 
somewhat parallel fashion to the situation in phonology, having isolated minimal 
units, he must describe their variation and their rules of combination. 

In regard to the first of these problems, it is not sufficient to consider each 
sequence of phonemes which differs either in form or meaning as a different
unit from every other. For example, the sequence ‘leaf’ /lijf/ is different in form
from ‘leav-’ of the plural ‘leaves’ /lijv-z/ but we cannot consider them as units
without relation to each other. We call /lijf/ and /lijv-/ morphs rather than
morphemes and consider them allomorphs of the same morpheme because: (1)
they are in complementary distribution, /lijv-/ occurring only with /-z/ of the
plural and /lijf/ under all other conditions; (2) they have the same meaning; 
(3) there are other sequences which do not vary in form and which have the same
type of distribution, e.g., ‘cliff’ for which we have /klif/ and /klif-s/. Such
variation in the phonemic composition of allomorphs of the same morpheme is 
called morphophonemic alternation, and systematic statements of such alterna- 
tions comprise the portion of grammar known as morphophonemics. Some alterna- 
tions occur in all instances in a language regardless of the particular morphemes 
in which the phonemes occur. Such alternations are called automatic. There are 
others which are unique. These are called irregular. Others are intermediate in 
that they apply to classes of morphemes of various sizes. In English, morphemes 
which have s, z and əz as variants exhibit automatic alternation, əz occurring
after sibilants (and affricates), s after unvoiced non-sibilants and z after voiced
non-sibilants. Thus the same rule applies both for the third person singular
present of the verb and the nominative plural. On the other hand, the variation
between /čajld/ ‘child’ and /čildr-/ of the plural ‘childr-en’ is a unique irregular-
ity. Psychologically, there would seem to be a real difference between these
extremes.
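
The automatic alternation among s, z and əz can be stated as a short rule on the final phoneme of the stem. The sketch below is illustrative only: the two phoneme sets are assumed and abbreviated, and the one-symbol-per-phoneme notation is a convenience rather than a complete transcription system.

```python
# Assumed phoneme classes, written in a rough one-symbol-per-phoneme notation.
SIBILANTS = {"s", "z", "š", "ž", "č", "ǰ"}  # includes the affricates
VOICELESS = {"p", "t", "k", "f", "θ"}  # unvoiced non-sibilants


def suffix_allomorph(final_phoneme: str) -> str:
    """Choose among the s / z / əz variants from the stem-final phoneme."""
    if final_phoneme in SIBILANTS:
        return "əz"
    if final_phoneme in VOICELESS:
        return "s"
    return "z"  # all remaining (voiced, non-sibilant) finals, vowels included
```

The same function serves for the noun plural and for the third person singular of the verb, which is what makes the alternation automatic. A unique irregularity such as ‘child’ and ‘childr-en’ cannot be derived from any such rule and would have to be listed separately.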

Having distinguished morphemic units, there remains the basic task of gram- 
matical description—the setting up of rules of permitted combinations of mor- 
phemes to form sentences. Generality of statement is here obviously a prime 
requirement. Languages vary widely in number of morphemes, from some hun- 
dreds to many thousands. Their possible sequences in constructions can only be 
stated in practice by the setting up of classes whose members have the same 
privilege of occurrence. In setting up such classes, modern linguistics character- 
istically uses a formal, rather than semantic approach. Classes of morphemes or 
classes of sequences of morphemes (word classes, phrase types, etc.) are defined 
in terms of mutual substitutability in a given frame. Any utterance and the 
morpheme or morpheme sequence within it, for which substitutions are made, 
defines a class. Thus, in English, among other criteria, substitution of single 
words for house in the frame ‘I see the house’ determines the class of nouns. This 
contrasts with the traditional a priori semantic approach according to which all 
languages have the same basic grammatical categories (actually based on Latin 
grammar) and a noun, for example is defined as the name of a person, place, or 
thing. Actually, formal criteria have always been used in grammars, although 
often tacitly. ‘Lightning’ is a noun in traditional English grammar also, although 
it names an event, because it functions in the same constructions as other nouns. 
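
Definition of a class by substitution in a frame can be sketched as a simple search over accepted utterances. The four utterances below are invented data, and the function name is hypothetical; a real analysis would rest on many frames and a large body of informant responses.

```python
# A toy set of utterances assumed to be accepted by speakers (invented data).
ACCEPTED = {
    ("I", "see", "the", "house"),
    ("I", "see", "the", "dog"),
    ("I", "see", "the", "lightning"),
    ("I", "eat", "the", "meat"),
}


def substitution_class(frame, slot):
    """Collect every word that fills position `slot` of `frame` in an accepted utterance."""
    members = set()
    for utterance in ACCEPTED:
        if len(utterance) != len(frame):
            continue
        # Every position other than the slot must match the frame exactly.
        if all(u == f for i, (u, f) in enumerate(zip(utterance, frame)) if i != slot):
            members.add(utterance[slot])
    return members
```

With the frame ‘I see the house’ and the last position as the slot, the class contains ‘house’, ‘dog’ and ‘lightning’. No appeal to meaning is made, so ‘lightning’ is admitted on the same formal grounds as the others.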

It is customary to regard sentences as the largest normalized units,5 and these
are successively decomposed into clauses, phrases, words, and morphemes. These
units constitute a hierarchy which is also reflected in the speech event by con-
figurational features, which, like the distinctive features of phonemic analysis, are
assumed to operate on a strictly binary, ‘yes-no’ basis. Configurational features
include such distinctions as those of pitch, stress, rhythm, and juncture, and
provide appropriate signals as to construction. The sentence is so complex a unit
that it cannot be described directly in terms of morpheme constructions. Rather,
the description is built up in layers. On any particular level, the combinations are
practically always accounted for in terms of immediate constituents. In the sen-
tence ‘unlikely events may actually occur,’ the morpheme un- and the morpheme
sequence -likely are the two immediate constituents which make up the word
unlikely. In turn, likely has as immediate binary constituents the morphemes
‘like-’ and ‘-ly.’ On a higher level unlikely enters as a whole in a construction with
events while events itself has event- and -s as immediate constituents.


4 This is too simple a formulation. Many problems arise at this point which cannot be
discussed here.

5 However, discourse analysis, being currently developed by Zellig S. Harris, carries
linguistic techniques beyond the boundary of the sentence, and Thomas A. Sebeok has
attempted to study the construction of sets of whole texts of folkloristic character in this
manner (16).
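
The layered analysis of ‘unlikely events’ can be represented as a nested structure in which every construction has exactly two immediate constituents. The sketch below encodes only the bracketings stated in the text; the representation by Python tuples and the function names are assumptions made for illustration.

```python
# Each node is either a morpheme (str) or a pair of immediate constituents (tuple).
# Only the bracketings stated in the text are encoded here.
UNLIKELY = ("un-", ("like-", "-ly"))
EVENTS = ("event-", "-s")
PHRASE = (UNLIKELY, EVENTS)  # 'unlikely events'


def morphemes(node):
    """Flatten a constituent tree into its morphemes, left to right."""
    if isinstance(node, str):
        return [node]
    left, right = node
    return morphemes(left) + morphemes(right)


def depth(node):
    """Number of layers of construction above the morpheme level."""
    if isinstance(node, str):
        return 0
    return 1 + max(depth(child) for child in node)
```

The phrase resolves into five morphemes but is built in three layers, which illustrates why the description proceeds level by level rather than directly in terms of morpheme sequences.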

It is usual to distinguish as primary divisions of grammar all constructions of
morphemes to form words as morphology and all constructions using words as 
units to form phrases, clauses, and sentences as syntax. Although no generally 
accepted definition of the word-unit exists, in fact very nearly every grammar 
written makes use of the word as a fundamental unit and describes morpho-
logical and syntactic constructions separately.6 In spite of traditional differences
of terminology in morphology and syntax, it is generally agreed that the same 
fundamental principles of analysis apply. 


2.1.4. Problem of Meaning in Linguistics 


Besides specifying meaningful units and their constructions, a complete 
linguistic description must state the meanings of these units and of the construc- 
tions into which they enter. The status of meaning has been a crucial point in 
contemporary linguistic theory. The statements of Bloomfield concerning 
meaning in his influential book (2) have sometimes been interpreted both by 
followers and opponents as indicating that the field of linguistic science only 
includes a logical syntax of language without reference to meanings. The defini- 
tion of meanings, on this view, rests with other sciences which deal with the sub- 
ject matters which speakers talk about. Thus, the definition of ‘moon’ is the 
business of the astronomer, not the linguist. The actual practice of linguists 
both here and in Europe, however, indicates that semantic problems are in fact 
dealt with and cannot well be excluded from scientific linguistics. 


Without entering into the exegetical problem of what Bloomfield meant, which is ir- 
relevant to the present purpose, it may be pointed out that Bloomfield coined the technical 
terms ‘sememe’ for the meaning of a morpheme and ‘episememe’ for the meaning of a con-
struction, both of which are current in American linguistics. Moreover, problems of histor-
ical meaning change are discussed at length in his book. This would imply that scientific
linguistics does not exclude semantics. It is evident that historical linguistics draws conclu-
sions regarding relationships by comparisons of cognates, that is, forms with both formal
and semantic resemblances, so that in this branch, at least, meanings must be dealt with.
It is likewise clear that the compiling of dictionaries has traditionally fallen within the 
linguist’s province and continues to do so. No linguist has ever written a grammar in which 
the forms cited were not accompanied by translations. 


6 For a discussion of the word as a unit, see section 3.3. of this report.


The linguist deals with meaning by the bilingual method of translation or the
unilingual method of paraphrase, that is, by the apparatus of traditional lexi-
cography. In keeping with the general orientation of linguistics as a social science,
the linguist defines the socially shared denotative meanings. Avoiding as far as
possible controversial issues in the domain of epistemology, it may perhaps be 
ventured that a distinction may be, and in practice is, drawn between definitions 
which embody our scientific knowledge about a thing and nominal definitions 
which are observed rules of use in a given speech community. The linguist prac- 
tices the latter type of definition. His methods up to now have been the more or 
less rough and ready methods of lexicography based on the traditional logical 
concepts of definition. The difficulties involved in the vagueness of actual usage 
of all linguistic terms in a speech community (if we exclude some scientific dis- 
course in a few societies) are in practice circumvented by the not altogether happy 
devices of translation and paraphrase, which, involving as they do, language in its 
everyday use, are equally as vague as the terms which are to be defined. Ambi- 
guity is dealt with by multiple listings of separate meanings based primarily on 
common-sense analysis. The boundary between the same form with synonymous
meanings and separate homonymous forms has never been clearly determined, 
since it has not been possible to specify how different meanings must be in order 
to justify treatment as homonyms. Nor, in this instance, does an approach in 
terms of purely formal differences in distribution prove more successful. 


2.1.5. Historical Linguistics 

Thus far all our consideration of linguistic topics has omitted the basic dimen- 
sion of change in time. This is the field of historical and of comparative linguistics 
which form a single sub-discipline. The investigation of the history of a specific 
language may be considered as a comparative study of its sequential synchronic 
states, while one result of comparing related, contemporaneous languages is a
reconstruction of their history. History and comparison are thus, for the most 
part, inseparable in practice, though a much less frequently employed non- 
historical comparative approach, the so-called ‘typological,’ will be considered 
below. 

It was the recognition of certain facts about language change that ushered in 
the modern scientific period in linguistics. The most fundamental of these were 
(a) the universality of language change, (b) the fact that changes in the same 
linguistic structure when they occur independently, as through geographical
isolation, always lead to different total end results, and finally (c) that certain of 
those changes, particularly in the area of phonology, show a high degree of 
regularity. The acceptance of these three principles (universality, differential
character, and regularity of language change) adds up to a historical and evolu-
tionary interpretation of language similarities and differences which contrasts with
the older notion based on the Babel-legend that, as with organic species, languages
were types fixed from the time of creation and only subject to haphazard, degener-
ative changes.

The second and third of these principles, those concerning the differential 
nature of independent changes and their regularity, in combination, lead to the 
concept of genetic relationship among languages. Whenever a language continues
to be spoken over a long period of time, weaknesses in communication through
migration, geographical and political barriers and other factors, result in a pattern 
of dialect cleavage as linguistic innovations starting in one part of the speech 
community fail habitually at certain points to diffuse to the remainder. As this 
continues, the dialects drift farther and farther apart until they become mutually 
unintelligible languages. However, they continue to show evidence of their com-
mon origin for a very long period. In fact, a number of successive series of cleav-
ages may occur within a period short enough for the whole set of events to be
inferred. For example, the Proto-Indo-European speech community was differ- 
entiated into a number of separate speech communities, one of which was the 
Proto-Italic. The Proto-Italic in turn split into the Latin, Venetic, Oscan, Um- 
brian and other separate language-communities in ancient Italy. One of these, 
Latin, survived, but it in turn developed into the various Romance languages, 
French, Italian, Spanish, etc. Sometimes, as in the case of Latin, the original 
speech from which the descendant forms branched off is attested from written
records. In other cases we legitimately assume that such a language must once 
have existed although no direct evidence is available. Such an inferred language 
is called a proto-language (‘Ursprache’). 
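
The successive cleavages just described form a tree, and genetic relationship between two languages corresponds to a shared ancestor in that tree. The sketch below records only the descents named in the text; the table and function names are assumptions made for illustration, and the sketch says nothing about how such relationships are established.

```python
# Child -> parent links, limited to the languages named in the text.
PARENT = {
    "Proto-Italic": "Proto-Indo-European",
    "Latin": "Proto-Italic",
    "Venetic": "Proto-Italic",
    "Oscan": "Proto-Italic",
    "Umbrian": "Proto-Italic",
    "French": "Latin",
    "Italian": "Latin",
    "Spanish": "Latin",
}


def lineage(language):
    """Return the language followed by its successive ancestors."""
    chain = [language]
    while chain[-1] in PARENT:
        chain.append(PARENT[chain[-1]])
    return chain


def common_ancestor(a, b):
    """Return the most recent language from which both a and b descend, if any."""
    ancestors_of_a = lineage(a)
    for candidate in lineage(b):
        if candidate in ancestors_of_a:
            return candidate
    return None
```

French and Spanish meet at Latin, which is attested in written records, whereas French and Oscan meet only at Proto-Italic, an inferred proto-language.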

Because of the regular nature of much linguistic change, it is possible under 
favorable circumstances to reconstruct much of the actual content of such extinct 
languages. In particular, the reconstruction of the ancestral language of the 
Indo-European family has been a highly successful enterprise which has occupied 
a major proportion of the interest of linguists up to the present day. Thus far, 
linguistic relationships are well-established only in certain portions of the world 
and reconstruction has been carried out for only a limited number of linguistic 
families, particularly Indo-European, Uralic, Semitic, Bantu, Malayo-Polynesian, 
and Algonquian. Reconstruction is most successful, probably, in phonology, some-
what less so in grammar, and least of all in semantics. Forms which resemble each
other in related languages because of common origin from a single ancestral form 
are called cognates, e.g., English foot and German Fuss. The history of such a 
particular cognate is called its etymology and it has both a phonological and 
semantic aspect. 


The difficulties of semantic reconstruction may be appreciated from the following 
artificial example which illustrates, however, the real difficulties often encountered. If
in three related languages, a cognate form means ‘day’ in A, ‘sun’ in B, and ‘light’ in C, 
here are some of the possibilities among which it is impossible to make a rational choice. 
(1) The original meaning was ‘day’ which remained in A, shifted to ‘sun’ in B and to ‘light’ 
in C. (2) The original meaning was ‘sun’ which shifted to ‘day’ in A, remained in B and 
shifted to ‘light’ in C. (3) The original meaning included both ‘sun’ and ‘day.’ It narrowed 
to ‘day’ in A, to ‘sun’ in B, while in C it narrowed to ‘sun’ and then shifted to ‘light.’ These
and others are all possible, and in the present stage of our knowledge, about equally plau- 
sible. On the other hand, various Indo-European languages do have cognates all of which 
mean approximately ‘horse,’ which can therefore be safely reconstructed for the parent 
language. 


The changes undergone by languages whether documented or inferred can be 
classified under various universally applicable processes such as sound change,
borrowing and analogy. Such processes show a high degree of specific similarity.
To cite an example from phonology, au has become o in many different languages 
independently. Similar highly specific parallel changes occur in grammar and 
semantics. In spite of this, our second postulate of differential change shows that 
there are always a number of possible changes from a given state and our knowl- 
edge is not yet sufficient to predict which one will ensue or indeed whether the 
system will change or remain stable in some particular aspect. Parallel changes 
within related languages, called ‘drift’ by Sapir, are probably especially frequent
and presumably strongly conditioned by internal linguistic factors.7


2.1.6. Typological Comparison


The ascertaining of historic relationships and the reconstruction of processes 
of change is not the only possible motive for the comparison of languages. We can 
examine the languages of the world, comparing both related and unrelated ones, 
in order to discover language universals, the greater than chance occurrence of 
certain traits, and the significant tendencies of traits to cluster in the same lan- 
guages. The isolation of such clusters leads to the setting up of criteria for classi- 
fying language types. The classical nineteenth century typologies rested primarily 
on considerations of the morphological structure of the word. Because of the 
relatively unadvanced state of descriptive theory, they suffered from lack of precise
definitions for the units employed and were, moreover, tied to an ethnocentric
outmoded type of evolutionism. Recently text ratios of more rigidly defined units 
have been employed in order to construct a more refined typology. 

The problems of typology are of intimate concern to psycholinguistics. The 
universal or more than chance occurrence of certain traits is in need of correlation 
with our psychological knowledge. More data on languages in many parts of the 
world and some effort at cross-linguistic cataloguing are probably necessary 
prerequisites for any considerable advance in this area. One paper growing out of 
The Symposium on American Indian Linguistics at the 1951 Linguistic Institute 
(Voegelin’s 17a) and two papers published subsequent to the Conference on 
Archiving at the 1953 Linguistic Institute (Allen’s and Wells’ 1a, 18) are con- 
cerned primarily with typology. 


7 The various processes of linguistic change are discussed in detail in section 6.


2.1.7. Bibliography


1a. Allen, W. S., 1954, Statements for the conference on archiving. International Journal
of American Linguistics 20. 83-84.
1b. Bloch, B. and G. L. Trager, 1942, Outline of linguistic analysis, Linguistic Society of
America, Baltimore.
A concise summary of linguistic techniques for the general reader. An excellent,
if somewhat outmoded, introduction.
2. Bloomfield, L., 1933, Language, New York.
The most important technical introduction to linguistic science as a whole; a
classic of the field.
3. Carroll, J. B., 1953, The study of language, Cambridge.
A survey of linguistics and related disciplines, including chapters on linguistics,
psychology, communication engineering, and the study of speech.
4. Fries, C. C., 1952, The structure of English, New York.
A formal analysis of English syntax presented in readable form. Useful presenta-
tion dealing with a language familiar to the reader.
5. Greenberg, J. H., 1953, Historical linguistics and unwritten languages, in Anthropology
today, 265-86, Chicago.
Deals with the establishment of linguistic relationships and historical reconstruc-
tion.
6. Hall, R. A., Jr., 1950, Leave your language alone!, Ithaca.
A good popularization of the fundamentals of linguistic science; recommended as
a starting point for the novice in this field.
7. Harris, Z. S., 1951, Methods in structural linguistics, Chicago.
The procedures of descriptive linguistics are presented from an extremely for-
malistic point of view.
8. Hoijer, Harry, and others, 1946, Linguistic structures of native America, Viking Fund
Publications in Anthropology 6. New York.
Descriptive sketches of diverse American Indian languages illustrate a variety of
linguistic techniques and structures.
9. Jakobson, R., C. G. Fant, and M. Halle, 1952, Preliminaries to speech analysis:
the distinctive features and their correlates, Acoustic Laboratory, Massachusetts In-
stitute of Technology, Technical report No. 18, Cambridge.
The binary principle is employed to present phonemic solutions of phonetic data.
10. Joos, M., 1948, Acoustic phonetics, Language Monograph, No. 23, Baltimore. 
This contribution indicates results and possibilities of instruments in linguistics. 
11. Lotz, J., 1950, Speech and language, Journal of the Acoustical Society of America 22. 
712-17. 
An outline of the principal theoretical problems of modern linguistics. 
12. Lounsbury, F. G., 1953, Field method and techniques of linguistics, in Anthropology 
today, 401-16, Chicago. 
Review of the various ways in which linguists gather their data. 
13. Martinet, A., 1953, Structural linguistics, in Anthropology today, 574-86, Chicago. 
The three principal schools of structural linguistics are compared and evaluated. 
14. Nida, E. A., 1949, Morphology, 2nd ed., University of Michigan Publications in Linguis-
tics, Ann Arbor.
A presentation of methods for identification of morphemes with useful sections
of problems to be worked on by reader.
15. Pike, K. L., 1947, Phonemics, University of Michigan Publications in Linguistics, 
Ann Arbor. 
Techniques for phonemic analysis with examples from widely different languages.
16. Sebeok, T. A., and Frances J. Ingemann, Structural analysis and content analysis 
in folklore research, in Studies in Cheremis, Vol. 2: The Supernatural, Part Two. 
Viking Fund Publications in Anthropology, forthcoming.
Some techniques of psycholinguistics are applied to collections of texts of folkloris-
tic character.
17a. Voegelin, C. F., 1954, Inductively arrived at models for cross-genetic comparisons
of American Indian languages, University of California Publications in Linguistics
. 27-45.
17b. Voegelin, C. F. and Z. S. Harris, 1947, The scope of linguistics, American Anthropolo-
gist 49. 588-600.
The data and techniques of linguistics and cultural anthropology are compared 
and contrasted, and trends in linguistics sketched.
18. Wells, R. S., 1954, Archiving and language typology, International Journal of Ameri- 
can Linguistics 20. 101-107. 




2.2. The Learning Theory Approach 


Language is perhaps the most complex behavior displayed by the human or- 
ganism, and, in the main, it is learned behavior. It is understandable, therefore, 
that linguists should find learning theory of special interest. Although linguists 
have for many years refrained from ‘psychologizing’ within their science, it now 
appears that more interaction between psychologists and linguists would be 
fruitful. Even while Bloomfield was espousing the separation of the fields, he 
felt it desirable from time to time to deal with linguistic matters in the framework 
of early psychological behaviorism (chiefly as structured by A. P. Weiss). Fortu- 
nately, this has been an aspect of psychology which has seen tremendous develop- 
ment in the last 20 years. At the present time, probably more experimental work 
is being done in learning than in any other psychological field. This section of the 
report attempts to do two things: first, to present some of the major phenomena 
of learning and, second, to discuss briefly some of the major theories of learning 
or ways of organizing and explaining the phenomena. 


2.2.1. Phenomena of Learning 


In order to discuss the phenomena of learning most meaningfully and fruitfully, 
two paradigms will be presented and discussed to reveal the major variables which 
affect the learning process. These models are phenotypes, and it may be argued
that they are in some ultimate sense different kinds of learning or it may be
argued that they are explicable under one system. This is not our concern here.
It is sufficient for our purposes that they act as convenient vehicles for illustration
and discussion. 

2.2.1.1. Classical conditioning. The first model is taken directly from the famous 
work of the Russian physiologist Pavlov. It is diagrammatically represented in 
Figure 5. In its simplest form this learning proceeds as follows: A given stimulus 
(the unconditioned stimulus or US) is found to be followed by a characteristic 
response (R1); another stimulus (the conditioned stimulus or CS) is inadequate
with respect to eliciting R1 but may be followed by some other response (R2)
irrelevant to the experiment; a long series of trials is given in which the US is
always preceded by the CS; finally, it is noted that the CS alone elicits some of
the response characteristics which normally would occur only after the US; at
this point we say that learning has occurred: an initially neutral stimulus now
has acquired the ability to elicit a response which originally occurred only in
the presence of another stimulus. Suppose we have a dog in our laboratory. 




5 James J. Jenkins. 




We know that he salivates when we place meat powder in his mouth. (This is 
the US [meat powder] → R1 [salivation] connection.) We decide to condition the
response of salivation to the stimulus of ringing a bell. We note before
experimentation that ringing the bell (CS) results in extraneous responses (R2) (moving the
head, pricking up ears, etc.), but not salivation. In the training series we ring
the bell and, while it is still ringing, place meat powder in the dog's mouth,
eliciting salivation. After, say, 100 trials, we ring the bell and note that the dog
salivates without the stimulus of the meat powder. The bell (CS) now elicits the
response (R1) (or part of the response) elicited by the meat powder. The
conditioning is completed.


To illustrate the manner in which different factors influence this learning process, we 
may now begin to alter the situation by changing first one and then another variable. 

The first thing we might notice is that the number of trials pairing the bell and the meat 
powder is important. If we only pair the stimuli once, we may not detect any effect. If we 
have more trials, we see a slight effect. With a great number of trials we get maximum
response (most like the original response). Our first important variable, then, is frequency. 
The number of times the experimental situation occurs is important. 

Secondly, we might experiment with the temporal relation between the presentation of 
the bell and the meat powder. If the meat powder precedes the bell-ringing, we discover 
(perhaps to our surprise) that little or no learning takes place even after a long series of 
trials. There is practically no backward conditioning. Further experimentation shows that 
simultaneous presentation of the CS and US is “learnable” but not optimal. Maximum 
conditioning occurs when the bell begins to ring about half a second before the meat powder 
is presented. We find that the onset of the bell can be moved further and further ahead of 
the presentation of the meat powder. For example, we might have the bell ring for 30 seconds 
before the US; with enough training we will find that the dog salivates just 30 seconds after 
the onset of the bell. Such learning is called delayed conditioning. We may even let the bell 
ring for a few seconds, stop it, wait for 20 seconds, and then present the US—conditioning 
requires still more trials, but is attainable. This is called trace conditioning. If we were
very persevering, we might even set up temporal conditioning, in which the dog is fed 
periodically on a short time cycle and the CS is the time lapse itself. Our conditioned dog 
would salivate periodically like a short term alarm clock. 

A third discovery might occur accidentally: when a buzzer was inadvertently pressed, 
or a glass fell off a shelf or some other noise intruded in the experimental situation, the 
conditioned animal started salivating. Exploring this systematically, we would find whole 
classes of stimuli which could be substituted with greater or less success for the original 
CS, the bell. This is called stimulus generalization. In the main, the more alike two stimuli 
are, the greater is the likelihood that they will function for each other. If we originally 
condition to a tone, say middle C, we find that, as we move away from C up or down the 
scale, the conditioned response decreases. In our example we would expect the most saliva- 
tion to C itself, the next most to B and D, less to A and E, etc. 
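
The gradient just described can be sketched numerically. The following is only an illustration: the exponential fall-off, the decay constant, and the names `TONES` and `generalized_response` are assumptions introduced here. The text states only that the conditioned response decreases as the test tone moves away from the training tone.

```python
import math

# Hypothetical scale positions, counted in steps away from middle C.
TONES = {"A": -2, "B": -1, "C": 0, "D": 1, "E": 2}


def generalized_response(test_tone, trained_tone="C", peak=1.0, decay=0.7):
    """Relative strength of the conditioned response to test_tone.

    Assumes, for illustration only, an exponential fall-off with distance
    from the trained tone; peak and decay are arbitrary values.
    """
    distance = abs(TONES[test_tone] - TONES[trained_tone])
    return peak * math.exp(-decay * distance)


for tone in "ABCDE":
    print(tone, round(generalized_response(tone), 3))
```

Any monotonically decreasing function of distance would serve equally well here; the exponential is chosen only because it is simple.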

If we proceed with the generalization experiments, but always pair the US with tone C 
and never with tone A, we can discover a related phenomenon. Soon tone A loses its power 
to elicit the conditioned response and it is said that stimulus discrimination has taken place. 
In effect, we have systematically cut down the gradient of stimulus generalization. By this
technique we can discover the limits of discrimination of which the organism is capable. 

This ‘damping out’ of a response suggests still another question. What happens in general
when the CS is no longer followed by the US? If we tried this in our example, ringing the
bell repeatedly but never following it with the meat powder, we would note that the
magnitude of the response decreased over successive trials and finally disappeared altogether.
The response is said to have been extinguished. At this point we might naively assume 




that our experiment was at an end and the dog was now unconditioned, but if we happened 
on some subsequent day to bring the dog into the laboratory and again rang the bell, we 
would find that the conditioned response was still observable, reduced in magnitude but 
still there. If we extinguished the response again (in fewer trials this time), let a day elapse 
and again brought the dog into the laboratory, we would still find some residual of the con- 
ditioned response. This apparently mysterious revitalization of an extinguished response 
is called spontaneous recovery and indicates a need to postulate something other than 
‘forgetting’ to account for the decline in responses which we observed in the extinction 
trials. Most psychologists prefer to treat this as a case of ‘learning-not-to-respond’ or
inhibition.
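
One way to see why inhibition rather than forgetting accounts for spontaneous recovery is a toy simulation. The sketch below is not a model proposed in the text. It assumes that reinforced trials build an excitatory strength, that unreinforced trials build a separate inhibitory strength, that the observed response is their difference, and that part of the inhibition dissipates during rest. The update rule, the rates, and all function names are hypothetical.

```python
def run_session(excitation, inhibition, trials, reinforced, rate=0.1):
    """Return updated (excitation, inhibition) after a block of trials."""
    for _ in range(trials):
        if reinforced:
            # CS followed by US: excitation grows toward its ceiling of 1.0.
            excitation += rate * (1.0 - excitation)
        else:
            # CS alone: inhibition grows toward the current excitation.
            inhibition += rate * (excitation - inhibition)
    return excitation, inhibition


def response(excitation, inhibition):
    """Observed response strength: excitation less inhibition, never negative."""
    return max(0.0, excitation - inhibition)


def rest(inhibition, dissipation=0.5):
    """A day's rest: assume a fixed fraction of the inhibition dissipates."""
    return inhibition * (1.0 - dissipation)


e, i = run_session(0.0, 0.0, trials=100, reinforced=True)
print("after training:  ", round(response(e, i), 3))
e, i = run_session(e, i, trials=50, reinforced=False)
print("after extinction:", round(response(e, i), 3))
i = rest(i)
print("next day:        ", round(response(e, i), 3))
```

Because the excitatory strength is left untouched by extinction in this sketch, the response reappears after rest at reduced magnitude, which is the pattern described above.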

In this model we may measure learning in a variety of ways. We may take 
the occurrence of a response or the frequency of occurrence as an index of learn- 
ing. Alternatively, we may measure the amplitude or magnitude of the response, 
or the resistance of a response to extinction as other indices of strength. Depend- 
ing on the response in question, one measure may be more appropriate than 
another. It should be noted, however, that in many special cases the indices may 
not be in perfect agreement and it may be important to specify exactly what 
aspect of behavior is being considered. 


[Diagram not reproduced: a stimulus situation (lever, etc., and internal stimuli) leads to several alternative responses, one of which (R3) is followed by reward.]

Figure 6


2.2.1.2. Instrumental learning. The second model considered here is markedly 
different (at least phenotypically) from the first. It has been called variously 
‘trial and error’ learning, ‘operant,’ and ‘instrumental’ conditioning and descends 
most directly perhaps from the work of Thorndike. While instrumental condi- 
tioning does not readily lend itself to neat diagramming, it can perhaps be roughly 
portrayed as in Figure 6. In this learning model the organism is placed in a situa- 
tion in which a variety of responses can be made. The organism is usually assumed 
to be motivated, that is, some state of need (hunger, thirst, etc.) is presumed to 
exist on the basis of prior knowledge (hours of food deprivation, water
deprivation, etc.). A ‘correct’ response is followed by reward which is appropriate to the
need state of the organism. The probability of the re-occurrence of the rewarded
response increases with each rewarded trial up to some limit. The response is said
to be learned when it occurs with high probability.

When contrasted with classical conditioning, several different features stand
out. The first difference is that the response is emitted, not elicited by an
unconditioned stimulus. This is not to say that responses take place without reference
to the stimuli present but rather that we cannot specify the configurations of
stimuli which lead to the various complex responses. A second, and presumably
important, difference is that the response is instrumental in obtaining reward. If


the correct response does not occur, the organism is not rewarded. This model 
seems to entail more ‘active’ learning than classical conditioning. It goes without 
saying, of course, that the correct response must be in the behavior repertory of 
the organism prior to conditioning. Finally, motivation (‘drive’) and reward 
(‘reinforcement’) are much more prominent in this model than in the first one. 
If the organism is well-fed and comfortable, he is more likely to go to sleep than 
to learn to solve the experimental problem. 

A simple example of this model of learning is one made famous by Skinner. Let 
us assume that we have a simple box with a small lever in one end of it. The ap- 
paratus may be so arranged as to cause a pellet of food to drop into the box when 
the lever is pressed. If we now put a rat which has not been fed for 24 hours into 
the box we can observe marked changes in his behavior. At first the rat runs 
around the box, sniffs in the corners, washes his face, rears on his hind legs, 
scratches at the walls, etc. Sooner or later the rat ‘accidentally’ pushes the lever 
and a pellet is discharged into the box. After an interval the rat discovers and eats 
the pellet. Sometime thereafter he may blunder onto the lever again. If we chart 
the rat’s behavior we discover that after scattered lever presses the time between 
presses gets much shorter. In an hour or so we may find the rat industriously 
pressing the lever, eating, pressing the lever, eating, etc., with great speed and 
regularity. We say that the instrumental response of lever pressing has been 
learned. 


If we manipulate this situation as we did the first one, we find many of the same variables 
controlling the modification of behavior. Again we might notice that frequency is important.
Here, however, it is not the frequency of pairing of CS and US but rather the frequency of 
response and reward occurring. Similarly, we would find contiguity or time relations to 
be important, but it is response-reward contiguity. We would discover again that order is 
important, the response must precede the reward, and that the longer the interval between 
the response and the reward, the less learning takes place. Thus, the responses which occur 
immediately prior to the reward (whether they are relevant or irrelevant) will be strength- 
ened more than those which were considerably in advance of the reward. (To take a different 
case, this explains why a maze is learned from the back to the front, errors being reduced 
in the vicinity of the goal box before being reduced in the middle of the maze, etc.) In this 
model, too, we may demonstrate extinction (if we cease giving pellets for lever pressing)
and spontaneous recovery. If we alter our situation to include a new stimulus (say a small 
light over the lever), we can train the rat to respond only when the light is on and thus 
introduce the phenomena of discrimination and generalization discussed above. 

Other variables not noted in the first model are more clearly revealed in the instrumental 
situation. It becomes apparent, for example, that the reward functions optimally when it 
is relevant to the need. Giving water to a hungry rat or food to a thirsty rat does not result 
in rapid learning if any learning at all. The amount of reinforcement likewise plays a role. 
All other things being equal, the speed of learning tends to increase as the amount of reward 
is increased. One of the most important phenomena which may be disclosed and studied 
most clearly and easily in this second model is secondary reinforcement. This is the name 
given to the reinforcing power which a neutral stimulus may acquire by virtue of having 
been associated with primary reinforcement. If we put a rat into the experimental box 
without a lever present, we may train him to approach the food box and eat every time
the food mechanism clicks. Then we may introduce a lever which will produce the click
when pressed, but empty the mechanism so that it provides no food. In this case in spite of


the fact that lever pressing is never rewarded by food, the rat will learn to press the lever 


and will make a good many responses before extinguishing. We must conclude here that the 
click of the mechanism itself has acquired reinforcing power. In more dramatic and pub- 
licized experiments it has been demonstrated that chimpanzees will work for, collect, and 
hoard poker chips after experience with the chimp-o-mat—a slot machine arrangement 
in which the poker chips may be traded for food. It seems likely that most human learning 
is obtained under conditions of secondary reinforcement (money, praise, smiles, approval, 
etc.), and it seems especially likely that secondary reinforcement plays an important role 
in language learning. 


In this model, as before, we may measure the extent of learning by frequency 
of response, amplitude, latency, or resistance to extinction. In certain cases we 
may also be interested in error scores. A word of caution is necessary here, how- 
ever. Because these situations differ from each other and from the situation in 
the first model, measures having the same names may require different interpre- 
tations. For example, in classical conditioning amplitude might be a positive 
function of learning (e.g., drops of saliva) while in instrumental conditioning it 
might be a negative function (e.g., as lever pressing becomes more skilled, it may 
be executed with less force). 

In a rather great oversimplification we might generalize that the first model 
presents a picture of the conditioning to an arbitrary stimulus of a highly specific, 
elicitable response, while the second model describes the differentiation and
discrimination of a response out of a mass of behavior emitted in response to a
complex stimulus field. The first model stresses time and stimulus controls and 
the second model stresses the role of motivation and reward. It should be remem- 
bered, however, that the models are not independent and the phenomena ob- 
served in one are observable (with more or less effort) in the other. 

2.2.1.3. Some additional descriptive statements. While the models given serve 
excellently as pedagogical devices, they do not, of course, do justice to the wide 
areas in which learning studies have been carried on. The development of many 
complex human skills (typewriting, sending and receiving codes, memorizing 
verbal material—both meaningful and nonsense, etc.) has been carefully studied 
and described under a staggering variety of conditions. A few of the many find- 
ings which may be of relevance to us can be briefly described. They are explicable 
in terms of the phenomena described above, but may be of special interest as 
molar phenomena themselves. 

A good deal of attention has been devoted by psychologists to learning curves
(more properly, performance curves). In general, such curves are negatively
accelerated, that is, large gains are made initially, then smaller and smaller gains
until no appreciable improvement is noted. It is likely that most tasks, however,
are approached with considerable residues of skill and experience. For a few tasks
‘S’ curves may be noted, positively accelerated then negatively accelerated. Such
tasks appear to be those in which subjects have had little experience (e.g., tight
rope walking, juggling, etc.). Perhaps the ‘S’ curves are the ‘true’ performance
curves and the ‘typical’ negatively accelerated curves are only those portions
which we see because our subjects already have considerable response repertories
available to them.
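
The suggestion that the typical curve is only the later portion of an ‘S’ curve can be illustrated with a short sketch. The logistic form, its parameters, and the names `s_curve` and `gains` are assumptions made for the example; the text does not specify any equation for performance curves.

```python
import math


def s_curve(trial, midpoint=20.0, steepness=0.25):
    """Logistic performance curve, an assumed form for the 'S' curve."""
    return 1.0 / (1.0 + math.exp(-steepness * (trial - midpoint)))


def gains(start_trial, n_trials):
    """Trial-to-trial improvement for a subject who enters at start_trial."""
    return [s_curve(t + 1) - s_curve(t)
            for t in range(start_trial, start_trial + n_trials)]


novice = gains(0, 40)        # no prior experience: sees the whole curve
experienced = gains(20, 20)  # prior experience: enters at the midpoint

print("novice gains, early / middle / late:",
      round(novice[0], 4), round(novice[19], 4), round(novice[39], 4))
print("experienced gains, first / last:",
      round(experienced[0], 4), round(experienced[-1], 4))
```

The novice's gains first grow and then shrink, while the experienced subject, observed over the same underlying curve, shows only shrinking gains, that is, a negatively accelerated curve.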




Case                    First Learning    Second Learning    Test
Case 1 (Divergent)      S1 - R1           S1 - R2            S1 - R1
Case 2 (Convergent)     S1 - R1           S2 - R1            S1 - R1
Case 3 (Unrelated)      S1 - R1           S2 - R2            S1 - R1
Figure 7


Work with skill sequences has revealed both in the lower animals and in man 
an extraordinary capacity for eliminating waste motion and executing a highly 
polished and tremendously rapid series of responses. A brief consideration of the 
movements made by a skilled typist or piano player is sufficient to demonstrate 
the high degree of integration of elements into a smooth series which may be 
attained. It may be further shown that these are actual integrations, not merely 
a rapid sequencing of discrete responses. While the beginner types ‘t-h-e,’ the 
skilled typist writes ‘the’ or larger units such as ‘the next meeting’ with such 
speed that she could not be aware (by virtue of the time lag in the nervous
system) that the ‘t’ had printed before the ‘e’ had been struck. This kind of
short-circuiting, grouping and executing of serial responses plays an important role in
all frequently executed response chains, including language. 

Finally, a considerable body of research has been devoted to questions concern- 
ing the effects of prior training on subsequent training and vice versa—the prob- 
lems of facilitation and interference. In general, we may consider here three cases 
as illustrated in Figure 7. 

In the first case we observe a divergent structure. To one stimulus, two (or more) 
responses must be made. If the responses are highly similar, there will be little
interference in the second learning and in the test, but if they are quite different
(antagonistic) maximum interference will result in both places. In the second case
a convergent structure is found. Here the response will be facilitated in both the
second learning and the test situation, and the amount of facilitation will be a 
function of the similarity of the stimuli. In the third case, we can expect little 
interference beyond that contributed by any interposed activity and little facili- 
tation beyond adaptation to the experimental setting. In general, what is being 
said here is: making different responses to the same stimulus is more difficult than 
making the same response to different stimuli. The first situation gives rise to 
conflicting response tendencies and demands more information about the occa- 
sion, while the second situation broadens the occasion for the use of a single 
response and, hence, requires less information.
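
The three cases can be restated as a small classification rule. The sketch below simply encodes the definitions given above; the function name `transfer_case`, the tuple representation, and the extra ‘repetition’ label for identical pairs are conveniences introduced here, not part of the original discussion.

```python
def transfer_case(first, second):
    """Classify a two-stage design; each stage is a (stimulus, response) pair."""
    (s1, r1), (s2, r2) = first, second
    if s1 == s2 and r1 != r2:
        return "divergent"   # same stimulus, new response: interference expected
    if s1 != s2 and r1 == r2:
        return "convergent"  # new stimulus, same response: facilitation expected
    if s1 != s2 and r1 != r2:
        return "unrelated"   # little interference or facilitation expected
    return "repetition"      # identical pairs: further practice, not one of the three cases


print(transfer_case(("S1", "R1"), ("S1", "R2")))
print(transfer_case(("S1", "R1"), ("S2", "R1")))
print(transfer_case(("S1", "R1"), ("S2", "R2")))
```

The rule treats stimuli and responses as simply same or different. The graded effects of similarity discussed above would require a similarity measure, which is left out of this sketch.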


2.2.2. Learning Theories 

Since ‘theory’ is a somewhat ambiguous word, it seems advisable to outline 
briefly the conceptual framework which the seminar utilized in its discussion of 
learning theory. 




2.2.2.1. General nature of psychological theories. A fully developed scientific
theory contains three distinguishable levels. Level I contains the relatively raw
‘immediately apprehended’ sense data (e.g., the speech sounds, the observations
of a dial reading, the perceived movements of a rat). All sciences contain this
level, but they differ in their selection of events. Level II contains the concepts
which are the special concerns of the science (e.g., the stimulus, the phoneme,
energy) and laws which are summaries of their observed relations or hypotheses
predicting relations not yet observed. Such concepts are meaningful only if they
are unambiguously related directly or indirectly to Level I events. Such a relation
is equivalent to an operational definition. Concepts which are not operationally
defined, and systems containing many such concepts, are called meaningless. The
criterion of meaningfulness is related to that of testability since only meaningful
concepts can be used in stating testable hypotheses and laws. Level III contains
a formal mathematical or logical system. All concepts on this hypothetical level
are purely formal or logical in contrast to those of Level II, which are ‘descriptive’
of Level I events. Level III ordinarily consists of statements defining the elements
in the hypothetical system, statements defining operations and relations in terms
of the elements, and statements of rules of inference to be used in deriving the 
theorems of the system. The theorems may be regarded as the logical results of 
the assumed relations in the postulate set. The interpretation of this formal system 
consists of placing its entities and relations into correspondence with the concepts 
of Level II. Thus, a theorem on Level III leads directly to an hypothesis on Level 
II by means of translation of terms indicated by the interpretation. In turn, the 
laws or hypotheses of Level II are summaries of observed or predicted Level I 
events. Because of these relations between the levels of a theory, the formal sys- 
tem of Level III is said to explain the laws of Level II which, in turn, explain the 
events of Level I. 

A scientist is free to select or develop any mathematical or logical system which
he desires to use. The utility of his choice is then determined by examining the
correspondence between his model or system and the concepts or empirical data
which he observes. Ordinarily, it is desired that the experimental model be
testable (that it generate meaningful predictions), reliable (that it generate
consistent predictions), coherent (not in conflict with itself), comprehensive (that it
explain a wide variety of phenomena) and simple. Obviously, both comprehen- 
siveness and simplicity are subjective and debatable, but the other requirements 
are relatively clear. 

Theory-building in psychology has not, of course, proceeded self-consciously to 
develop level by level as our description above might imply. Psychology devel- 
oped as a branch of philosophy, as did the other sciences, but the weaning was 
longer than for most. As late as 1900 most psychology departments were
subdivisions within philosophy departments; some still are. Along with the mentalistic
tradition of the 19th century, psychologists and pseudo-psychologists were
prone to ‘theorize’ by sticking into the organism whatever faculties, aptitudes,
instincts, etc., seemed to serve their immediate purpose. There were practically




as many intervening ‘explanatory’ constructs as there were things to be ex- 
plained. This has been aptly entitled ‘junk shop’ psychology. 

In the early part of the present century there was a general revulsion against 
this kind of theorizing, typified by the writings of such men as Watson, Kantor, 
Weiss, and more recently, Skinner. This stress on objectivity paralleled a similar 
revolution taking place in linguistics through the same period. These men went 
to the other extreme, the ‘empty organism’ position. This view held that the psy- 
chologist should concentrate on exploring the many functional relations between 
objectively verifiable S (stimulus) events and objectively verifiable R (response) 
events, avoiding intervening variables which involve ‘going into’ the organism. 
Thus, Skinner is content to study the behavior of the rat in the lever box under
various stimulus conditions where the only observations are tracings on a record- 
ing drum—the actual movements of the animal itself not even being observed. 
If all variations in R were in fact predictable from knowledge of the current
stimulus field, then this model would be sufficient. It is quite apparent, however,
that with S conditions constant, the characteristics of R will still vary as func- 
tions of factors like past history (previous learning), individual differences in 
aptitudes, motivation, personality, and so forth. Facts of this order led Wood- 
worth in the middle 30's, for example, to insert an O in the formula, i.e., S—O—R, 
where the O refers rather vaguely to gross classes of intervening ‘organismic’ 
variables. 

Most contemporary learning theorists utilize models which introduce certain 
terms between the S and R. These terms may be thought of as falling roughly into 
two classes: first, terms which imply nothing about the internal mechanics of the
organism but act as convenient summary terms, for example, ‘drive’ defined only 
in terms of hours since last feeding, ‘habit’ defined in terms of response probabili- 
ties or histories, etc.; and second, terms which are intended to describe internal 
states or activities, such as ‘drive’ defined in terms of blood chemistry, neural and 
muscular activity, ‘habit’ defined in terms of neural connections and strengths,
etc. Most systems use both types of concepts and attempt to avoid the ‘junk 
shop’ kind of psychology by introducing such terms only when they are unavoid- 
able and by anchoring these variables firmly to antecedent (S) and subsequent 
(R) observable conditions. 

At present, the models of learning theory are sets of Level II concepts and laws,
some of which are little better than plausible hypotheses. There have been no
acceptable attempts to develop or apply formal Level III systems except on a 
very limited basis. 

2.2.2.2. Four current theories of learning. While it is obviously impossible to de- 
velop in detail even one theory of learning in the space available here, an attempt 
will be made to outline, and present the contrasts between, four current theories 
which have great influence at the present time. These are the theories of Guthrie, 
Tolman, Skinner, and Hull. The interested reader is referred to the more adequate 
accounts of these and other systems given in the list of references following this 


section.




Guthrie's Association Theory 


Of the theories to be considered here, perhaps the system which is simplest in 
appearance is that of E. R. Guthrie. This system, which is one of the early off- 
shoots of Watsonian Behaviorism, reduces all learning to a simple associative 
rule: any combination or totality of stimuli which has accompanied a movement will 
be followed by that movement when the combination occurs again. Complete learning 
thus occurs on the first occasion on which a stimulus is paired with a response. 

At first glance this simple association rule may seem to be in direct
disagreement with the phenomena discussed previously, but this is not at all the case.
Guthrie is concerned with stimuli and responses at a ‘molecular’ level. Viewed in 
minute detail, no total stimulus pattern is ever exactly like another. Even if all 
external stimuli were rigidly controlled, changes are taking place within the or- 
ganism (it is getting hungrier, thirstier, older, weaker, etc.; it is tense, relaxed, 
asleep, etc.; ad infinitum). Similarly, no two movements are ever exactly alike. 
They differ in the precise musculature involved, the state of the musculature, etc.
The consequence of this infinite shading and change in both stimuli and
responses is that learning appears to increase gradually through practice and time
as more and more of the total possible stimuli and patterns of stimuli become 
associated with more and more of the relevant responses or muscle actions. 
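
How all-or-none association at the molecular level can yield a gradual curve at the gross level may be easier to see in a small simulation. The sketch below is an illustration under stated assumptions, not a formalization given by Guthrie or by the text: the stimulus situation is treated as a fixed pool of elements, a random subset of which is present on each trial, and every element present becomes fully attached to the response on that trial. The pool size, the sample size, and the function name `simulate` are hypothetical.

```python
import random


def simulate(n_elements=100, sample_size=10, trials=40, seed=0):
    """Proportion of stimulus elements attached to the response after each trial."""
    rng = random.Random(seed)
    conditioned = set()
    curve = []
    for _ in range(trials):
        # Only part of the total stimulus pool is present on any one trial.
        present = rng.sample(range(n_elements), sample_size)
        # One-trial, all-or-none association of every element present.
        conditioned.update(present)
        curve.append(len(conditioned) / n_elements)
    return curve


curve = simulate()
for trial in (1, 5, 10, 20, 40):
    print(trial, curve[trial - 1])
```

Although each element is attached completely in a single trial, the proportion of the pool attached to the response rises gradually and with diminishing gains, since later samples contain more and more elements that are already attached.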


Generalization is taken care of by thinking of similar gross stimuli as actually consisting 
of overlapping pools of minute stimuli. As the stimuli become more dissimilar, the pools of 
stimuli overlap less and less until finally there are too few common elements to mediate 
the appropriate response. In order to handle temporal delays and sequences, Guthrie makes 
extensive use of movement-produced-stimulation as the actual stimulus field to which 
the responses are associated. Motivation and reward have no primary status in Guthrie’s 
system. They operate only as they affect the central principle. Motivation is important 
in that it determines and intensifies sets of movements which then are available for as- 
sociative connections. It supplies members to both the stimulus and response pools. Reward 
is important in that it terminates a class of movements and changes the stimulus situation— 
removing the organism, so-to-speak, from the situation before other movements can be 
associated with the stimuli. Thus, reward acts to prevent associations being formed with 
incompatible responses; it has no ‘positive’ function. Extinction occurs, according to Guthrie,
when the ‘correct’ response no longer terminates the situation. Other movements follow
and in turn are associated with the stimuli. In this manner on successive trials more and
more stimuli are related to other movements and responses until finally the ‘correct’
response disappears. With ever changing stimulus pools, competing responses which are close
to the same strength will occur in various alternations, depending on the exact number of 
stimulus-movement associations present, until one of them gains a clear superiority. 


It may be seen even in this brief presentation that Guthrie's theory deals with
inferred elements of external and internal stimuli and inferred elements of
responses. If everything is exactly the same, the organism will do exactly as it did
the previous time. If it does not, then it may be argued that things really were not
all the same. This amounts to saying that critical tests of the theory are difficult,
if not impossible, to devise. The theory is facile in explanation but weak in
prediction; it can be used to explain almost any (even directly opposite) outcomes.




Its generality and simplicity are advantages, but it leaves much to be desired in 
the way of precision, reliability, and testability. 


Tolman's Sign-Gestalt Theory 


A sharp contrast to Guthrie's theory both as to sources and complexity is the 
sign-gestalt theory of E. C. Tolman. Drawing from virtually all psychology, from 
Watson’s behaviorism on one side to Lewin’s gestalt psychology on the other, 
Tolman builds a purposive, behavioristic theory of learning. The theory breaks 
sharply with the association of elemental stimuli and elemental movements and 
attempts to deal with goal-directed, whole acts of the organism. The level of 
description employed is molar, showing the relation of the organism to the goal. 
Most significantly perhaps, Tolman insists that what is learned is not movements 
or responses but ‘sign-significate expectations.’ The organism learns meanings 
and ‘what-leads-to-what’ relationships. The relationship between a sign and its
significate (an early stimulus and a later stimulus) is established in accord with
the usual contiguity rule of association, and this relation is the ‘expectation.’ 
Thus, to Tolman, classical conditioning may be interpreted by saying that the 
buzzer comes to mean food-in-the-mouth and the salivation is a consequence of 
this meaning or expectancy. 

This system stresses contiguity of stimuli in building up expectations. The 
closer in time two stimuli occur the greater the likelihood that an expectation will 
be set up. Practice plays a role in confirming and strengthening expectations. The
more often S2 follows S1 the higher is the expectancy. It may be seen that
expectancy can be viewed as a cognition of the probability that a given event will
follow another. What increases, then, is not response potentials or habits but 
cognitions, which may be acted on in a variety of ways depending on the cumu- 
lative past experiences of the organism with objects and situations in its environ- 
ment. This gives Tolman’s system flexibility and allows him to predict the strik- 
ing changes which are sometimes observed in the behavior of organisms when the 
learning situation is radically changed (such as providing alternative routes to 
a goal, changing the goal object, etc.). 


Generalization is regarded as the result of stimulus sign-equivalence. Alterations in 
stimuli only affect performance by changing the expectancies of the organism. Reward and 
motivation have no direct effect upon learning in this system but affect performance, which 
is regarded as clearly different from learning. Thus, a rat may ‘know’ how to run a maze
(i.e., he may know the sign-significate relationships of all of the pathways) but not demon-
strate this in performance until he is rewarded for it, at which time his ‘knowledge’ should
suddenly become evident. Reward does, of course, enter in as a stimulus significate whose
presence or absence confirms or weakens an expectation. Motivation enters in as a sensitizer
or emphasizer of certain significates or sign-significate relations which have been associated
with it. Extinction is treated as a progressive disconfirmation of expectancies which cumu-
latively couples with the pattern of preceding situations to eliminate the learned per-
formance. Spontaneous recovery takes place because the pattern of preceding situations is
changed and the expectancy is still at some strength. When alternative responses are avail- 
able, the pattern of behavior will ensue which is in accord with the strongest expectancy, 
and when that expectancy is disconfirmed the next strongest will control behavior and so on. 




Tolman also points out that individual differences in organisms (heredity, age, 
training, special physical conditions) act to define particular behaviors on any 
occasion. He is thus one of the few learning theorists to comment on capacity 
laws, but even he has done little with them. In general, Tolman’s position is a very 
broad one. He recognizes levels of learning and lately has come to suggest that 
there may be several kinds of learning. His system has stimulated much research,
especially in the area of cognition. He has been criticized for vagueness and non- 
quantitativeness, but in part this is true of all of these theories. 


Skinner’s Descriptive Account 


B. F. Skinner himself would deny that his approach is a theory or that psychol-
ogy needs theories. He prefers, as indicated above, to collect and classify phe-
nomena on Level II, using the most rigorous and simple descriptive categories
he can develop, toward the end of systematizing knowledge about the basic
phenomena of learning. The first major difference between Skinner and the other
theorists discussed here is that he regards the two paradigms, conditioning and 
instrumental learning, as representing different kinds of learning. 


Pavlovian conditioning is regarded as a highly specialized form of learning which plays
little part in most human behavior. Skinner terms it respondent conditioning, emphasizing
that it utilizes a response which can be elicited by a specific stimulus. The laws of respondent
conditioning state (1) that contiguity of stimulation is the condition for increasing the
strength of the CS-R relation and (2) that the exercise of the CS-R without the US results
in decreased strength. These laws are summary descriptive statements with little elabora- 
tion, and in general Skinner has little concern with this kind of learning. 

In instrumental conditioning stimulus conditions sufficient to elicit the behavior can-
not be specified and are in fact irrelevant to the understanding of this behavior. The im-
portant aspect in this model is that responses are emitted and that they generate conse-
quences. Skinner calls this operant behavior, stressing the role of the response. He is most
concerned with the laws of this model and is convinced that most human behavior (in-
cluding specifically language behavior) is dependent on this kind of learning. The basic
laws of operant conditioning state that (1) if an operant is followed by the presentation of
a reinforcing stimulus, its strength is increased and (2) if an operant is not followed by a
reinforcing stimulus, its strength is decreased. In most situations an operant does become
related to the stimulus field. It may come to occur, for example, only in the presence or 
absence of given stimuli. It is then termed a discriminated operant, but it is still not elicited. 
The stimulus conditions merely furnish the occasion for the appearance of the operant. 


Skinner's system is somewhat like Tolman’s in that it tends to deal with acts 
(not specific muscle movements) but unlike it in that it stresses the role of rein-
forcement. The all-important contiguity is that of the response and the reward,
and one of the major determinants of the strength of an operant is the number of
times the response-reward pairing occurs. These pairings summate in a non-linear
but increasing fashion to increase the probability of occurrence or rate of occur-
rence of the operant.

Skinner introduces the concept of a reflex reserve⁹ which may be defined
loosely as the amount of ‘available activity’ of a given sort which the organism is


9 This concept was used in Behavior of organisms but has been dropped in later work.




capable of emitting. Rewarding an operant increases the size of the reserve and 
non-reward decreases it. The rate of responding at any given moment is the func- 
tion of the size of the reserve. Responses are emitted as some proportion of the 
total reserve remaining. 


The size of the reserve is not a simple function of the number of reinforcements, how- 
ever. Skinner has found that periodic reinforcement (one rewarded response every few 
minutes), aperiodic reinforcement (rewards on a random time schedule), fixed ratio re-
inforcement (reward every nth response), etc., generate very great reserves. His theory
lays considerable stress on the important role played by secondary reinforcement (the 
discriminatory stimuli), and he finds this quite useful in discussing language behavior. 

The proportionality which exists between the reserve and the rate of responding may 
be altered by differing ‘states’ of the organism. ‘States’ are carefully defined intervening 
variables such as drive, emotion, etc. The hypothetical term ‘state’ is introduced when it 
can be shown that several operations affect several reflexes in a similar fashion. States are 
defined by the operations and their effects and imply no physiological correlates. (It is this 
aspect of the system which has led to its being labeled as an ‘empty organism’ approach.) 
Certain states increase the proportionality; others decrease it, but none of them are said 
to change the size of the reserve. As an example, in a state of high drive a rapid rate of 
responding would be established and, if the operant were not rewarded, rapid extinction 
would take place; in a state of low drive the rate of response would be low and extinction 
slow. Presumably, the same number of responses would be made in either case. 
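
The reserve account in the preceding paragraphs can be made concrete with a small numerical sketch. It is only an illustration under stated assumptions, not Skinner's own formulation: the reserve is treated as a single number, each time step emits a fixed proportion of what remains, no response is rewarded (extinction), and the names `extinction_curve` and `steps_until_below` as well as the proportions 0.5 and 0.05 are hypothetical stand-ins for high and low drive.

```python
def extinction_curve(reserve, proportion, steps):
    """Responses emitted at each time step while no response is rewarded.

    Assumption: every unrewarded response removes one unit from the reserve,
    and `proportion` (the effect of the organism's state, e.g. drive) fixes
    what fraction of the remaining reserve is emitted per step.
    """
    emitted = []
    for _ in range(steps):
        responses = proportion * reserve
        emitted.append(responses)
        reserve -= responses
    return emitted


def steps_until_below(curve, rate):
    """Index of the first step whose response rate falls below `rate`."""
    for step, responses in enumerate(curve):
        if responses < rate:
            return step
    return len(curve)


high_drive = extinction_curve(reserve=100.0, proportion=0.5, steps=400)
low_drive = extinction_curve(reserve=100.0, proportion=0.05, steps=400)

# High drive starts fast and extinguishes quickly; low drive is slow throughout.
print(high_drive[0], low_drive[0])
print(steps_until_below(high_drive, 0.1), steps_until_below(low_drive, 0.1))
# Both totals approach the same reserve.
print(round(sum(high_drive), 3), round(sum(low_drive), 3))
```

Under these assumptions the state changes only the rate at which the reserve is spent, so the two curves differ in shape while their totals agree, which is the point of the high-drive and low-drive comparison.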


Skinner has studied stimulus discrimination and response differentiation ex- 
tensively. When reward is made experimentally dependent on stimulus condi- 
tions, discrimination takes place. When it is dependent on response characteris- 
tics, differentiation of response takes place. Skinner’s view is roughly one of mass 
behavior in a context of generalized stimuli, both becoming progressively more 
defined as the situation demands it. The problem, as he sees it, is not explaining 
generalization but rather the lack of it and, similarly, not response variability but 
lack of it. This aspect of his approach has some great advantages in dealing with 
progressively changing behaviors. In situations in which alternative responses are 
available, the response of highest strength has the greatest probability of occur-
rence. Alternative responses become available as earlier responses are weakened.

Skinner’s system has been criticized on the grounds of its narrowness, its 
concern with only the lever box situation as an experimental base, and its use of 
the reflex reserve concept. It is, however, basically an empirical, descriptive 
approach and in the main there can be little argument with its basic laws. It has 
been valuable and stimulating in its somewhat different analysis of the learning 
process and in the attention it has directed towards special facets of learning
phenomena. 


Hull’s Deductive System 


The most ambitious attempt to develop a rigorous, formal learning theory is 
unquestionably that of C. L. Hull. This system consists of a basic set of postu- 
lates from which, it is hoped, the laws of learning may be deduced in clear and 
quantitative form. The system stems most directly from Watsonian behaviorism 
and Thorndike’s connectionism.




At the center of the Hullian system are two notions: habit strength and drive
reduction. Habit is a tendency for a given stimulus discharge in the nervous sys- 
tem to evoke a given response. It is what is learned. Drive reduction is the diminu- 
tion of the neural state accompanying a need. It is the condition which effects 
learning; it is reinforcement. It is apparent that Hull does not hesitate, as some 
other theorists discussed here, to ‘get inside’ the organism and to make positive 
claims about the nature of physiological events. It should be kept in mind, how- 
ever, that he anchors these variables (in their role as constructs) to observable 
events. 

Step by step Hull’s postulates describe the process of learning as follows: 


Stimuli impinge on the organism and generate neural activity which persists for some 
time before disappearing (P-1).¹⁰ Complex stimuli interact in the nervous system to pro-
duce modified stimulus patterns (P-2). Organisms have innate general responses which are
set in action by needs. These are not random responses but are selectively sensitized re- 
sponses which have relatively high probabilities of terminating the specific need (e.g. 
withdrawal from pain, general movement and locomotion when hungry, etc.) (P-3). When 
a stimulus trace and a response occur in close contiguity and, at the same time, need is 
reduced, an increment is added to the habit strength of the particular stimulus-response 
pair (P-4). This constitutes learning. 

Stimuli which are similar evoke the same responses, and the amount of generalization 
is a function of the difference between the stimuli in terms of ‘just noticeable differences’ 
(a commonly used form of measurement in psychology of sensation) (P-5). Drives have 
stimulus properties and the intensity of a drive stimulus increases with intensity of the 
drive (P-6). Reaction potential is a product of habit strength and drive (P-7), but does 
not in itself lead directly to responding. Reaction potential to be effective must be greater 
than the resistances to response, reactive inhibition (similar to fatigue), conditioned 
inhibition (learned nonresponding) and the oscillating inhibitory potential associated 
with the reaction potential (P-8, 9, 10). If the momentary effective reaction potential is 
above the reaction threshold and stronger than competing responses, the response will 
occur (P-11). The remaining postulates discuss response measurement and incompatible 
responses. 

Since Hull's system embraces both of the paradigms given above and is at the same 
time a reinforcement theory, his concern with contiguity is two-fold. He is concerned
with the contiguity of the stimulus and the response and of the response and the reward. Learn-
ing is a function of the time lapse between the stimulus and the response according to a
rather steep gradient and also a function of the time lapse between the response and the 
reward according to a gradient of reinforcement. This gradient of reinforcement is believed 
to be quite short. The gradient of reinforcement, however, can in effect be lengthened 
into a goal gradient. Stimuli within the range of the gradient of reinforcement acquire 
secondary reinforcing power and develop reinforcement gradients of their own. These 
small overlapping gradients summate to produce a major gradient extending considerable 
distances in space and time from the primary reinforcement itself. This complex treat- 
ment of contiguity proves to be a very useful tool in discussing many learning situations. 


Habits are formed and increase in strength as a function of the number of re-
inforcements and the amount of need reduction. Since Hull specifies his position
on generalization, it is easy to see how the summation can take place even though


10 This is the postulate number, here Postulate 1. The postulates themselves are quite
lengthy and detailed. The sentences here are crude approximations.




exact conditions are not reproduced. One interesting facet of Hull's theory is that 
habits are never ‘unlearned.’ Habit strength can only increase or remain the 
same since it is a function only of rewarded trials. Unrewarded trials do decrease 
responses, however, because they lead to increased reactive and conditioned in- 
hibition. A response which had been ‘completely learned’ and ‘completely extin- 
guished’ would be represented here by maximum habit strength and maximum 
conditioned inhibition with the net result that it would not appear. Drive and 
drive reduction are also obviously important in this system. Drive has an activa- 
tional role through its multiplicative relation with habit strength and in addition 
exercises a stimulus role. Emphasis of this stimulus role permits Hull to explain 
experiments which otherwise would be classed as cognitions and has led to 
interesting work on generalization gradients and discriminative characteristics 
of drives. 
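
The relations among habit strength, drive, and inhibition described above can be sketched numerically. This is a rough illustration rather than Hull's own equations: the function names are hypothetical, all quantities sit on an arbitrary 0-to-1 scale, the inhibitory terms are assumed simply to subtract from reaction potential, and competition among responses is left out.

```python
def reaction_potential(habit_strength, drive):
    """Reaction potential as the product of habit strength and drive (P-7)."""
    return habit_strength * drive


def response_occurs(habit_strength, drive, reactive_inhibition,
                    conditioned_inhibition, oscillation, threshold):
    """True when the momentary effective reaction potential exceeds threshold.

    Assumption: the inhibitory terms are simply subtracted from the reaction
    potential; competing responses are not modeled.
    """
    effective = (reaction_potential(habit_strength, drive)
                 - reactive_inhibition
                 - conditioned_inhibition
                 - oscillation)
    return effective > threshold


# Hypothetical values on an arbitrary 0-to-1 scale.
learned = response_occurs(1.0, 0.8, 0.0, 0.0, 0.1, 0.2)
# Habit strength is unchanged, but conditioned inhibition has built up.
extinguished = response_occurs(1.0, 0.8, 0.1, 0.7, 0.1, 0.2)
# Full habit strength with no drive yields no reaction potential.
no_drive = response_occurs(1.0, 0.0, 0.0, 0.0, 0.1, 0.2)
print(learned, extinguished, no_drive)
```

The second case mirrors the ‘completely learned’ and ‘completely extinguished’ response: habit strength stays at its maximum, yet the response fails to appear because inhibition cancels the reaction potential.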

At every step Hull attempts to state at least tentatively the nature of the 
mathematical functions relating his constructs to each other and to the observ-
able antecedent and subsequent conditions. He also derives corollaries or second-
ary principles which amplify the basic principles. Hull’s position is that all beha-
vior should be deducible from the system. He urges that such attempts be made
and, when confirmation is not obtained, the postulate set be appropriately re- 
vised. He (as indeed all other theorists) regards the system as a self-correcting 
one, continuously predicting, verifying, and altering until it is complete. 

Hull’s system has been criticized for a variety of reasons—its excursions into 
the nervous system, its insistence upon reinforcement as a necessary condition for 
all learning, its lack of adequate definition of response, its peculiar mixture of 
levels in postulates, etc. In the main, however, it has proved to be quite durable. 
The system has been tremendously successful in stimulating research and pro-
viding a frame of reference for new material. It has been, and is being, widely
extended to applications in social psychology, personality, and language behavior.

All of the theories outlined above are part of the behavioristic and functionalist
tradition. As such they are primarily concerned with the phenomena of learning 
as manifested in overt behavior and, with the partial exception of Tolman, they 
all use ‘mechanistic’ terms in describing their concepts. Some critics believe that 
these theorists have not sufficiently considered physiological knowledge in devel- 
oping their concepts. A seemingly larger group of critics object to the apparently 
‘mechanistic’ and ‘atomistic’ nature of these concepts and feel that such con- 
cepts cannot do justice to the full range of human and animal behavior. While 
such criticism has been expressed in many diverse ways, much of the basis for 
such objections may be found in the work of the gestalt psychologists, whose 
name comes from the emphasis they have placed on ‘wholes’ and ‘organizing 
principles.’ Because the primary concern of this group has been the study of per- 
ception and problem solving rather than learning, a summary of their theorizing 
has not been included in this section. (A brief discussion of some of the gestalt 
phenomena in perception may be found in section 3.1. of this report.) 





2.23. Bibliography 


1. Guthrie, E. R., 1935, The psychology of learning, New York.

A presentation of Guthrie's theory along with applications to many everyday be- 
havioral situations. Highly readable. 

2. Hilgard, E. R., 1948, Theories of learning, New York.

A fairly recent and comprehensive survey and evaluation of various contemporary
conceptions of learning, including gestalt and behavioristic theories. 

3. Hilgard, E. R. and D. G. Marquis, 1940, Conditioning and learning, New York. 

A more advanced and tightly argued analysis of the fundamental concepts and phe-
nomena of learning. One of the most important critical books in the field.

4. Hull, C. L., 1943, Principles of behavior, New York.

A systematic presentation of Hull's learning postulates and their application to 
phenomena. Although more recent revisions have appeared, this is probably the best 
comprehensive statement. 

5. Koffka, K., 1935, Principles of gestalt psychology, New York. 

Probably the most detailed and extensive presentation of the gestalt point of view, 
including materials on perception and learning. 

6. Lewin, K., 1936, Principles of topological psychology, New York. 

Behavior of organisms treated in terms of field theory. 

7. McGeoch, J. A. and A. L. Irion, 1952, The psychology of human learning, New York.

An extensive coverage of the facts and theories of learning as applied particularly 
to human behavior. 

8. Miller, N. E., 1951, Learnable drives and rewards, in Handbook of experimental psy- 
chology (Ed., Stevens), New York. 

A penetrating analysis of the nature and development of secondary motivation 
and reward mechanisms, with presentation of experimental findings. 

9. Miller, N. E. and J. Dollard, 1941, Social learning and imitation, New Haven. 

An application of Hull-type learning theory to a variety of social problems. Provides 
a very readable introduction. 

10. Mowrer, O. H., 1950, Learning theory and personality dynamics, New York. 

A collection of Mowrer’s best papers relating learning theory to personality dynamics. 
Also presents Mowrer’s two-factor theory of learning. 

11. National Society for the Study of Education, 1942, The psychology of learning, 41st
Yearbook, Part II. 

Presents in careful, systematic form several important summaries of contemporary
theories as written by their sponsors.

12. Osgood, C. E., 1953, Method and theory in experimental psychology, New York.

A graduate text in experimental psychology, including sections on sensory processes, 
perception, learning, and symbolic processes. Includes critical analyses of contemporary 
learning theories and a presentation of the author's mediation hypothesis. 

13. Skinner, B. F., 1938, The behavior of organisms, New York. 

The most complete presentation of Skinner's conception of learning and data de-
rived from his type of instrumental situation. Important for its methodological con-
tributions as well as for its viewpoint.

14. Spence, K. W., 1951, Theoretical interpretations of learning, in Handbook of experi-
mental psychology (Ed., Stevens), New York.

A critical comparative evaluation of contemporary theories of learning by one of
the most active theorists and investigators on the scene today. 

15. Tolman, E. C., 1932, Purposive behavior in animals and men, New York. 

A systematic presentation of Tolman’s early views along with results of numerous 
experiments. A classic in the field, and a book which has had great influence upon con-
temporary psychology.

16. Wertheimer, M., 1945, Productive thinking, New York. 




A delightful little book, published posthumously, by one of the founders of the 
gestalt movement. It presents Wertheimer's insightful analysis of the process of in- 
sight and problem-solving in humans. 


2.3. The Information Theory Approach¹¹


Strictly speaking, the term information theory is a misnomer. As the following 
discussion will indicate, it is not a theory of ‘information,’ per se, but of informa- 
tion transmission, and then only in situations where a message input may be 
said to contain ‘information’ in something like the usual sense of the word. It 
is concerned with characterizing the entropy or uncertainty of sequences of 
events. It was with such considerations in mind that it was suggested¹² that the
label information theory be replaced by theory of signal transmission. At any rate, 
information theory is essentially an extension of the general mathematical 
theory of probability which has provided some useful descriptive measures in 
several areas of scientific research. Therefore, it is necessary to be acquainted
with some of the fundamentals of the concept of probability and probability 
theory in order to properly understand information theory. 


2.3.1. Probability Theory 

Despite the seeming simplicity of the concept of probability, there has been 
much controversy among mathematicians and logicians over its definition. The 
reader interested in the details of this controversy, as well as the development 
of a mathematical theory of probability from a postulational system, may be 
referred to works by Nagel (14) and Feller (4), listed in the references appended 
hereto. There is general agreement, however, that for most practical and theo- 
retical purposes the probability of an event may be defined as the limit of the 
relative frequency of its occurrence. In mathematical symbolism, 


$$ p'(i) = \lim_{n \to \infty} \frac{f(i)}{n}, $$


where p'(i) is the ‘true’ probability of event i, lim symbolizes the limit of the
following expression as n becomes indefinitely large, n is the number of
events, and f(i) is the frequency of occurrence of event i. In other words, if we have
n events, and a particular class of event, symbolized i, occurs f(i) times, the
true probability of event i is the value towards which the ratio of f(i) to n tends
as we allow n to become indefinitely large.

In practice, of course, we cannot determine probability in this fashion since 
we can never have an indefinitely large (i.e., infinite) number of events. There- 
fore, we simply compute 

$$ p(i) = \frac{f(i)}{n} $$
11 Kellogg Wilson.
12 By Y. Bar-Hillel, in a talk at Massachusetts Institute of Technology, 1952.




for some reasonably large but finite value of n. In this case p(i) may be called
an empirical estimate of the true probability. If p(i) tends to become nearer
and nearer to p'(i) as n increases, then we say that the process generating our
sequences of n events is a stochastic process. For example, if we spin a fair roulette
wheel we should find that the estimated probability of any number tends to become closer
to 1/38 as n, the number of spins or trials, increases. This would be a stochastic
process. Now suppose that there is a magnet under the wheel that tends to
attract the ball to the 0 or 00 position and which is turned off and on randomly.
In this case, we would find that the probabilities of 0 and 00 would tend to
decrease toward 1/38 when the magnet is off and increase away from 1/38
when the magnet is on. This would not be a stochastic process since the estimates
of probabilities would not converge toward any particular value. However,
if the magnet were left on constantly we would most likely find that our
probability estimates would converge to some values greater than 1/38 for 0
and 00 and less than 1/38 for the remaining events and we would again have a
stochastic process. Since some such severe fluctuation in the condition of sampling
is required to make a process non-stochastic, we may generally assume
that we are dealing with stochastic processes.


[Figure 8: the class of all events divided into mutually exclusive sub-classes]
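
The roulette example can be tried out with a short simulation. This is a sketch under stated assumptions: the wheel has 38 pockets (0, 00, and 1 to 36), the function name `estimate_probability` is hypothetical, the pseudo-random generator is seeded only to make runs repeatable, and the magnet weights are invented for illustration.

```python
import random

POCKETS = ["0", "00"] + [str(k) for k in range(1, 37)]  # 38 pockets in all


def estimate_probability(pocket, n, weights=None, seed=0):
    """Empirical estimate p(i) = f(i) / n for one pocket over n spins."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    spins = rng.choices(POCKETS, weights=weights, k=n)
    return spins.count(pocket) / n


# Fair wheel: estimates should drift toward 1/38 as n grows.
for n in (100, 10_000, 200_000):
    print(n, estimate_probability("0", n), 1 / 38)

# Magnet left on constantly (hypothetical weights): 0 and 00 are three times
# as likely as any other pocket, so the estimate settles above 1/38.
magnet_weights = [3, 3] + [1] * 36
print(estimate_probability("0", 200_000, weights=magnet_weights), 3 / 42)
```

With a constant set of weights the estimate still converges, only to a different value; it is the magnet switching on and off at random that would keep the estimates from settling.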

In scientific work we choose certain features of our empirical events as a
basis for classification and disregard others. In this manner we create sets or
classes of events (e.g., a ‘response,’ which may include all the minute variants of
a bar-pressing response, or a ‘phoneme,’ which is a class of allophones). Such
sets are sub-sets of the class of all events. We will here consider only events
which may fall in only a finite number of classes or sets, since such discrete
classes are nearly always used in psychology and in linguistics. We will use the
symbol Ω to refer to the class of all events, which is divided into a finite number
of sub-classes or sub-sets as in Figure 8. The sub-classes in Figure 8 are mutually
exclusive; that is, no event may fall in more than one sub-class, and therefore
no two sub-classes have any events in common. We will use S to refer to the
class of all such sub-sets. We will let i refer to any of the sets in S and p(i) to the
probability of an event being in set i. When Ω and S are defined, and p(i) deter-
mined (or estimated) for each set in S, we have defined a probability space.
Before we employ probability or information theory in dealing with empirical
data, we should make sure that we have a completely defined probability space.
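
As a rough illustration of what a completely defined probability space amounts to in practice, the sketch below estimates p(i) for each class from a finite sample. The function names, the sample word, and the two-class scheme are all hypothetical; mutual exclusiveness is obtained by assuming that a classifying function assigns every event to exactly one class.

```python
from collections import Counter
import math


def estimate_probability_space(events, classify):
    """Estimate p(i) for each class i from a finite sample of events.

    `classify` maps every event to exactly one class label, so the classes
    are mutually exclusive by construction.
    """
    counts = Counter(classify(event) for event in events)
    n = sum(counts.values())
    return {label: count / n for label, count in counts.items()}


def is_complete(space):
    """A defined probability space must have probabilities summing to one."""
    return math.isclose(sum(space.values()), 1.0)


def vowel_or_consonant(letter):
    # Hypothetical two-class scheme; 'y' is counted as a consonant here.
    return "vowel" if letter in "aeiou" else "consonant"


space = estimate_probability_space("psycholinguistics", vowel_or_consonant)
print(space, is_complete(space))
```

Here the letters of the sample word play the part of the class of all events, the two labels play the part of S, and the returned dictionary holds the estimated p(i).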

We can now introduce the notions of joint probability, conditional probability,
and independence which are so crucial to information theory. If we have
a set of simultaneous or successive events, each of which may fall into one of 
the classes in the probability space, it is often useful to consider the probability 
of a joint occurrence of events. The joint probability, p(i, j), of events in classes
i and j is the probability of their joint occurrence, that is, the relative frequency
of a joint event involving both i and j in a large number (or an indefinitely large
number) of joint events. The conditional probability, $p_i(j)$, of class j is the proba-
bility of j when it is given that an event in class i had also occurred; thus, it is
the relative frequency of j in the class of all joint events involving i. In mathe-
matical symbolism,


$$ p_i(j) = \frac{p(i, j)}{p(i)}, $$


and therefore $p(i, j) = p(i) \cdot p_i(j)$. We say that classes i and j are independent
if and only if the probability of an event being in class j is unaffected by its
being in class i; i.e., if $p(j) = p_i(j)$, then i and j are independent. If i and j are
independent, then $p(i, j) = p(i) \cdot p(j)$, and this formula, or an extension of it,
$p(i, j, \ldots, s) = p(i)\,p(j) \cdots p(s)$, may be used to compute the joint probability
of a set of independent events.
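
The definitions of joint probability, conditional probability, and independence can be checked on a small set of observed pairs. The following is a sketch only: the function names and the two toy data sets are hypothetical, and probabilities are estimated as relative frequencies in a finite sample.

```python
from collections import Counter
import math


def probabilities(pairs):
    """Estimate p(i), p(j), p(i, j) and p_i(j) from observed (i, j) pairs."""
    n = len(pairs)
    p_first = {i: c / n for i, c in Counter(i for i, _ in pairs).items()}
    p_second = {j: c / n for j, c in Counter(j for _, j in pairs).items()}
    p_joint = {ij: c / n for ij, c in Counter(pairs).items()}
    # Conditional probability: p_i(j) = p(i, j) / p(i)
    p_cond = {(i, j): p / p_first[i] for (i, j), p in p_joint.items()}
    return p_first, p_second, p_joint, p_cond


def independent(pairs):
    """True if p(i, j) = p(i) * p(j) for every combination of classes."""
    p_first, p_second, p_joint, _ = probabilities(pairs)
    return all(
        math.isclose(p_joint.get((i, j), 0.0), p_first[i] * p_second[j])
        for i in p_first for j in p_second
    )


# Hypothetical data: in the first set 'x' mostly follows 'a'; in the second
# the two positions are unrelated.
dependent = [("a", "x")] * 3 + [("a", "y")] + [("b", "x")] + [("b", "y")] * 3
unrelated = [("a", "x"), ("a", "y"), ("b", "x"), ("b", "y")] * 2

p_first, p_second, p_joint, p_cond = probabilities(dependent)
print(p_cond[("a", "x")], p_second["x"])
print(independent(dependent), independent(unrelated))
```

In the first data set the conditional probability of x given a differs from the unconditional probability of x, so the classes are not independent; in the second the two values agree for every pair.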

Languages appear to be structured so that in a sequence of language events, 
subsequent events are rarely independent of antecedent events, and not all the 
possible sequences occur. The so-called Markov process is appropriate as a 
model for representing such sequences. Suppose, for example, that A, B, C, 
D, and E represent all possible events in a set of sequences. Suppose also that 
the conditional probabilities of these events are as represented in Figure 9. O 
represents the state of the system while at its starting point and is merely a 
convention which indicates the point at which sequential dependency begins 
and ends; in this example, sequential dependency begins when A or B occurs 
and ends whenever C, D, or E occurs. The arrows indicate the sequential order 
of the alternative events and the figures within the arrows indicate the con- 
ditional probabilities of the subsequent events. The complete set of sequences 
which may be generated in this example and their probabilities are as follows: 


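[Diagram: arrows lead from O to A and B, from A to B, C, and D, and from B to D and E, each labeled with its conditional probability; C, D, and E return to O with probability 1.0.]

Sequence    Probability
AC          .6 × .3 = .180
AD          .6 × .5 = .300
ABD         .6 × .2 × .8 = .096
ABE         .6 × .2 × .2 = .024
BD          .4 × .8 = .320
BE          .4 × .2 = .080
Total       1.000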


Figure 9 


The probability of each sequence is the product of the conditional probabilities 
of each event in the sequence, and may also be said to be the joint probability 
of the events in the sequence. The sum of these probabilities equals unity. In




general a Markov process may be used to represent any set of sequences of 
events such that the probabilities of subsequent events are dependent on par- 
ticular antecedent events. 
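
The sequence probabilities of the Figure 9 example can be reproduced by walking the transition structure. This is a minimal sketch: the dictionary layout and the function name are hypothetical, the conditional probabilities are read from the example, and the A-to-B value of .2 is an assumption inferred from the requirement that the probabilities leaving A sum to one.

```python
import math

# Conditional probabilities for the example. O is the starting state; the
# A -> B value of 0.2 is an assumption chosen so each row sums to one.
TRANSITIONS = {
    "O": {"A": 0.6, "B": 0.4},
    "A": {"B": 0.2, "C": 0.3, "D": 0.5},
    "B": {"D": 0.8, "E": 0.2},
    "C": {"O": 1.0},
    "D": {"O": 1.0},
    "E": {"O": 1.0},
}


def enumerate_sequences(state="O", prefix="", probability=1.0):
    """Return {sequence: probability} for every path from O back to O."""
    sequences = {}
    for next_state, p in TRANSITIONS[state].items():
        if next_state == "O":
            # Sequential dependency ends here; record the finished sequence.
            sequences[prefix] = probability * p
        else:
            sequences.update(
                enumerate_sequences(next_state, prefix + next_state, probability * p)
            )
    return sequences


sequences = enumerate_sequences()
for name, p in sorted(sequences.items()):
    print(f"{name:4s} {p:.3f}")
print("Total", round(sum(sequences.values()), 3))
```

Each sequence probability is the product of the conditional probabilities along its path, and the six products together account for all of the probability.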


2.3.2. Basic Concepts of Information Theory 


In the subsequent portions of this presentation, we will be concerned with 
probability spaces for which Ω (the class of all events) is a restricted class of
physical events (e.g., sounds of speech, energy changes which serve as stimulus
inputs). S (the class of all sub-sets of events) for these probability spaces is some
‘convenient’ or ‘suitable’ ordering of these events into a finite number of sub-
classes. Henceforth, we shall refer to the entire set of events in Ω as a system,
and each of the sub-classes of events in S will be referred to as a state of the
system.

Comparable systems may differ in ‘randomness’ due to differences in the 
probabilities of their states or in the degree to which their states are dependent 
on prior states. The measures of information theory are extensions of the entropy 
measures of thermodynamics and measure the degree of entropy—i.e., ‘random- 
ness'—of a system's states. A system possesses maximum entropy when its 
states are equiprobable and independent of previous states—uncertainty is 
maximal and predictions can be no better than chance. For example, consider 
a system consisting of a tossed coin which has two states, H (heads) and T 
(tails): if the coin is fair, and p(H) = p(T) = .5, the system has maximal 
entropy and we are maximally uncertain of the outcome of each toss. The 
entropy of a system is decreased when the probability of some of its states is 
greater than others—we are less uncertain about what states will occur and 
predictions can be better than chance. If our coin is ‘biased,’ and p(H) = .75 
and p(T) = .25, the system possesses less than maximal entropy and we are 
less uncertain about the outcome. Entropy can also be reduced by making 
subsequent events dependent upon antecedent events, which will be discussed 
in greater detail at a later point. 
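
The comparison of the fair and the biased coin can be put in numbers. The sketch below uses the standard Shannon measure of entropy in bits, which the passage has not stated at this point, so it should be read as an assumption brought in for illustration; the function name `entropy` is likewise hypothetical.

```python
import math


def entropy(probabilities):
    """Shannon entropy in bits: the sum of -p * log2(p), skipping p = 0."""
    return sum(-p * math.log2(p) for p in probabilities if p > 0)


fair = entropy([0.5, 0.5])        # p(H) = p(T) = .5
biased = entropy([0.75, 0.25])    # p(H) = .75, p(T) = .25
certain = entropy([1.0, 0.0])     # a coin that always falls heads
print(fair, biased, certain)
```

The fair coin gives the largest value, the biased coin a smaller one, and a coin whose outcome is certain gives zero, in line with the ordering described in the text.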

The term information in information theory is identified with the concept of 
entropy and so has a meaning that differs somewhat from ordinary usage. The 
term is not entirely unjustified since a system with little entropy has highly 
predictable states and the occurrence of any particular state is therefore not 
very ‘informative.’ The use of the term, information, may also be justified in 
another manner. The unit of entropy measure, the bit, may be defined as the 
amount of information needed to specify one of two classes of equally probable 
events. In the case of our ‘fair’ coin, H and T constitute two equiprobable 
classes of events so that we would need only one bit of information to determine 
if H or T has occurred. Now let us consider a system consisting of two such 
coins whose states are independent. Here, we would require one bit of informa- 
tion to specify the state of each coin so that two bits of information would be 
required to specify the state of the total system (i.e., the particular combination 
of positions, HH, HT, TH, TT, assumed by the coins).

If our system consists of m fair and independent coins, then we would require
m bits of information to specify the state of the total system. Since such a
system can assume $k = 2^m$ states, the amount of information needed to specify
the state of the system is $\log_2 k$.¹³ Since more information is required to
specify the state of a system as the number of states of the system increases,
assuming that the states of the system are equiprobable and the states of
sub-systems are independent, it is apparent that this amount of information
grows with our uncertainty of predicting states of the system. Hence, amount of
information may be regarded as equal to the entropy of the system. If a system
has k states, its maximal entropy, $H_{max}$, is given by the equation:
$H_{max} = \log_2 k$. If the states of a system can be divided into m pairs of
subclasses, then it is apparent that the system has maximal entropy when these
subclasses are independent and equiprobable and the amount of this entropy may
be determined by the same reasoning as was applied above to the system
consisting of m coins. If k is not an integral power of 2, then the
applicability of the argument above is not obvious but it suffices to say that
$H_{max}$ would then be the average amount of information needed to specify the
state of the system if these states were equiprobable and the subsystems were
independent. More rigorous mathematical treatments of such considerations may
be found in Fano (3) and Shannon (18).
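The relation between the number of equiprobable states and the number of bits can be checked numerically. The following is only a minimal sketch: the function name `bits_to_specify` and the six-state example are illustrative assumptions, not part of the original discussion.

```python
import math

def bits_to_specify(k):
    """Bits needed to single out one of k equiprobable states: log2(k)."""
    return math.log2(k)

# Systems of m fair, independent coins have k = 2**m states and need m bits.
for m in (1, 2, 10):
    k = 2 ** m
    print(m, "coins:", k, "states,", bits_to_specify(k), "bits")

# A hypothetical system with six equiprobable states (k is not a power of 2):
# the result is a fractional number of bits, read as an average amount.
print("6 states:", bits_to_specify(6), "bits")
```

For k = 6 the value falls between 2 and 3 bits, which is the sense in which the maximal entropy is an average amount of information when k is not an integral power of 2.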

13 A logarithm (abbreviated ‘log’) is most simply defined as an exponent. In
mathematical symbolism: if $x^y = z$, then $\log_x z = y$, by definition, and x
is the base of the logarithm. A base of 2 is most widely used in information
theory. The examples below, using logs to the base 2, may make the concept of a
logarithm clearer.

$2^0 = 1$; therefore, $\log_2 1 = 0$        $2^3 = 8$; therefore, $\log_2 8 = 3$
$2^1 = 2$; therefore, $\log_2 2 = 1$        $2^4 = 16$; therefore, $\log_2 16 = 4$
$2^2 = 4$; therefore, $\log_2 4 = 2$        $2^5 = 32$; therefore, $\log_2 32 = 5$

Logs of numbers which are not integral powers of 2 can be readily obtained from
a table of base 2 logs such as that of Dolansky (2). Since logs to any base are
proportional to logs to any other base, we may convert base 10 logs to base 2
logs by the formula $\log_2 x = (\log_2 10)\,\log_{10} x = 3.3219\,\log_{10} x$.

Logs to any base have the properties indicated in the three equations below.
The base is not indicated but is assumed to be the same in all cases:

$$\log (xy) = \log x + \log y;$$
$$\log (x/y) = \log x - \log y;$$
$$\log x^y = y \log x.$$

Let i be any event, or state of a system, in the set of events, I, and let p(i)
be its probability. If p(i) equals a particular value, say 1/a, we can regard
this statement as equivalent to stating that i falls into one of a equiprobable
classes. Hence, there will be $\log_2 a$ bits of information needed to specify
i. Let h(i) be this amount of information, so
$h(i) = \log_2 a = \log_2 [1/p(i)] = -\log_2 p(i)$. We may express the average,
$\bar{X}$, of a sample of numbers, as

$$\bar{X} = \frac{\sum x\, f(x)}{n}$$

where x is the numerical value of any class of members in the sample, f(x) is
the frequency of that class, and n is the sample size. In other words, the
average (or arithmetical mean) is the sum ($\sum$) of the values found in the
sample, each weighted by its proportional frequency. Now, we have previously
defined our probability estimate as

$$p(x) = \frac{f(x)}{n},$$

so that

$$\bar{X} = \sum x\, p(x).$$

We may define the entropy of a system H(I), with a set of states, I, as the
average amount of entropy associated with its states. Thus,

$$H(I) = \sum_i h(i)\, p(i) = -\sum_i p(i) \log_2 p(i).$$

The second form of this equation is the expression usually given as the measure
of the entropy of a system.

The measure, H(I), has the following characteristics, all of which are in 
keeping with our intuitive feelings about the notion of entropy or uncertainty: 

(a) If one p(i) = 1 and all others are zero, then H(I) = 0. In other words,
if one state always occurs, then the behavior of our system is completely
predictable and its entropy is zero.

(b) If a system consists of k independent sub-systems, each with entropy
H(I), then the entropy of the total system is k H(I). This theorem follows
from the same sort of argument that was applied to the system consisting of
k coins. This characteristic is one of the justifications for using a
logarithmic measure.

(c) If a system has m equiprobable states, then $H(I) = H_{max} = \log_2 m$.
It is apparent that H(I) will approach $H_{max}$ as the set of p(i) approaches
equiprobability.

(d) If a system has m equiprobable states, then H(I) increases when m
increases. Because of this last characteristic, it is desirable to have a
measure which may be used to compare systems with different numbers of states.
This measure, $H_{rel}(I)$, relative entropy, is

$$H_{rel}(I) = \frac{H(I)}{H_{max}} = \frac{H(I)}{\log_2 m}.$$

$H_{rel}(I)$ is zero when H(I) is zero and equals one when H(I) equals $H_{max}$.
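Characteristics (a) through (d) can be checked on the coin examples used earlier. The sketch below is illustrative only: the function names `entropy` and `relative_entropy` are assumptions introduced here, and "relative entropy" is used in this text's sense of H(I) divided by its maximum.

```python
import math

def entropy(probs):
    """H(I) = -sum of p(i) * log2 p(i), in bits; states with p = 0 add nothing."""
    return sum(-p * math.log2(p) for p in probs if p > 0)

def relative_entropy(probs):
    """H(I) / H_max with H_max = log2(m); requires at least two states."""
    return entropy(probs) / math.log2(len(probs))

fair = [0.5, 0.5]       # p(H) = p(T) = .5
biased = [0.75, 0.25]   # p(H) = .75, p(T) = .25
certain = [1.0, 0.0]    # one state always occurs

print("fair coin:   ", entropy(fair))       # maximal for two states
print("biased coin: ", entropy(biased))     # below the maximum
print("certain:     ", entropy(certain))    # characteristic (a)

# Characteristic (b): two independent fair coins form a four-state system
# whose entropy is twice that of a single coin.
two_coins = [p * q for p in fair for q in fair]
print("two coins:   ", entropy(two_coins))

# Characteristic (d): the relative measure is comparable across systems
# with different numbers of states.
print("relative, biased coin:", relative_entropy(biased))
print("relative, two coins:  ", relative_entropy(two_coins))
```

Dividing by the logarithm of the number of states puts every system on the same scale from zero to one, whatever the size of its set of states.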

A more complex situation is that in which we deal with associated pairs of 
events. In applications of information theory these pairs of events fall into two 
main classes: (a) Pairs of events in antecedent (input) and subsequent (output) 
systems, e.g., the stimulus and the response of behavioristic psychology, the
speech sound and the hearer's interpretation; (b) pairs of antecedent and
subsequent events in the same system, e.g., sequences of responses, sequences
of phonemes or morphemes. The main value of information theory to linguistic
and psycholinguistic problems lies in the application of entropy measures
suitable for such situations. In general, these measures indicate how much effect
the pattern of antecedent events has on the occurrence of subsequent events, 
and hence they indicate the degree to which sequences of such events are struc- 
tured (i.e., non-random). 

Let I be a set of antecedent or input states and let J be a set of subsequent
or output states and let i and j, respectively, be any member of these sets. We
may compute H(I) and H(J) for these sets as shown above. Let I, J be a set
of associated pairs of states, let i, j be any member of this set, and let
p(i, j) be its probability. For every antecedent or input state, i, there will
be a conditional distribution of associated j's. We may apply the measure of
entropy developed above to these distributions and obtain

$$H_i(J) = -\sum_j p_i(j) \log_2 p_i(j)$$

where $p_i(j)$ is the probability of the subsequent or output state, j, when
the associated i has occurred. We may define the conditional entropy, $H_I(J)$,
of the set I, J as the average amount of entropy associated with these
conditional distributions.

$$H_I(J) = -\sum_i p(i) \sum_j p_i(j) \log_2 p_i(j)
         = -\sum_{i,j} p(i, j) \log_2 p_i(j).$$

In effect, this measure weights the entropy of the conditional distribution of
each i by the value of p(i).

$H_I(J)$ has the following characteristics:

(a) If one and only one j occurs with every i, i.e., if for every i, one
$p_i(j) = 1$ and all others are zero, then $H_I(J) = 0$. We have already found
that H(I) = 0 if but one state of the system occurs. Similarly, all
$H_i(J) = 0$ if but one j occurs with every i and their average, $H_I(J)$, will
also be zero.

(b) If the set J is independent of the set I, i.e., if all $p_i(j) = p(j)$ and
p(i, j) = p(i)p(j), then $H_I(J) = H(J)$. This theorem follows directly from
the definition of independence, i.e., that all $p_i(j) = p(j)$, and from the
definition of $H_I(J)$.

Because of these two characteristics, conditional entropy, $H_I(J)$, is used to
measure the amount of random error or ‘noise’ in a communication channel,
where we are concerned with pairs of input and output events. If output events
are independent of input events, and the distribution of output events is the
same regardless of the input events, then ‘noise’ is maximal and $H_I(J)$ is
equal to H(J). If a particular input event always produces a particular output
event, then the system is ‘noiseless’ and $H_I(J)$ equals 0. There is no
requirement here that output events correspond in any way, or be related to,
input events; for example, a ‘scrambler’ such as is used in trans-oceanic
telegraph communication, which reliably changes the input sounds into some
arbitrary but predictable pattern of electrical energy, is a ‘noiseless’
channel. Thus it can be seen that $H_I(J)$ is a measure of random error and not
systematic error. It is possible to devise measures of systematic error, e.g.,
measures of the validity of signal transmission rather than reliability of the
transmission,¹⁴ but they do not derive readily from entropy estimates. The
amount of information transmitted, $I_t$, is defined as the amount of
information in the output or subsequent system minus the ‘information’
contributed by noise: $I_t = H(J) - H_I(J)$.
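The behavior of conditional entropy as a measure of ‘noise’ can be illustrated with three small channels. This is a sketch under stated assumptions: the joint probability tables and the function names are hypothetical, chosen only to reproduce the noiseless ‘scrambler’ case, the fully independent case, and one intermediate case.

```python
import math

def entropy(probs):
    """Entropy in bits of a list of probabilities."""
    return sum(-p * math.log2(p) for p in probs if p > 0)

def conditional_entropy(joint):
    """Conditional entropy of outputs given inputs, from joint[i][j] = p(i, j).

    Computes -sum over (i, j) of p(i, j) * log2 p_i(j), with p_i(j) = p(i, j) / p(i).
    """
    total = 0.0
    for row in joint:
        p_i = sum(row)
        for p_ij in row:
            if p_ij > 0:
                total -= p_ij * math.log2(p_ij / p_i)
    return total

def transmitted(joint):
    """Output entropy minus the conditional entropy ('noise')."""
    p_j = [sum(col) for col in zip(*joint)]
    return entropy(p_j) - conditional_entropy(joint)

# Hypothetical two-input, two-output channels (rows are inputs, columns outputs).
scrambler = [[0.0, 0.5], [0.5, 0.0]]         # each input always yields the 'other' output
independent = [[0.25, 0.25], [0.25, 0.25]]   # output unrelated to input
noisy = [[0.4, 0.1], [0.1, 0.4]]             # output usually, not always, follows input

for name, table in [("scrambler", scrambler),
                    ("independent", independent),
                    ("noisy", noisy)]:
    print(name, "noise:", round(conditional_entropy(table), 4),
          "transmitted:", round(transmitted(table), 4))
```

The scrambler shows no noise even though no output matches its input, which is the point made above about random as opposed to systematic error.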

Conditional entropy, $H_I(J)$, is also of value in measuring redundancy in
sequences of states of the same system. If a particular antecedent state always
occurs prior to a particular subsequent state, the sequence is completely
redundant, and $H_I(J)$ equals 0. If the antecedent states are independent of
the subsequent states, then there is no redundancy and $H_I(J) = H(J)$. There
is no reason to suppose that a subsequent state should be dependent on the
single antecedent state only. Rather, it is quite conceivable that this
dependency could extend for sequences of several states. In determining the
extent of this dependency, we redefine I as the class of all possible sequences
of r antecedent states, where r = 1, 2, 3, 4, 5, ..., and J is the class of all
subsequent states as before. Under these conditions, $H_I(J)$ has an additional
characteristic.

(c) If all sequences of length s are independent of all such prior sequences in
a longer sequence of states, then $H_I(J)$ will approach its minimum value as r
approaches s and will remain at that value for larger values of r. This
characteristic is actually a generalization of characteristic (b) and is based
on essentially the same line of reasoning.

Characteristic (c) permits us to use entropy measures to determine the size
of the ‘structured’ (i.e., non-random) sequences in any longer sequence of
message states: e.g., a linguistic text transcribed phonetically, phonemically,
or morphemically. For example, if $H_I(J)$ were computed for the example of a
Markov process given above, it should reach its minimum for r = 3. However,
it is not always practical to use any but small values of r (usually less than
10) because of the difficulty of tabulation and the large sample required to
obtain adequate estimates of the probabilities of such a large number of
sequences. For sequences consisting of m different states and r units long the
number of possible sequences is $m^r$; e.g., for m = 2 and r = 10,
$m^r = 1024$. An alternative approach has been to determine the conditional
entropy of pairs of states r units apart in a sequence (15).
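Characteristic (c) can be illustrated by estimating the conditional entropy of the next state given the r preceding states. The following is only a sketch: the repeating sample sequence is a hypothetical stand-in (not the Markov example referred to above), and the function name is an assumption.

```python
import math
from collections import Counter

def conditional_entropy_order(seq, r):
    """Estimate the entropy of the next state given the r preceding states.

    Antecedents are blocks of r consecutive states; probabilities are
    estimated from relative frequencies in the sample itself.
    """
    pair_counts = Counter()
    context_counts = Counter()
    for k in range(r, len(seq)):
        context = seq[k - r:k]
        pair_counts[(context, seq[k])] += 1
        context_counts[context] += 1
    n = sum(pair_counts.values())
    h = 0.0
    for (context, _next_state), count in pair_counts.items():
        h -= (count / n) * math.log2(count / context_counts[context])
    return h

# Hypothetical sample: a two-state sequence in which every state is fully
# determined by the two states that precede it.
sample = "AAB" * 200

for r in range(5):
    print("r =", r, " estimate:", round(conditional_entropy_order(sample, r), 4))

# The tabulation problem: with m states there are m**r possible blocks.
print("possible blocks for m = 2, r = 10:", 2 ** 10)
```

In this sample the estimate falls as r increases, reaches zero at r = 2, and stays there, which is the plateau described in characteristic (c). With real text the estimates for larger r rest on ever fewer observations per block, so they become unreliable well before the tabulation becomes impossible.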

In order to compare two systems with differing numbers of states, it is useful
to have a measure of relative conditional entropy, $H_{I\,rel}(J)$:

$$H_{I\,rel}(J) = \frac{H_I(J)}{H(J)}.$$

This measure will vary from a minimum of zero when $H_I(J) = 0$ and each
antecedent state is associated with only one subsequent state to a maximum
of 1.00 when $H_I(J) = H(J)$ and the subsequent states are independent of the
antecedent states. A useful measure of redundancy, R, may be obtained by
subtracting $H_{I\,rel}(J)$ from 1; $R = 1 - H_{I\,rel}(J)$. This measure will
vary from a maximum of 1.00 when $H_I(J) = 0$ to a minimum of zero when
$H_I(J) = H(J)$.

14 Measures termed ‘fidelity’ and ‘communication,’ based on the proportion of
total transmission which involves corresponding states of antecedent and
subsequent systems, have been described by Osgood and Wilson in a mimeographed
paper.
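A short worked example may make the two measures concrete. The figures are hypothetical: a two-state sequence in which both states are equally frequent, so that H(J) = 1 bit, and in which each state is followed by the same state with probability .8 and by the other state with probability .2.

$$H_I(J) = -(.8 \log_2 .8 + .2 \log_2 .2) \approx .8(.3219) + .2(2.3219) \approx .722 \text{ bits}$$

$$H_{I\,rel}(J) = \frac{H_I(J)}{H(J)} \approx \frac{.722}{1} = .722, \qquad R = 1 - H_{I\,rel}(J) \approx .278$$

Under these assumed probabilities, knowledge of the antecedent state removes somewhat more than a quarter of the uncertainty about the subsequent state.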

The measure of joint entropy, H(I, J), is closely related to the entropy
measures discussed above. Just as we have the measure H(I) defined for a class
of single states I, with $H(I) = -\sum_i p(i) \log_2 p(i)$, we have the measure
H(I, J) defined for a class of pairs of states, I, J, with
$H(I, J) = -\sum_{i,j} p(i, j) \log_2 p(i, j)$.
It will then be true of H(I, J) that $H(I, J) = H(I) + H_I(J)$. The proof of
this theorem is based on the analogous relation, $p(i, j) = p(i)\,p_i(j)$, as
derived earlier.

In computing the entropy measures described above it is often convenient
to prepare a table of p(i, j)'s of the form illustrated in Figure 10. The
values of p(i) and p(j) in the margins of the table may be obtained by simply
adding across appropriate rows or columns. H(I) and H(J) may easily be obtained
from these marginal figures by simply adding appropriate values of
$-p \log_2 p$ which may be found in Newman’s (15) or Dolansky's (2) tables.
H(I, J) may be computed by carrying out the same operation on the values of
p(i, j) in the main body of the table. $H_I(J)$ and $H_J(I)$ may be directly
obtained by using the equations below.

$$H_I(J) = H(I, J) - H(I)$$
$$H_J(I) = H(I, J) - H(J)$$

[Figure 10. Table of joint probabilities p(i, j), with the marginal
probabilities p(i) and p(j) entered in its margins.]


These computational procedures are best used when entropy measures are 
being applied to pairs of input and output events or to sequences of no more 
than two events. The computational method described by Newman (15) is 
more suitable for longer sequences. 
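The tabular procedure just described can be sketched in a few lines. The table of joint probabilities below is hypothetical, and the variable names are assumptions made for the illustration; the margins are obtained by adding across rows and columns, and the conditional entropies by subtraction.

```python
import math

def entropy(probs):
    """Sum of -p * log2 p over a collection of probabilities, in bits."""
    return sum(-p * math.log2(p) for p in probs if p > 0)

# Hypothetical table: rows are antecedent (input) states i,
# columns are subsequent (output) states j, cells are p(i, j).
table = [
    [0.30, 0.10, 0.00],
    [0.05, 0.25, 0.30],
]

p_i = [sum(row) for row in table]            # row margins
p_j = [sum(col) for col in zip(*table)]      # column margins

H_I = entropy(p_i)
H_J = entropy(p_j)
H_IJ = entropy([p for row in table for p in row])   # joint entropy

H_J_given_I = H_IJ - H_I    # conditional entropy of J given I
H_I_given_J = H_IJ - H_J    # conditional entropy of I given J

print("H(I)    =", round(H_I, 4))
print("H(J)    =", round(H_J, 4))
print("H(I, J) =", round(H_IJ, 4))
print("conditional entropy of J given I =", round(H_J_given_I, 4))
print("conditional entropy of I given J =", round(H_I_given_J, 4))
```

Working from the joint table means every measure is a sum of the same kind of term, which is why a printed table of such terms was sufficient for hand computation.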

Since most of information theory has been developed to deal with problems
of electronic communications systems, only those aspects which seem most
applicable to language research have been included above. All discussion of
entropy measures for continuous data has been omitted, for example. However,
the concept of channel capacity seems to be of potential value to language
research. Shannon (19) defines channel capacity, C, essentially, as the maximum
rate (in bits per unit time) at which a communications channel can transmit
information. His fundamental theorem for a channel with noise states, in effect,
that for rates of transmission of less than C it is possible to make $H_I(J)$
(i.e., ‘noise’) as small as desired by coding information in some ‘optimal’
fashion but that for rates of transmission greater than C, we can never
decrease $H_I(J)$ below the amount by which the rate of transmission exceeds C.
In other words, error can be reduced as much as desired for transmission rates
less than C but increases linearly for rates greater than C. If we regard the
human organism as a communications channel and responses as output states, this
theorem seems of great theoretical and practical interest to students of human
communication.


2.3.3. Some Applications of Information Theory 


Two main classes of application of entropy measures were indicated briefly
in section 2.3.2. At this point new symbols for sets of states will be
substituted for the general I and J symbols in order to more sharply distinguish
the measures used in these two types of situation.

(a) Conditional Relations between Systems. In many cases we are interested
in describing the degree to which events in one antecedent system influence
events in another subsequent system to which it is directly or indirectly
coupled. We may symbolize the class of events in the antecedent system as I
(input) and the class of events in the subsequent system as O (output). Here
$H_I(O)$ may measure the degree of ‘noise’ in the communication channel between
the two systems or the randomness introduced by the channel. If I were a class
of stimuli and O were a class of responses, $H_I(O)$ would indicate the average
randomness of response tendencies to these stimuli. Conversely,
$H(O) - H_I(O)$ would index the dependency of output upon input events, e.g.,
the lack of randomness in the channel.

Nearly all studies involving measurement of information transmission have
closely followed the familiar communication channel model developed by
Shannon (19). The states of the input are usually simple stimuli such as
alternative light patterns or spoken commands. The experimental subjects are
treated as communication channels and their responses to the alternative
stimuli are treated as output states. The conditional entropy of responses to
stimuli is the usual measure. Garner and Hake (5) describe the basic
methodology of such studies and have written a later series of experimental
reports. Since, on the content side, these studies are mainly of interest to
those concerned with perception and the design of control system displays, they
will not be discussed any further here. However, it should be noted that such
methods can be used in psycholinguistic studies where the subject makes an
immediate overt response to the message stimuli, i.e., in situations where the
message stimuli serve as ‘signals’ rather than ‘symbols.’

15 Research proposals relating to the empirical determination of channel
capacities in human language behavior are suggested in section 5.5.

In situations where the message stimuli serve as ‘symbols,’ their effect is
generally to change response tendencies in some later situation. For reasons 
discussed in another portion of this report (section 7), the conventional methods 
of measuring information transmission do not seem suitable. Rather, the reduc- 
tion of the conditional entropy of responses in some extra-message situation 
seems to be a more appropriate measure. Bendig's interesting experimental 
study (1) is the only one to date which has used this type of measurement. 

There is one important caution which should be observed in the application 
of entropy measures to measurement of information transmission. The meaning 
of the entropy measures is obviously confounded if the probability of the events 
we are considering changes during the period of measurement—i.e., if these 
events cannot be regarded as a stochastic process. It is apparent that such 
changes will occur if any learning occurs and affects response tendencies during 
the period of measurement. The contaminating effects of learning may be 
avoided in any of the three following ways: (I) using groups of similar subjects 
for short periods of measurement rather than single or small groups of subjects 
for extended periods of measurement; (II) making several measurements over 
relatively short periods during the learning process or before and after learning; 
and (III) using responses which are relatively well learned and which can safely
be assumed to be unaffected by learning during the period of measurement.
Bendig (1) has used a combination of methods (I) and (II) while Garner and
Hake (5) have used method (III) in their experimental studies.

(b) Transitional relations within systems. In other cases we are interested in 
describing the extent to which antecedent events in a system influence sub- 
sequent events in the same system. We may symbolize the class of antecedent 
events as A and the class of subsequent events as S. Here $H_A(S)$ indicates the
degree to which on the average particular subsequent events are independent
of particular antecedent events, i.e., the degree of randomness in the
sequencing. Conversely, $H(S) - H_A(S)$ indexes the degree to which antecedent
events predict or lead to subsequent events, i.e., the redundancy in the
sequencing. If A is a class of antecedent phonemes and S is a class of
subsequent phonemes, $H(S) - H_A(S)$ indicates the degree to which sequences of
these phonemes are structured or organized.

Miller and Frick (11) made an early application of a measure of [...]
redundancy to measure response stereotypy in learning situations. Newman
(16) has analysed the entropy of vowels and consonants in sequences of
orthography in a number of languages. Shannon (18) and Newman and Gerstman
(17) have examined entropy relations in sequences of English orthography.

Shannon (19) and Miller (9) discuss the interesting concept of the order of
approximation to the statistical structure of English orthography: a zero-order
approximation consists of sequences generated from the assumption that [...]
have equal probability of occurrence; a first-order approximation is a se[...]

[...] Miller and Selfridge (13). The latter investigators have found that the
re[...] of such sequences after rote learning is directly related to the order
of approximation, sequences of higher order approximation being more easily
retained. It is perhaps unfortunate that so much attention has been devoted to
orthography and so little to spoken language in these studies; it is difficult
to [...] the results of these researches to linguistic theory. The studies cited
above demonstrate the potentialities of these relatively new techniques of
measurement, and proposals for further study of entropy relations with special
reference to linguistic structure are given elsewhere in this report
(particularly section [...]).


2.3.4. Some Limiting Considerations 


Information theory concepts and measures are particularly liable to
misinterpretation and misapplication. For one thing, the term, amount of
information, has been used to mean amount of entropy in both situations (a) and
[...]


message, i.e., an ‘informative’ message is so because it is indicative of some
other state of affairs. Thus, we regard language messages with external
referential meaning as ‘informative’ but arbitrarily selected sequences, such as
numbers or nonsense syllables, as ‘uninformative.’ When considering sequences
of message events we can only measure the degree of randomness, not relation
to external events. In this case, entropy measures only indicate how many
binary decisions we need to make in order to predict subsequent message events,
not how much ‘information’ in the referential sense the message contains. On
the other hand, when we are considering pairs of input and output events in
the channel connecting different systems, we can determine the relation of
events in the message output to the external events of the input. Also, the
entropy produced by the channel, $H_I(J)$ or ‘noise,’ is distinguishable from
that attributable to the informational content of the input. Only in this case
is it possible to measure the amount of information, in the referential sense,
[...] message by the use of entropy measures, and thus allow the term
‘information’ to retain something like its conventional meaning.

The distinction made above may be clarified by considering the following
example. Suppose that a ‘mechanical oracle’ has been constructed which answers
any inquiry with a set of randomly selected words such that the choice of words 
is independent of the inquiry and any previously selected words. The entropy 
within the sequences of the answer is high due to the absence of redundancy, 
and the conditional entropy of the answers given the inquiries is also high due 
to the independence of the answers and the inquiries. Thus, we may have high
sequential entropy and low information transmission (due to the high conditional 
entropy) at the same time. This example demonstrates that the two types of 
measures are sensitive to different aspects of messages and that we should not 
indiscriminately equate the amount of entropy with the amount of information 
in a message if the term information is to retain anything like its conventional 
meaning. To date, nearly all of the applications of information theory to psycho- 
linguistic problems have been concerned with the measurement of entropy of 
single message events or sequences of events rather than with the measurement 
of referential information in messages. This is probably due to the greater ease 
of direct application of entropy measures in the former situations. 
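The ‘mechanical oracle’ can be worked through with exact probabilities. The sketch below makes several assumptions that are not in the original: three inquiries with arbitrary probabilities, a vocabulary of four equally likely words, and illustrative variable names. Independence is built in by taking every joint probability as a product.

```python
import math

def entropy(probs):
    """Entropy in bits of a collection of probabilities."""
    return sum(-p * math.log2(p) for p in probs if p > 0)

# Hypothetical inquiries and their probabilities of being asked.
inquiries = {"inquiry 1": 0.5, "inquiry 2": 0.3, "inquiry 3": 0.2}
# Hypothetical vocabulary; the oracle picks each word with equal probability.
words = {"w1": 0.25, "w2": 0.25, "w3": 0.25, "w4": 0.25}

# Between systems: answer words are independent of the inquiry.
joint_io = [p_q * p_w for p_q in inquiries.values() for p_w in words.values()]
H_input = entropy(inquiries.values())
H_output = entropy(words.values())
noise = entropy(joint_io) - H_input          # conditional entropy of answers
transmitted = H_output - noise

# Within the answer: each word is independent of the word before it.
joint_pairs = [p_a * p_s for p_a in words.values() for p_s in words.values()]
sequential = entropy(joint_pairs) - entropy(words.values())

print("entropy of answer words:        ", H_output)
print("sequential conditional entropy: ", sequential)
print("conditional entropy given query:", round(noise, 6))
print("information transmitted:        ", round(transmitted, 6))
```

Under these assumptions the sequential entropy is at its maximum of two bits per word while the information transmitted about the inquiry is zero, so the two kinds of measure answer different questions about the same messages.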

Another frequent misconception probably stems from the term ‘information
theory.’ As the previous discussion has indicated, the chief contribution made 
by this ‘theory’ to the study of language is a set of descriptive measures and a 
unit, the bit, which are much more broadly applicable than to language processes 
themselves. It serves chiefly, therefore, as a quantitative tool for describing 
language processes. It is not a theory of information in the usual sense, nor does 
it provide us with a theoretical model which can provide hypotheses about or 
explain the phenomena of human language communication. 

Two limitations of a statistical nature must also be mentioned. In the first 
place, these entropy measures are as yet of little value in hypothesis testing and 
statistical inference. This is because so little is known about their sampling dis- 
tributions. However, a recent paper by Miller and Madow (12) provides a 
valuable initial step in the derivation of these distributions. It is also possible 
to test hypotheses concerning the probabilities on which the entropy measures 
are based by using the well-known Chi-square test. Secondly, entropy measures 
take no account of similarity among states of a system. Suppose we are studying 
communication via facial expressions and use $H_I(O)$ as a measure of the degree
to which observer judgments (O) are dependent upon actor intentions (I).
Within the limited number of judgmental categories provided, conditional 
entropy measures the degree of uncertainty of judgments made in response to 
facial poses, but it does not reflect similarity or clustering among judgments since 
each alternative state is treated as unique. It is possible to reclassify output 
states on some similarity basis, but procedures for doing this do not involve 
entropy estimates. 
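The Chi-square test mentioned above operates on the frequencies from which the probabilities are estimated. The sketch below computes only the Pearson statistic and its degrees of freedom for a test of independence; the counts are hypothetical (two intended expressions by two judgment categories), the function name is an assumption, and every row and column is assumed to have a non-zero total.

```python
def chi_square_independence(counts):
    """Pearson chi-square statistic and degrees of freedom for a count table.

    counts[r][c] is the observed frequency of input category r paired with
    output category c. Expected frequencies assume independence.
    """
    row_totals = [sum(row) for row in counts]
    col_totals = [sum(col) for col in zip(*counts)]
    n = sum(row_totals)
    chi2 = 0.0
    for r, row in enumerate(counts):
        for c, observed in enumerate(row):
            expected = row_totals[r] * col_totals[c] / n
            chi2 += (observed - expected) ** 2 / expected
    df = (len(row_totals) - 1) * (len(col_totals) - 1)
    return chi2, df

# Hypothetical frequencies: rows are intended expressions, columns are judgments.
counts = [
    [30, 10],
    [10, 30],
]

chi2, df = chi_square_independence(counts)
print("chi-square =", chi2, "with", df, "degree(s) of freedom")
```

The statistic would then be referred to a Chi-square table with the stated degrees of freedom; a large value indicates that the judgments are not independent of the intentions, which is the same dependence that the entropy measures describe without a significance test.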


2.3.5. Bibliography 


1. Bendig, A., 1953, Twenty questions: an information analysis, 
Psychology 46. 345-348. å 
An experimental study of the reduction o! 


Journal of Experimental 
f response entropy (i.e., amount of informa- 


48 


10. 


11. 


12. 


PSYCHOLINGUISTICS: A SURVEY OF THEORY AND RESEARCH PROBLEMS 


tion transmitted) by each of a series of statements in a modified game of Twenty Ques- 
tions. 


. Dolansky, L. and M. P. Dolansky, 1952, T'able of log 1/p, p logs 1/p, and p log, + 


(1 — p) log: (1 — p), Research Laboratory of Electronics, Massachusetts Institute of 
Technology, Technical Report no. 277, Cambridge. 

Use of this table reduces computation of entropy measures to the addition of a set 
of numbers. 


. Fano, R. M., 1949-50, The transmission of information: I and II. Research Laboratory 


of Electronics, Massachusetts Institute of Technology, Technical reports Nos. 65 and 
149, Cambridge. 

The first of these papers covers much the same ground as Shannon and Weaver, but 
the basic theorems are developed in a different way. The second paper contains a valu- 
able discussion of the problem of coding information for transmission through a noisy 
channel. 


. Feller, W., 1950, Introduction to probability theory and its applications. 


This is one of the best available treatments of probability theory, and it contains 
an excellent discussion of Markov processes. 


. Garner, W. H. and H. W. Hake, 1951, The amount of information in absolute judg- 


ments, Psychological Review 58. 446-59. 

A valuable description of application of measures of joint and conditional entropy. 
to a situation where J is a set of stimuli and J is a set of responses. The computational 
illustrations are also of value. 


. Hockett, C. F., 1953, Review of Shannon and Weaver: The mathematical theory of 


communication, Language 29. 69-92. 

This paper is actually much more than a review. It is rather an introduction to In- 
formation Theory for linguists and contains an excellent discussion of potential ap- 
plications to linguistic problems. 


. Jakobson, R., H. Halle and E. C. Cherry, 1953, Towards the logical description of 


languages in their phonemic aspect, Language 29. 34-46. 
The entropy of the set of Russian phonemes is estimated and the problem of optimal 
coding in terms of a set of binary distinctive features is discussed. 


. Miller, G. A., 1950, Language engineering, Journal of the Acoustic Society of America 


22. 720-25. 

The problem of designing a language for special purposes is discussed and the rela- 
tion between information theory variables and results obtained in intelligibility ex- 
periments is discussed. 


. Miller, G. A., 1951, Language and communication, New York. 


The chapter titled “The Statistical Approach" contains some interesting material 
on frequency counts of English phonemes, letters and words, and some examples of 
various degrees of ‘statistical approximation’ to English. 

Miller, G. A., 1953, What is information measurement?, American Psychologist 8, 3-11. 
A good elementary and non-technical introduction and contains an excellent an- 

notated bibliography. 

Miller, G. A., and F. C. Frick, 1949, Statistical behavioristics and sequences of re- 
sponses, Psychological Review 56. 311-24. 

Contains an elementary development of the theory of measuring sequence 
responses. 

Miller, G. A., and W. G. Madow, On the limiting distribution and asymptotic mo- 
ments of the maximum likelihood estimate of the Shannon-Wiener measure of amount 
of information (Unpublished dittoed manuscript). 

This is an attempt to derive the sampling distributions of entropy measures f 
it requires a good deal of mathematical sophistication in order to follow the deriva- 


tions. EI 


s of 


and 


PSYCHOLINGUISTICS: A SURVEY OF THEORY AND RESEARCH PROBLEMS 49 


13. Miller, G. A., and Selfridge, 1950, Verbal context and the recall of meaningful ma- 
terial, American Journal of Psychology 63. 176-85. 

An experimental study of the rote learning of written material with various orders 
of statistical approximation to English. It was found that the amount recalled was 
directly related to the order of statistical approximation. 

14. Nagel, Ernest, 1939, Principles of the theory of probability, International Encyclopedia 
of Unified Sciences, 1. No. 6, Chicago. 

This work is an excellent review of the development and current theoretical status 
of the concept of probability. The treatment is mainly logical rather than mathe- 
matically formal and includes a discussion of some of the more basic theorems of 
probability theory. 

15. Newman, E. B., 1951, Computational methods useful in analyzing series of binary 
data, American Journal of Psychology 64. 252-62. 

Contains a table of —p logs p that is not as complete as Dolansky's but may be 
more readily available. Also, detailed computational illustrations are given for 
H(I, J) for sequences of various lengths, and a valuable recursion relationship for 
obtaining Hy(J) for these sequences is included. There is also a design for an ap- 
paratus to be used in tabulating sequences of binary data? 

16. Newman, E. B., 1951, The pattern of vowels and constants in various languages, 
American Journal of Psychology 64. 369-79. 

The redundancy of constants and vowels in the orthography of eleven languages 
is given. 

17. Newman, E. B. and L. S. Gerstman, 1952, A new method for analyzing printed English, 
Journal of Experimental Psychology 44. 114-25. 

This is an extension of Shannon’s discussion of redundancy in English orthography. 
(See 18 below.) 

18. Shannon, C. E., 1951, Prediction and entropy of printed English, Bell System Technical 
Journal, 30. 50-64. 

The conditional entropy of the rth letter, given the first letter, in sequences of 
printed English letters is computed for various values of r. 

19. Shannon, C. E. and W. Weaver, 1949, The mathematical theory of communication, 
Urbana. 

The section by Shannon contains the derivation of the entropy measures which 
marked the beginning of information theory as a separate area. This is still the most 
basic reference in the field, although the reader who is not mathematically sophisticated 
would be well advised to begin elsewhere. Weaver’s section is essentially a 
restatement of Shannon’s section with a few additional comments and without the 
mathematical proofs. 

20. Wiener, N., 1948, Cybernetics, New York. 

This book contains much of the basis for Shannon’s later development, and the 
chapter on “Time Series, Information and Communication” contains the 
identification of information with entropy. However, both the prose and the mathematics 
are difficult to follow, and there is little apparent continuity in the 
work. An expository chapter titled “Information, Language and Society” is provocative. 
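
The table of −p log₂ p mentioned under entry 15 can also be computed directly rather than looked up. The following is only a minimal sketch, not a reproduction of Newman's or Dolansky's tables; the function names and the step size are assumptions made for illustration.

```python
import math


def neg_p_log2_p(p):
    """Return -p * log2(p), taking the value at p = 0 to be 0."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must lie between 0 and 1")
    return 0.0 if p == 0.0 else -p * math.log2(p)


def entropy(probabilities):
    """Entropy in bits of a discrete distribution: the sum of -p log2 p."""
    return sum(neg_p_log2_p(p) for p in probabilities)


def table(steps=10):
    """Rows of (p, -p log2 p) for p = 0, 1/steps, ..., 1."""
    return [(i / steps, neg_p_log2_p(i / steps)) for i in range(steps + 1)]


for p, value in table():
    print(f"{p:.1f}  {value:.4f}")
```

For a binary source with equally likely alternatives, `entropy([0.5, 0.5])` gives one bit, which is the maximum for two alternatives.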


3. PSYCHOLINGUISTIC UNITS 


The linguist is in a relatively fortunate position as compared with other social 
scientists in being able to analyse his raw data (the sound materials that constitute 
spoken messages) into discrete units. Virtually all schools of linguistics 
are in agreement as to the two fundamental building blocks of all natural lan- 
guages, the phoneme and the morpheme. About lesser as well as more compre- 
hensive units—the distinctive feature at one end of the spectrum and the con- 
structive feature at the other—there is far less agreement and indeed much 
controversy. 

In the early days of modern linguistics much was written about the psycho- 
logical reality of the phoneme. Much of this was purely speculative, the evident 
futility of which led to the abandonment of this problem in favor of purely 
descriptive investigation. Now, in the framework of psycholinguistics, it seems 
worthwhile to reopen the question in an atmosphere of frank experimentalism. 
Are the fundamental linguistic units, the phoneme and the morpheme, also the 
‘natural’ units of decoding and encoding? In the process of decoding, a listener 
or reader can be thought of as making a series of decisions (significances) in terms 
of input signals; similarly, in the process of encoding, a speaker or writer makes 
a series of decisions (intentions) in terms of which he produces output signals. 
What segments of the message correspond to these non-linguistic events in de- 
coder and encoder? Are the units which characterize decoding necessarily appro- 
priate for encoding? In this section we try to clarify the nature of this problem 
and to suggest some research procedures that might lead to definitive answers— 
the answers themselves are probably more matters of empirical than of logical 
decision. 

The first question we ask is a strictly psychological one—what are the mecha- 
nisms of unit formation in both perceiving and behaving? The second question we 
ask is whether the basic units with which the linguist operates are merely his con- 
venient and productive fictions or perhaps also have their psychological corre- 
lates. Whatever the answer may turn out to be here, we continue to seek the 
answer to a third question—is it possible that some of the vaguer units linguists 
argue about, such as the syllable, word, and sentence, may turn out actually to 
have psychological relevance and thus lead to sharpening of linguistic analysis 
in addition to mere clarification of ancient conundrums? In what follows an attempt 
is made to abandon a priori methods and avoid circularity; instead, proposals 
for empirical testing of the adequacy of various possible psycholinguistic 
units are made. 


3.1. Psychological Bases of Unit Formation¹⁶ 
Psychologists differ widely among themselves in their conceptions of units. 
That psychology as a science has prospered without resolution of this problem, 
whereas linguistics gave priority to the definition of units, probably reflects a 


16 Susan M. Ervin, Donald E. Walker, and Charles E. Osgood. 


PSYCHOLINGUISTICS: A SURVEY OF THEORY AND RESEARCH PROBLEMS 51 


basic difference in purpose: psychologists are more concerned with interpretation 
and prediction whereas linguists are more concerned with description. Furthermore, 
psychologists do not find their material already formed into discretely 
coded events as is language; sensory and behavioral events, at least on the level 
at which psychologists work, seem to be continuous rather than discrete. So it 
has been possible for psychologists to vary in their definitions of units from the 
minutely molecular (e.g., the stimulus elements and muscle fiber contractions of 
Guthrie and Hull) to the grossly molar (e.g., the purposeful acts of Tolman and 
Lewin). 


On the input side of the equation, gestalt psychologists have been most active in con- 
cern about units; gestalt psychology developed out of perception studies and derives its 
principles from this area. The notion of patterning or ‘structure’ of stimuli is treated as 
given, based on the postulated dynamic properties of the field distribution of physical 
stimuli on receptors and the isomorphic relation of this physical field to psychological 
processes. Units are segregated as self-integrated aspects of the environment which stand 
out as figures on a more or less homogeneous ground. Figures are characterized by shaped 
boundedness (contour), dynamic properties (e.g., obedience to gestalt laws), and constancy. 
Many of the accepted empirical laws of perceptual organization have resulted from ob- 
servations under the gestalt impetus. On the output side of the equation, gestaltists have 
had little to say—appropriate behavior is more or less taken for granted, given adequate 
structuring of the perceptual field. 

Behaviorists, on the other hand, have been particularly concerned with the output 
side (responses) in relation to comparatively unanalysed input (stimuli). They have dealt 
with the learning of responses, their differentiation and discrimination, and their amalga- 
mation into skills. Only recently have behavioristically trained psychologists begun to 
give attention to perceptual organization. Hebb¹⁷ in particular has offered stimulating 
ideas on the organization of input events, as will be discussed below, and Osgood has been 
attempting to relate the significance aspects of perception to semantic processes via a 
general mediation theory. 


3.1.1. Unit Formation in Perceiving (Decoding) 

It is unfortunate for our present purposes that so much of the work on perception 
has been concerned with vision; it would probably be correct to say that over 
90 per cent of the research here has dealt with one aspect or another of vision. 
Most of the work done on audition has dealt with sensory rather than perceptual 
processes. This being the case, we shall have to assume a general analogy between 
visual and auditory modalities. 
3.1.1.1. Phenomena of perceptual organization. It is apparent that for the most 
part responses are not made to unorganized masses of stimuli or to stimuli in 
isolation, but rather to patterns or groups of stimuli. This patterning of stimulus 
input is not dictated by physical properties of the stimuli themselves but is imposed 
upon physical events by both innate and learned properties of the organism. 
The general ways in which the organism imposes an order upon the environment 
can be determined by observing the characteristic phenomena of perception. 

(1) Grouping. Looking about us, we see objects (books, pictures, 
and so on), not conglomerations of color points, i.e., sensory input is organized 


17 D. O. Hebb, The organization of behavior (1949). 

into wholes by perceptual processes. Similarly, when we listen to speech, we hear 
significant signals, words, not conglomerations of sound. This distinction is particularly 
clear when listening to one's own language as compared to an unknown 
language—the former ‘breaks’ readily into pieces while the latter does not, these 
pieces typically corresponding to meaningful units. What factors in stimulus 
events facilitate grouping? (a) Nearness (in time or space). Those constellations 
of star-points in the heavens which receive labels are nearly always near together 
in visual space; it is almost trite to say that nearness in time operates to determine 
units in messages—the longer the pause between two speech events, the less likely 
they are to belong to the same unit. Similarly, nearness in time is a determinant 
of visual unity, e.g., in producing the phi-phenomenon. (b) Similarity. It is actually 
nearness in time or space of similar processes, not nearness per se, that determines 
grouping. The basis of the Ishihara color blindness test is perceiving a 
form of a given hue (for example, making up the number ‘9’) amongst a conglomeration 
of multi-hued dots. Similarly, it is presumably the continuity in time of 
auditory components of a given quality that makes a phoneme stand out as a 
unit, despite the overlapping of phones (e.g., in hearing /hard/ the /a/ phoneme 
is a persisting similarity of quality throughout much of the sequence). (c) Continuity. 
The more stimulus events dispersed in either space or time tend to follow 
regular or predictable sequences, the more likely are they to be perceived as a 
group. In an X there is directional continuity in seeing two crossed lines rather 
than an upright and an inverted V. Any violation of continuity increases the 
probability of disunity in perception. The continuity which characterizes diph- 
thongs presumably is the reason for perceiving them as single units. 

(2) Closure. Perceptual processes manifest holistic, all-or-nothing properties. 
A group of arcs may be perceived as a complete circle under certain conditions; 
a pattern of lines in two dimensions may be perceived as a solid cube. Familiar 
sequences of spoken speech can be mutilated to considerable degrees in transmis- 
sion without markedly affected intelligibility. In all of these cases the organism's 
nervous system, either on the basis of innate tendencies toward completion (ges- 
talt) or on the basis of past experience (behaviorism), acts to ‘fill in’ the input 
events which are missing at the peripheral level, provided enough of a pattern 
is given. In general, stimuli which frequently occur together or in close sequence 
tend to be perceived as wholes. The same thing can be illustrated in language: 
the sequence thelittlegirlrodeonahorse is presumably easier to decode than a 
sequence of less familiar units, e.g., thepetitebipedelopedonamare. 

The above are all conditions for unambiguous figure and ground. It is also possible 
to set up conditions under which several alternative groupings are nearly 
equiprobable, e.g., ambiguous figures. The famous Rubin figure, which can be 
seen either as a vase or as two faces, is a visual example. In the auditory field, the 
same progression of notes can be made to seem either like two intersecting melodies 
(e.g., an X) or like two separate melodies, upper and lower (e.g., our upright 
and inverted V’s), by manipulating the timbre, pitch, or some other characteristic 
of the instruments playing them; with the same instruments playing through 
the same mid-point, an ambiguous auditory experience is produced. Ambiguous 
orthographic patterns could be produced by omission of spaces (e.g.,—asinaga- 
tean), ambiguous linguistic patterns by omission of pauses and junctures, and 
both could be used to study grouping tendencies in decoding. Furthermore, vari- 
ous cues can be magnified or reduced in clarity so as to vary the speed with which 
organization takes place. It is suggested that in most speech situations redun- 
dancies in organizational cues are necessary to account for the apparent discrete- 
ness of acoustic decoding. 
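
Why an unspaced string is ambiguous can be shown with a small sketch that lists every way of cutting it into known words. The lexicon and the test string below are hypothetical illustrations, not materials from the proposed experiments, and real subjects would of course not be limited to a fixed word list.

```python
def segmentations(text, lexicon):
    """Return every way of cutting `text` into words drawn from `lexicon`."""
    if not text:
        return [[]]  # one way to segment nothing: no words at all
    results = []
    for end in range(1, len(text) + 1):
        head = text[:end]
        if head in lexicon:
            # Keep this cutting point and segment the remainder recursively.
            for rest in segmentations(text[end:], lexicon):
                results.append([head] + rest)
    return results


# Hypothetical lexicon and unspaced string, chosen only to show ambiguity.
lexicon = {"no", "now", "where", "here", "nowhere"}
for words in segmentations("nowhere", lexicon):
    print(" ".join(words))
```

Each printed line is one alternative grouping. In an experiment, the proportion of subjects choosing each alternative would indicate the relative strength of the grouping tendencies.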

(3) Constancy and transposition. When a person reacts to an object as ‘the same’ 
despite variations in illumination, in angle of regard, in distance, and so forth, he 
is showing constancy—each stimulus pattern is different, but his perceptual 
response is constant. When a person learns to respond to that one of two objects 
which is the brighter (larger, nearer, heavier, and so on) and continues to respond 
correctly despite wide changes in the absolute stimulus values, he is displaying 
transposition. Both of these phenomena are the same at base. The subject must 
have cues available that the context has changed (e.g., that the illumination has 
been lowered, that a disk is being held at something other than right angles to his 
line of regard, etc.) in order to show constancy of perception; in transposition one 
object provides the context for the other. If such contextual cues are eliminated 
constancy is eliminated, and what is perceived corresponds to what is given 
peripherally (and the object changes in brightness, in shape, and in size in accord- 
ance with actual stimulus values). Again, this phenomenon has been studied 
almost exclusively in connection with vision, but its analogies in hearing—par- 
ticularly speech decoding—are apparent and probably of great significance for 
the problem of psycholinguistic units. 

Constancy in decoding is evident in the fact that phonemes have a constant 
‘significance’ in the code regardless of the phonetic environments in which they 
appear (allophones). Transposition is operating whenever intonation and stress is 
correctly interpreted by the hearer in relation to the context or mean value of the 
utterance in which it occurs—the rising intonation of a question is not differently 
interpreted because a deep-pitched rather than a high-pitched voice is producing 
it. Perceptual constancies are basic to the operation of language as a code; the 
classes of unique events which have a constant significance in perception are what 
the descriptive linguist analyses as the phonemic structure of a language. 


The usual experimental situation for measuring constancy requires a standard object, 
viewed under some contextual condition such as a shadow or at some obvious angle of regard, 
and a comparison object, viewed under normal conditions and capable of being 
varied through degrees of brightness, angles of regard, and so on. The subject first adjusts 
the comparison object under open field conditions until it looks just like the standard; 
he then repeats this adjustment, but using a reduction screen which eliminates the 
context. If the match made under open field conditions is identical with that made with 
the reduction screen, he is said to show 0 per cent constancy (i.e., the context had no effect 
on, was not taken into account in, his perceptual judgment); if the comparison object 
‘looks the same’ as the standard without any such adjustment (e.g., without being darkened 
to account for the shadow), perfect constancy is shown. Some general facts about 
constancy are the following: 
(a) The use of a reduction screen eliminates the constancy effect. The analogous expectation 
for speech decoding would be that allophones and allomorphs should tend to sound 
more and more different as environmental context is reduced. The terminal past tense 
allomorphs should seem more like /t/ and /d/ when restricted (e.g., by tape cutting) to 
these sounds than when heard as parts of meaningful words, such as faked /feykt/ vs. 
played /pleyd/, where both endings should sound like /d/. 

(b) What is perceived in experimental conditions is usually a compromise between perfect 
constancy and absolute stimulus equation. Since it would be difficult to get subjects to re- 
port how similar speech sounds appear, the percentage of listeners giving ‘same’ as a report 
could be used to indicate the extent of compromise. (Cf. Section 4.1.1.2.) 

(c) The greater the contextual difference between standard and comparison objects, the 
greater the constancy effect. In speech decoding this would mean that the greater the dif- 
ference in phonetic environment, the more similar allophones and allomorphs should sound. 
The /t/ and /d/ allomorphs should sound more similar in the comparison napped /napt/ 
vs. waved /weyvd/ than in the very close environments of napped /napt/ vs. nabbed /nabd/. 

(d) Only object-tied stimulus characteristics (e.g., surface colors) display constancy. The 
more meaningful the segments in which speech sounds occur, the greater should be the 
constancy effect. The past tense allomorphs should sound more alike in the meaningful 
comparison ached /eykt/ vs. aimed /eymd/ than in the meaningless comparison /ikt/ vs. 
/imd/. 

(e) The more cues available that standard and comparison objects are ‘the same’ (albeit 
under different contexts), the greater the constancy. This again implies that constancy 
effects are strongest under natural conditions. Presumably, constancy among allophones 
and allomorphs should be enhanced by orthographic identity and diminished by ortho- 
graphic distinction. 

(f) The direction of the constancy effect is typically a ‘regression toward the real object’ 
(Thouless). In other words, what is perceived tends to be more like the object as known 
under ‘normal’ conditions of inspection, e.g., ordinary daylight illumination, normal angle 
of regard, inspection distance, etc. Isn’t it the case that /t/ sounds like /d/ in the past 
tense signal position, rather than the reverse; and that /z/ sounds like /s/ in the nomi- 
native plural signal position, rather than the reverse? Is the ‘real’ sound psychologically 
/d/ or /s/ here because of frequency of usage? Is it the one that corresponds to orthography? 
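
The constancy measure described above fixes only the two endpoints (0 per cent and perfect constancy). A simple way to place intermediate results on that scale is a linear index of the Brunswik-ratio type, sketched below together with the percentage of ‘same’ reports suggested in (b). The linear scaling, the function names, and the numerical values are assumptions for illustration, not part of the procedure as stated.

```python
def percent_constancy(open_field_match, reduction_match, object_value):
    """Linear constancy index.

    reduction_match : setting chosen through the reduction screen (no context)
    open_field_match: setting chosen with the context visible
    object_value    : setting physically identical to the standard object
    Gives 0 when the two matches coincide and 100 when the open field
    match equals the standard's own object value.
    """
    span = object_value - reduction_match
    if span == 0:
        raise ValueError("object value and reduction match coincide; index undefined")
    return 100.0 * (open_field_match - reduction_match) / span


def percent_same(reports):
    """Percentage of listeners reporting 'same' for a pair of speech sounds."""
    if not reports:
        raise ValueError("no reports given")
    return 100.0 * sum(1 for r in reports if r == "same") / len(reports)


# Hypothetical brightness settings on an arbitrary scale.
print(percent_constancy(open_field_match=50, reduction_match=20, object_value=80))
print(percent_same(["same", "different", "same", "same"]))
```

The second function corresponds to the proposal that, for speech, the proportion of ‘same’ judgments stand in for a direct similarity rating.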


Most of the evidence on visual constancies suggests that these are learned 
phenomena; certainly, perceptual constancies in language are learned. It seems 
likely that the users of a given language learn to discriminate those differences in 
the sound material that make a difference in the code and to not discriminate (pay 
no attention to) differences that do not make a difference in the code, the latter 
type of learning contributing to constancy effects. In a later section of this report 
(section 4), an experiment is described which gets at this prediction. 

3.1.1.2. General principles of perceptual organization. The empirical phenomena 
of perceptual organization described above seem to reflect the operation of a 
limited number of underlying principles of organization. Drawing on a great deal 
of evidence which cannot be included here, three general levels of organization of 
sensory input may be postulated. 

I. Projection level: summation of points of maximal stimulation and suppression 
of other activity. Marshall and Talbot¹⁸ and others have provided evidence for such 
processes in the visual projection system and indicated how they contribute to 
the formation of sharp contours on the visual cortex. Similar processes seem to 
operate in audition (e.g., masking in relation to pure tone resolution). These 


18 In Biological Symposia, Klüver, ed. (Ronald Press, 1942). 

mechanisms contribute to a general ‘sharpening’ of sensory signals; however, 
although constituting a significant aspect of total reception, they seem to be innately 
determined and ‘sensory’ rather than ‘perceptual’ in character. 
II. Integrative level: central correlates of redundant and frequently occurring sensory 
events become integrated at this level. Hebb has described a general principle 
of neural organization which fits this situation: if two or more neurones in fibrous 
contact, either directly or mediately, are simultaneously active, the synaptic 
junctures associating them are strengthened, so that the occurrence of one be- 
comes a condition for either evoking (high frequency of repetition) or at least, 
‘tuning up’ (lower frequency of repetition) the other. Since density of fibrous 
contact is probably both a function of nearness in neural space and of similarity 
of fiber type (due to anatomical organization), we can see a basis for two of the 
major determinants of perceptual grouping, nearness in space and similarity in 
physical quality. Reverberation in neural circuits provides for integration of 
neural events over short time intervals, giving a basis for another determinant of 
grouping, nearness in time. The general import of this principle is that sensory 
events will tend to be perceived in groups dependent upon redundancy and fre- 
quency in past experience. Thus things seen or heard together or in close temporal 
sequence in past experience will come to function as wholes in subsequent experience. 
The phenomena of closure and continuity become nothing more than demonstrations 
of this principle: parts of redundant and frequently experienced 
wholes serve to activate central representations of the whole. Figure experiences, 
whether visual or auditory, and their resistance to breaking up, are also phenome- 
nal effects of the operation of this general integrative principle. Elsewhere in this 
report, this type of principle is applied to an analysis of grammatical mechanisms 
in language decoding and encoding (section 6.1). 
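
The integrative principle can be illustrated with a toy calculation in which the bond between two units grows each time they are active together. This is only a schematic sketch: the update rule, the learning rate, and the two thresholds separating ‘evoking’ from ‘tuning up’ are assumptions introduced here, not quantities given by Hebb or by this report.

```python
from collections import defaultdict
from itertools import combinations


def hebbian_strengths(episodes, rate=0.1):
    """Strengthen the bond between every pair of units active in the same episode.

    Each episode is a set of unit labels that were simultaneously active.
    A bond moves a fraction `rate` of the remaining distance toward 1.0
    on every co-activation, so frequent pairings approach the ceiling.
    """
    strengths = defaultdict(float)
    for active in episodes:
        for a, b in combinations(sorted(active), 2):
            strengths[(a, b)] += rate * (1.0 - strengths[(a, b)])
    return dict(strengths)


def effect(strength, evoke_at=0.6, tune_at=0.05):
    """Classify a bond using hypothetical thresholds."""
    if strength >= evoke_at:
        return "evokes"
    if strength >= tune_at:
        return "tunes up"
    return "no effect"


# Units a and b occur together ten times; a and c only once.
episodes = [{"a", "b"}] * 10 + [{"a", "c"}]
for pair, strength in sorted(hebbian_strengths(episodes).items()):
    print(pair, round(strength, 3), effect(strength))
```

With these assumed values the frequently paired units reach the ‘evoking’ range while the rarely paired units are only ‘tuned up’, which is the contrast the principle draws between high and lower frequencies of repetition.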
III. Representational level: surrogates of total behaviors to objects become associated 
with signs of these objects, serving both as the significance of these signs and as 
mediators of instrumental behaviors appropriate to the objects represented. The development 
of representational mediators is discussed in some detail elsewhere in this 
report (section 6.1). Suffice it here to say that distinctive portions of the total 
behavior elicited by proximal object stimulation (e.g., taste, texture, eating, etc., 
of APPLE) come to be called forth in anticipatory fashion by the distal cues from 
the object (e.g., visual color, shape, etc. of APPLE). According to theory, it is 
by virtue of the association of visual and auditory patterns with these distinctive 
mediating processes that they serve as signs of the objects as palpably experienced 
(e.g., this particular visual pattern of rounded-redness is a perceptual sign 
of APPLE because it now elicits a minimal but distinctive part of the same 
behavior originally elicited by direct contact with APPLE). Since the various distal 
appearances of APPLE (under different illuminations, at different distances, 
and hence different visual angles, and so forth) are all associated with the same 
proximal stimulations and terminal behaviors, they come to constitute a class of 
signs having the same significance. Organisms learn to disregard the insignificant 
contextual differences. This association of a class of varied distal stimulations 
with a common significance is the essence of the constancy phenomenon. 




Presumably the same type of analysis would apply to linguistic constancies, 
at least at the ‘word’ level. The word apple is heard with a variety of intonations, 
in a variety of constructions, and in a variety of voice timbres, but it is associated 
with a common perceptual sign and/or proximal experience (e.g., is accompanied 
by seeing and/or manipulating the same object APPLE). The question of phonemic 
and morphemic (grammatical) constancies presents more difficulty, since 
they do not have ‘significance’ in any representational sense. However, the same 
underlying principle of learning constancies probably applies—language users 
learn to pay attention to the constant features which are significant (in the code) 
and to disregard the variable features which are not significant. Actually, the 
same distinction between constant (significant) and variable (non-significant) 
features arises in connection with perceptual constancies—enough of the features 
of APPLE must be present, such as shape and color, to elicit the common mediat- 
ing process or significance, which then provides for constancy in perception de- 
spite the variable, contextual features, such as size and illumination. The parallel 
between linguistic analysis of language constancies and psychological analysis of 
perceptual constancies is an intriguing one and deserves attention. 

3.1.1.3. Some research proposals. (1) Phonemic and morphemic constancies. Fol- 
lowing the close analogy between visual and linguistic constancies, one would 
want to study the perceived similarities of allophones and allomorphs under vary- 
ing degrees of linguistic context—complete meaningful utterances, single mean- 
ingful words, conditioning phonetic environments, and isolated speech sounds. 
Rather than asking the subject to make a judgment of degree of similarity, one 
should either require judgments of ‘same’ or ‘different’ with percentages of subjects 
indicating the degree of constancy, or use a forced choice technique, e.g., 
given [tʰal], choose either [stʰal] or [stal] as the more similar. Another experimental 
possibility here comes from the known characteristics of orthography: having 
taught speakers of an unwritten language the alphabetic notion along with a partial 
alphabet, their own perceived constancies should appear in use of the same 
symbols for what are allophones in their language. 

(2) Study of perceptual grouping in language decoding. At an earlier point in this 
section it was suggested that ambiguous spoken or written materials (the former 
produced by deleting between-word junctures by tape cutting and the latter by 
omission of between-word spaces) could be used to study spontaneous grouping 
tendencies. If subjects were given a sample of such material and instructed to 
segment it, the relative strengths of alternative grouping tendencies should appear 
in the frequencies of common cutting points. The use of anagrams (and equivalent 
‘anyocs’—jumbled vocalizations) offers another approach, applicable to smaller 
units than words. The stronger the transitional probabilities (e.g., sensory integrations) 
binding parts of the given anagram together, which must be separated 
for solution, the more difficult and time-consuming should be the solution. The 
stronger the transitional probabilities of the correct letter sequences, to be discovered, 
the less difficult and time-consuming should be the solution.¹⁹ Such 

19 Research along these lines is now being conducted by Charles Solley at the University 
of Illinois. 

analysis requires computation of transitional probabilities for samples of both 
English orthography and phonemes. 
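
As a rough sketch of the computation this would require, letter-to-letter transitional probabilities can be estimated from a text sample and then averaged over the adjacent pairs in a given letter string. The sample, the function names, and the use of a simple mean as the measure of ‘bond strength’ are assumptions for illustration; a usable estimate would need a far larger sample, and the same tally could be made over phoneme transcriptions.

```python
from collections import Counter


def transitional_probabilities(sample):
    """Estimate P(next letter | current letter) from adjacent pairs within words."""
    pair_counts = Counter()
    first_counts = Counter()
    for word in sample.lower().split():
        letters = [c for c in word if c.isalpha()]
        for a, b in zip(letters, letters[1:]):
            pair_counts[(a, b)] += 1
            first_counts[a] += 1
    return {pair: n / first_counts[pair[0]] for pair, n in pair_counts.items()}


def mean_bond_strength(string, probs):
    """Average transitional probability over the adjacent letter pairs of `string`.

    Pairs never seen in the sample count as zero.
    """
    pairs = list(zip(string, string[1:]))
    if not pairs:
        return 0.0
    return sum(probs.get(pair, 0.0) for pair in pairs) / len(pairs)


# Tiny hypothetical sample, far too small for real use.
probs = transitional_probabilities("the tea ten")
print(mean_bond_strength("the", probs))  # bonds within the solution word
print(mean_bond_strength("hte", probs))  # bonds within one scrambled presentation
```

On the hypothesis in the text, an anagram should be harder when the presented string has a high mean bond strength and easier when the solution has one.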

(3) Study of ‘communication units.’ It might be appropriate to begin with an 
approximation to the normal linguistic situation—face-to-face conversations be- 
tween two speakers of a language. The grossest unit of language perception would 
seem to be the shortest consecutive sequence of speech produced by one individual 
to which another can make a discriminative response, e.g., the minimal sequence 
that makes a difference in behavior. The effects of increased context upon shorten- 
ing of this minimal sequence could also be investigated. Utterance completion 
could be used as a tool here, for example. 


3.1.2. Unit Formation in Behaving (Encoding) 


The ‘flow of speech’ is a rather apt simile. In the midst of ordinary conversation 
the adult speaker is operating rapidly, smoothly, and largely unconsciously upon 
the outward-moving columns of air by alternately contracting and relaxing a set 
of muscles into varying postures which modulate the rates and amplitudes at 
which this air vibrates. These muscles are always in flux, always approaching 
some posture and leaving another, never in static pose. This flow of behavior is 
analysable into over-learned, well-integrated vocalic skill sequences (probably 
individual words and trite phrases) which are encoded as units and run them- 
selves once initiated. These skill sequences are themselves further analysable into 
vocalic skill components, which we tentatively identify with syllables rather than 
phonemes—if a speaker is asked to slow down his output to a very low rate, he 
typically inserts longer pauses between syllabic units without changing to any 
great extent the intervals between the phonemes constituting syllabic units. This 
is, of course, an hypothesis in need of test. 

3.1.2.1. Vocalic skill components. The basis for formation of motor output units 
is probably the same as that involved in the formation of sensory input units— 
central neural integration based upon peripheral motor redundancy and fre- 
quency. As a matter of fact, the evidence for central integration or programming 
of motor skills is clearer than in the case of sensory events. 


A three stage process of skill formation can be envisaged: (1) The 
repetition of a regular sequence of motor responses on the basis of direct encoding, 
imitation of adult models, or some other basis. (2) Since under these conditions 
each movement produces proprioceptive self-stimulation (feedback) which can be 
conditioned to the succeeding movement, a chain of simple stimulus-response associations 
is set up, and the developing skill ‘runs itself’ at a much more rapid rate. (3) As 
Lashley pointed out many years ago, there is just simply not time in a rapidly 
executed skill (e.g., playing a cadenza or speaking) for impulses to travel in this 
fashion from periphery to center and back again between each movement. When a 
sequence of movements is being executed repeatedly on a proprioceptive feedback basis, 
the time intervals between successive reactions are short enough to permit formation 
of central integrations (presumably in the motor cortex) among the neural events which are 
the necessary antecedent of these movements. Again, following Hebb's principle, 
when cells having nervous interconnections are caused to be simultaneously active, there 
results an increase in the probability that subsequent activation of any one will 
lead to activation of the next in sequence and so on. In other words, a short-circuiting 
within the motor system is accomplished and a greater speed and stability of execution 
becomes possible. 


The phoneme has been defined as a bundle of distinctive features, these fea- 
tures including such characteristics as tongue-tip position, rounding or flattening 
of the lips, vibration or non-vibration of the vocal cords, and so forth. This defini- 
tion spells out the fact that the phoneme is a spatial pattern of motor activity, but 
it is also a temporal pattern of activity. As a bit of skilled behavior it includes the 
temporal effects of approaching toward its typical posture from a diversity of 
other postures (antecedent environments) and receding from this to a diversity 
of other postures (subsequent environments). Since central motor programming in 
the nervous system is much more rapid than peripheral execution, there is always 
a tendency to anticipate features of subsequent phonemes and persist in features 
of antecedent phonemes. These skill modifications are at once the basis of allo- 
phones and evidence of the formation of encoding units. 

The tightness with which the elements of a skill component (or a skill sequence, 
cf. below) are welded is a function of both redundancy and frequency. Due to the 
relatively high order of redundancy within phonemic units, the spatial pattern 
of events here should be highly evocative, e.g., occur as synchronous bursts as 
wholes; due to the lower order of redundancy between phonemic units (e.g., /b/ 
can be followed by /i/, /e/, /a/, /o/, and other vowels as well as by the conso- 
nants /l/ and /r/), one would expect the temporal sequences of phonemic events 
within syllables to be merely predictive of one another, and thus less tightly 
welded. This expectation, however, does not take into account the possibility of 
forming higher order units on the basis of extremely high frequency, e.g., syllables 
which become a ‘pool’ of alternate wholes in encoding. Casual observation suggests
the syllable as the minimal unit in encoding—not only is there the fact that 
slowed down speech is accomplished by syllabic spacing, as noted earlier, but 
babbling behavior in infants is typically syllabic in nature. The work of Stetson 
on the relation of the chest-pulse to syllable formation also seems to support this 
view. It should be noted that this does not imply that the syllable is also the 
minimal unit in decoding.

3.1.2.2. Vocalic skill sequences. The model in which proprioceptive and audi- 
tory feedback is a controlling factor in skill execution is probably preserved in the 
more loosely welded vocalic skill sequences, the sequences of syllables that con- 
stitute words and trite phrases. The rapidly executed pattern of responses within 
each syllabic unit produces distinctive sensory feedback; to the extent that cer-
tain sequences of syllabic units are redundant and of frequent occurrence, this
distinctive stimulus pattern will become predictive of certain subsequent syllabic
units. Thus familiar syllabic sequences should run themselves off more rapidly in
encoding than unfamiliar syllabic sequences, and frequency of errors in encoding
should be predictable as substitutions of high frequency sequences for low fre-
quency ones at points of high antecedent similarity.

Suggestive evidence for this analysis is provided by the research of Grant Fair- 
banks on the effects of delayed auditory feedback. He has been able to show that 
the interval of delay in feedback at which the greatest interference is produced
in both spontaneous encoding and reading aloud (e.g., stuttering, reduplication
of preceding sounds, omissions, and the like) is about 0.25 seconds. This corre- 
sponds closely to the average rate of syllable production, about four per second. 
Attempts to disclose finer ‘ripples’ of interference corresponding to the average 
rate of phonemic production have been unsuccessful. 
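
The correspondence noted above is simple arithmetic: a production rate of about four syllables per second implies a mean syllable duration of about 0.25 seconds, which is the delay at which interference is greatest. A minimal sketch, in which the function name is illustrative only and not part of the original report:

```python
def mean_unit_duration(rate_per_second):
    """Return the mean duration (in seconds) of one unit at a given production rate."""
    if rate_per_second <= 0:
        raise ValueError("rate must be positive")
    return 1.0 / rate_per_second


# Four syllables per second, the average rate cited in the text.
syllable_period = mean_unit_duration(4.0)
print(syllable_period)
```

The same function could be applied to an estimated phonemic rate to locate the delay at which the finer 'ripples' would be expected, had they been found.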

The general import of this analysis is that functional units of encoding are 
flexible with respect to standard linguistic units; they depend for their formation 
upon redundancy and frequency factors in the main and may span sequences of 
varying length. Units may be as small as the syllable and as large as a phrase (e.g., 
"Howd'ya-do?", “B’lieve-’t-’r-not”). The behavioral correlates of tightness of 
unit formation should be latencies between elements in production and the exist- 
ence of skill modifications, such as truncation, amalgamation, and anticipatory 
and perseverative alteration. 

3.1.2.3. Some research proposals. A number of research proposals related to 
units in encoding are included in section 5 on transitional psycholinguistics. Cer- 
tain general possibilities may be suggested here. 

(1) Detailed latency measurement. Modern instruments make it feasible to ana- 
lyse juncture and pausal phenomena in close detail. The general prediction is that 
the distribution of within-syllable intervals should be of minimal duration, if 
evident at all, and significantly shorter than between-syllable intervals. Between- 
syllable intervals in turn should vary with redundancy and frequency factors, 
being shortest between syllables within common words and trite phrases and 
longer between syllables in rarer words and less predictable phrases. Intervals 
between morphemic boundaries, and the effects of stress and intonation upon type 
of juncture can also be investigated in this manner.

(2) Delayed auditory feedback. Given measurements of transitional probabilities 
in English, particularly as between syllables, the delayed feedback technique
could be employed to check the prediction that weaker links in the encoding
chain, e.g., points of low transitional probability, are more susceptible to inter-
ference.

(3) Slowed speech. A similarly detailed analysis should be made of intentionally 
slowed speech on the part of native speakers. The expectation offered here is that 
increases in latency will be chiefly apparent between syllables rather than within,
accomplished, perhaps, by elongation of the terminal voiced phoneme of each
syllable.

(4) Interruption technique. If a spontaneously encoding speaker is
interrupted at unpredictable intervals (by some ingenious [illegible]
at present), he would be expected to begin again at the nearest [illegible]
onset, e.g., “interruption te//—technique is a metho//—od of stud//—ying,” etc.
The expectation is that these units would be syllabic or larger.

(5) Backward-working skill modifications. Probably one of the best measures of
encoding units is the existence of backward-working (e.g., anticipatory) skill
modifications. When the speaker modifies his articulation of the /k/ in [illegible] as
compared with the /k/ in key to anticipate the following vowel, it is incontro-
vertible evidence that this much, at least, is being encoded as a unit. In other
words, the encoder must have already selected the vowel aspect of the syllable at
the time of executing the initial phoneme. The same sort of logic applies to en- 
coding units operating over larger segments of the message. When a Spanish- 
speaking encoder, for example, produces las bonitas casas, the grammatical 
marker for the feminine gender, -as, which appears in the article depends upon 
the noun form, casas—again, it is certain that at least this much must have been 
selected at some level of organization as a single unit. A detailed analysis of such 
backward-working adaptations should be a very profitable enterprise. It would 
probably provide evidence for a hierarchical structure of units-within-units in 
encoding (cf., section 3.4). 
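
Proposal (2) above presupposes measurements of transitional probabilities between syllables. The following is a minimal sketch of how such estimates might be computed and how the weakest link in a sequence might be located. The function names and the toy corpus are hypothetical illustrations, not data or procedures from the original report.

```python
from collections import Counter


def transitional_probabilities(sequences):
    """Estimate P(next unit | current unit) from a list of unit sequences."""
    pair_counts = Counter()
    first_counts = Counter()
    for seq in sequences:
        for current, following in zip(seq, seq[1:]):
            pair_counts[(current, following)] += 1
            first_counts[current] += 1
    return {pair: n / first_counts[pair[0]] for pair, n in pair_counts.items()}


def weakest_link(seq, probs):
    """Return (index, probability) of the least probable transition in seq.

    Transitions never observed in the corpus are given probability 0.0.
    """
    links = [(probs.get((a, b), 0.0), i) for i, (a, b) in enumerate(zip(seq, seq[1:]))]
    probability, index = min(links)
    return index, probability


# Hypothetical syllabified utterances, invented purely for illustration.
corpus = [
    ["how", "dya", "do"],
    ["how", "dya", "do"],
    ["how", "are", "you"],
    ["do", "you", "know"],
]
probs = transitional_probabilities(corpus)
print(weakest_link(["how", "are", "you"], probs))
```

On the prediction stated in the text, the transition returned by `weakest_link` is the point at which delayed feedback should produce the most interference.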

3.2. Relations between Psychological and Linguistic Units²⁰

3.2.1. The Problem 

By application of the logically rigorous methods described earlier (section 2.1), 
the linguist has been able to determine minimal units on each of the levels into 
which language is usually divided. The unit on the phonological level is the 
phoneme; the unit on the morphological level is the morpheme; and most linguists 
would probably admit the validity of the function class as a meaningful and useful 
unit at the syntactical level. These units can be rigorously defined in terms of 
linguistic method and have proven useful for descriptive purposes. 

However, the speaker of a language is also aware of certain units in its struc-
ture. At least he uses certain terms consistently in talking about his language
which indicate perception of units roughly at each of these levels. Sapir has 
pointed out that speakers of Indian languages which have no orthography at all 
have no difficulty in dictating a text to a field worker ‘word by word.’ The same 
speaker, probably, could dictate his text ‘one sentence at a time’ if asked to. This 
implies an implicit set of criteria for defining words and sentences. For languages 
which have a written form, these criteria are usually reflected in the orthography. 
A ‘word’ is a unit which, when written, appears between spaces. A ‘sentence’ is a
unit which, when written, starts with a capital and ends with a period. But, obvi- 
ously, the orthography is merely a representation of what, at one time at least, 
were felt to be criteria that operated in speech—that is, the criteria which govern 
our Indian informant, who has no prejudices because of orthography. With his 
concepts of ‘word’ and ‘sentence’ the speaker indicates his awareness of units at 
the levels of morphology and syntax. Regarding phonology, there would be less 
agreement in identifying the number of ‘sounds’ in a given utterance, but speak- 
ers would probably agree on ‘syllable’ counts, if not on syllable boundaries. 
The three psychological units which emerge from a native speaker’s analysis, then,
are the syllable, the word, and the sentence. 

If we use the dichotomy of ‘linguistic units’ and ‘psychological units’ to apply
respectively to the units determined by the linguist and the native speaker, our
immediate problem becomes one of relating them. In other words, we are con-
cerned with the psychological validity or reality of existing linguistic units and
with the linguistic feasibility or productivity of ‘natural’ psychological units.
There is the further problem that what we might call ‘psycholinguistic units’ need
not correspond precisely to either those arrived at by deliberate linguistic analysis
or those arrived at by casual lay analysis. Psycholinguistic units would be those
segments of the message shown to be functionally operative as wholes in the proc-
esses of decoding and encoding, and these too are capable of analysis into levels.
For example, the units operating in correlation with events at the semantic or
representational level are probably different than those operating at the gram-
matical or integrative level, and both of these in turn are probably both different
and larger than the units correlated with skill components in encoding.

20 Sol Saporta.

3.2.2. Linguistic Feasibility of Psychological Units 

The linguist, aware that syllable, word, and sentence are functional concepts 
to the native speaker of a language, has felt obliged to define them rigorously, but
he has met with little success.

(a) He has been reasonably successful in incorporating the concept of the syl-
lable into his descriptions, but only for some languages. In some dialects of
Spanish, for example, the quality of a vowel (open vs. closed) is determined by
its position in the syllable (non-final vs. final). We have, then, objective criteria
for determining syllable boundaries. Most attempts to define the syllable have
been made in terms of the presence or absence of a vowel, or some similar crite-
rion. However, too often linguists have ended with the kind of circularity by
which a syllable is defined as that unit which contains one and only one vowel (or
diphthong) and the vowel is defined as that unit which may function as a syllable.
Other definitions have been attempted in terms of chest pulses, etc., but there
apparently is no definition which is entirely satisfactory. It has also been sug-
gested that even if definable, the syllable as a concept may be irrelevant in a


cessful, linguistic solution of this problem by Greenberg. 

(c) The sentence likewise has not been clearly defined in linguistics. The most 
meaningful definitions have been in terms of intonation features and juncture 
phenomena. A sentence end is usually marked by one of several ‘final junctures’ 
accompanied by a certain intonation pattern. After listening to impromptu con-
versations in several languages, one suspects that even these criteria apply only
in ‘cleaned-up’ texts, and may not really apply in the everyday
situation.


The result is that the linguists for the most part have been unable to operate
profitably with these units of language which speakers intuitively understand 
and use. 


3.2.3. Psychological Reality of Linguistic Units 

We come now to the reverse problem of determining whether the units which 
the linguist can isolate are psychologically valid. 

(a) The phoneme is probably the one unit which can be demonstrated to exist 
both linguistically and psychologically. (A specific experimental technique is sug-
gested below.) Under normal circumstances, in the decoding process, people do
not distinguish differences between allophones. They are, of course, noticed when
incorrectly used by a foreigner speaking with an accent. Likewise, speakers in 
encoding are not conscious of selecting among allophone classes—this is auto- 
matic. Consequently the allophone is too small to be a unit in the encoding or 
decoding process, implying that the phoneme is. But here, apparently, is a con-
tradiction. If the selection of an allophone is determined, say, by the following
phoneme, it implies that at least for the encoder, a group of two phonemes is a 
unit—that one selects at least two phonemes (possibly a syllable) at a time. On 
the other hand, however, two words or two messages may differ by only one 
phoneme, which means that the one phoneme has been selected independent of 
the environment and likewise that the decoder must distinguish between pho- 
nemes. 

If we consider the initial sound in ‘key,’ we must conclude that it plays a dual 
role. The particular allophone [k^-] is a part of a larger unit in the flow of speech.
However, the phoneme to which this sound is assigned, /k/, is itself a unit, and 
as a unit it serves as a basis for distinguishing this lexical item from, say, ‘tea.’ 
In trying to relate units to points of decision, then, we conclude that whereas the 
abstraction, i.e., the phoneme, corresponds to a unit of decision, the particular 
manifestation or actualization of the phoneme, i.e., the allophone, is only part 
of a unit. Our problem then is to determine what is this larger unit. There are two 
possibilities. ‘Key’ may have been chosen either as a phonological unit (a sylla- 
ble), or as a morphological unit, (a morpheme). Obviously the two levels need not 
exclude one another. A discussion of the hierarchies of levels appears further on
in this section (3.4). 

(b) There is evidence for the justification of larger units as well. Just as allo- 
phones are encoded and decoded automatically, allomorphs may be selected auto- 
matically, indicating the same process on the morphological level. For example, 
an English speaker, and particularly a listener, is not aware of the phonemic dif-
ference between the singular ‘house’ /haws/ and the corresponding allomorph in
the plural ‘houses’ /hawz-/. In other words, it is as though the phonemic differ-
ence between /s/ and /z/ were neutralized, indicating that a unit larger than the
phoneme is being decoded.

Again the abstraction, i.e., the morpheme house, is a unit because of the obvious 
decision not to say, for example, churches. However, the particular allomorph is
a part of a larger unit in the flow of speech. This may be a morphological unit,
the word, or some syntactic unit, perhaps the noun phrase.

(c) On the syntactic level, the category of agreement indicates selection of word 
groups. A Spanish speaker who begins a phrase with the feminine article ‘la’ indi-
cates by this choice that he has already selected a feminine noun, so that, on the
syntactical level, perhaps the whole noun phrase is a unit in encoding. There is
an interesting difference here also between encoder and decoder: for the encoder,
the subsequent unit (in this case the noun) determines the antecedent (the
article); for the decoder, the antecedent limits the probabilities of the subsequent.
It is as though in one case the article ‘agreed’ with the noun (‘agreed’ here is 
equivalent to ‘is determined by’) while in the other case, from the point of view 
of the listener, the noun 'agrees' with the article. The question of agreement has 
not been clearly treated in linguistics, and this is possibly because it is difficult
to find one explanation which will cover what seem to be two different processes. 



In this connection, it is tempting to hypothesize a relation across languages between 
the expression of agreement by adjectives and the position of the adjective in relation to
the noun. For example, one might suspect that if the selection of the noun determines the 
form of the adjective, then the adjective is more likely to follow the noun. This is generally 
true in the Romance languages, where many adjectives must follow, and others may follow 
or precede. On the other hand, if there is only one form of the adjective, as in English, it 
may very easily precede since the selection of the noun cannot affect the form of the ad-
jective. German, of course, would be an obvious exception. Before such a hypothesis could
be seriously considered, a large number of languages would have to be investigated. If
such a relation appeared, it would imply that the units of encoding differ from language
to language, that the larger the role played by agreement, the larger the unit of encoding. 


Our preliminary survey suggests that: (1) for the most part, linguists have 
been unable to operate profitably with ‘natural’ folk units, and (2) there may be 
some basis for concluding that the linguistic units are psychologically valid. The 
latter must, however, be tested by suitable experimental situations.


3.2.4. Research Proposals

Research should be directed at setting up situations designed to yield inde-
pendent results which can then be compared with the two sets of units described
above. It may develop, for example, that on the phonological level, both the syl-
lable and the phoneme are valid.

3.2.4.1. Child language. One field for such investigations might be in child lan- 
guage. The order in which distinctions and contrasts are made should be carefully 
analyzed. For example, if it turned out that a child learned a series of monosyl-
labic items, no two of which formed a minimal contrast (e.g., if, having
learned the word ‘pa’, he did not then learn ‘ma’, but first learned ‘me’), one
might conclude that learning was on a syllable basis rather than on a phonemic
basis. On the morphological level, the writer has heard of cases where children
have confused the items ‘yesterday’ and ‘to-morrow,’ indicating that each is a
minimal unit of meaning and has been learned as such. This error is in terms of
‘words,’ not morphemes, and apparently would conflict with the analysis of those
linguists who would insist on dividing ‘yesterday’ into two meaningful units (mor-
phemes) on the basis of contrasts with such items as ‘yesteryear’ and ‘Monday,’
‘Tuesday,’ and perhaps even ‘to-day.’ This is not to be interpreted as indicating
that the morpheme is not a unit in language learning. A child’s use of a form such 
as ‘runned’ or a formation such as ‘monk’ from ‘monkey,’ in analogy to a pair like 
‘dog—doggie,’ indicates an awareness of morphemes. Nevertheless, it seems rea- 
sonable to conclude that an analysis of the learning process would indicate both 
the morpheme and the word as valid units. 

3.2.4.2. Reversed speech. Another possible technique might be the use of re-
versed speech in a controlled experimental situation. The purpose would be to
ascertain those points where mistakes are made, the hypothesis being that from
these points some indication may be had of the units into which the speaker
divides speech. The next step would be to correlate these units with the psycho-
logical and the linguistic units previously mentioned. We assume that any fea-
tures that are encoded simultaneously are being treated as indivisible units,
either in the perception of the item as presented or in the production of the item 
in reversal. 

It is apparent from even the most superficial observation that any native 
speaker of English, instructed to reverse the sounds of the word ‘net’ will respond 
with /ten/, and he will also be of the opinion that his answer is ‘correct.’ The 
linguist is of course aware that the speaker has modified all three sounds, perhaps, 
substituting one allophone for another. The fact that allophones are thus changed 
automatically indicates that, at this level at least, not allophones but phonemes 
are functioning as units. The examples selected for experimentation here would 
have to be carefully chosen to test the units being considered. For example, given 
the word ‘mate,’ most subjects can be expected to respond with /teym/, thus 
indicating that on this level diphthongs are a psycholinguistic unit.
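
The contrast between the expected response and a strict segment-by-segment reversal can be made concrete. In the sketch below, the transcriptions are given as lists of units; whether the diphthong /ey/ is listed as one unit or two is the assumption being varied. The function name is illustrative only.

```python
def reverse_units(units):
    """Reverse a sequence of units and join them into a single string."""
    return "".join(reversed(units))


# 'net': three phonemes, no diphthong.
print(reverse_units(["n", "e", "t"]))        # ten

# 'mate' with the diphthong /ey/ treated as a single unit.
print(reverse_units(["m", "ey", "t"]))       # teym

# 'mate' with the diphthong split into two segments.
print(reverse_units(["m", "e", "y", "t"]))   # tyem
```

A subject who answers /teym/ is behaving as the second call does, which is the basis for treating the diphthong as a unit at this level.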


If English is used for these experiments, one problem that comes up immediately is the 
influence of orthography. Two possibilities suggest themselves: (1) The effects of spelling 
may theoretically at least be eliminated by using either illiterates or pre-school children 
as subjects. (2) The effect of spelling may be so considerable that it might be advisable 
to measure it directly in an attempt to ascertain whether it has any effect on the percep- 
tion of units by speakers. For example, one might ask subjects to reverse a series of words 
given orally, amongst which were included the pair ‘wrong,’ ‘right,’ and then some time 
later, the pair ‘read,’ ‘write.’ By comparing the two reversals for the sequence /rayt/ one 
might determine to what extent mistakes were a result of orthography. The same results 
might be obtained by asking them to reverse other pairs of homonyms presented in slightly 
different form. For example, one might expect different results from subjects told to re- 
verse the last words in the sentences ‘I don’t like to pay my income /taks/,' and ‘I always 


21 For a discussion of the suggestion that morphemic analyses, like phonemic analyses, 
can only be based on individual idiolects, see Nida, Word, 7. 1-14 (1951). 

22 It seems reasonable to assume that spelling does affect the analysis of some speakers.
Most speakers, for example, do not readily associate ‘cat’ and the element ‘kit’ of ‘kitten,’
whereas they do associate ‘goose’ with the element ‘gos’ of ‘gosling.’ It seems that the
orthography here overbalances the phonetic relation. 




use nails, but rarely use /taks/.' For our purposes, we may assume that the influence of 
orthography has been eliminated by using either illiterates or pre-school children. 


The hypothesis, then, is that it may be possible to determine psycholinguistic 
units by analyzing those places where subjects make ‘mistakes,’ i.e., where the 
reversal does not coincide with the actual reversal of phonemes and would not 
‘sound right’ if played back in reverse on tape. It seems likely that these places 
may coincide with the various linguistic and psychological boundaries as defined 
above—namely, boundaries of phonemes, morphemes, syllables, words and so 
forth. Still assuming that orthography has no influence, subjects might be asked
to reverse the words ‘boys’ /boyz/ and ‘noise’ /noyz/. Linguistically, the /z/ is 
different in these two cases, in one case being a morph in itself; furthermore, the 
morpheme ‘plural’ has an allomorph with the phonemic shape /s/ as well as the 
one /z/. If we extend the process by which allophones were substituted in the 
example ‘net’—‘ten,’ we may reasonably assume that there will be competition 
between allomorphs in the reversal of ‘boys,’ whereas no such competition should 
exist in the reversal of ‘noise.’ We might expect ‘soyb’ (rather than ‘zoyb’) as a 
significantly more common response than ‘soyn’ because of the conflict of /s/ 
and /z/ in the former case. Another typical mistake might be ‘yobz,’ where the 
sequence of morphemes is maintained with reversal occurring within the mor- 
pheme. Another possible measure of the effect of this linguistic boundary might 
be the relative latency of similar responses. Those responding to ‘boys’ with 
/zoyb/ would be expected to have taken more time than those responding /zoyn/
to ‘noise.’ We suggest, then, that at those places where there are clear linguistic 
boundaries, subjects will indicate that, to a certain extent, linguistic units cor-
respond to (or influence the determination of) psycholinguistic units.
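
The two kinds of response discussed for ‘boys’ and ‘noise’ can be sketched as two reversal procedures over a morph-segmented transcription. The function names are hypothetical. The diphthong is kept whole in the full reversal and split in the within-morph reversal, an assumption made only to match the spellings ‘zoyb’ and ‘yobz’ used in the text.

```python
def full_reversal(morphs):
    """Reverse every unit in the word, ignoring morph boundaries."""
    units = [unit for morph in morphs for unit in morph]
    return "".join(reversed(units))


def within_morph_reversal(morphs):
    """Keep the order of morphs but reverse the units inside each morph."""
    return "".join("".join(reversed(morph)) for morph in morphs)


boys = [["b", "oy"], ["z"]]      # two morphs: stem plus plural
noise = [["n", "oy", "z"]]       # a single morph

print(full_reversal(boys))                                 # zoyb
print(full_reversal(noise))                                # zoyn
print(within_morph_reversal([["b", "o", "y"], ["z"]]))     # yobz
print(within_morph_reversal(noise))                        # zoyn
```

For the single-morph item the two procedures give the same result, so only ‘boys’ offers the competing responses that the hypothesis predicts.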

We have thus far limited ourselves to monosyllabic units. It would be interest- 
ing to see what would be the effect of increasing to, say, four syllables, but without 
changing the instructions. Would subjects automatically try to reverse syllables 
instead of phonemes? In other words, is the unit of perception in part determined 
by the length of the utterance? A further possibility is to instruct subjects to 
reverse the syllables in a series of words of two syllables, which differ in their 
linguistic units. One would expect, for example, significant differences between a 
pair such as ‘boyish’ and ‘parish,’ where the co-occurrence of syllable and
morpheme boundaries in the first case should facilitate reversal. This procedure 
could be carried out on all levels of linguistic analysis. For example, subjects 
could be requested to reverse the words in the sentence ‘The boy went home.’
The most common error one would expect would be ‘home went the boy,’ where
‘the’ and ‘boy’ are considered as forming a unit, in accordance with the usual lin-
guistic analysis. Carefully chosen examples and accurate interpretation of results
might reveal interesting correlations between the linguistic (formal), the psycho-
logical (intuitive) and the psycholinguistic (functional) levels of unit analysis.


23 Notice that it is likely that diphthongs will not be reversed.

24 The relative frequency of the words might be a factor in facilitating reversal, in which
case the words would have to be chosen in accordance with a reliable frequency list.




3.3. The Word as a Linguistic Unit²⁵


The word as a unit occupies a paradoxical situation in present-day linguistic 
science. Such a unit, roughly coinciding with the usage of the term in everyday
language and in the discourse of sciences other than linguistics, is actually em-
ployed as a fundamental dividing line between the two levels of morphological
(infra-word) constructions and syntactic (supra-word) constructions. Yet no
generally accepted and satisfactory definition exists, and some linguists deny the
validity of the word altogether, relegating it to folk-linguistics. Others believe
that the word must be defined separately for each language and that there are
probably languages to which the concept is inapplicable. Some define the
word in phonological terms, as when a word in Czech is defined as a sequence with 
stress on its initial syllable. Other definitions depend on the distribution of mean- 
ingful units and may be qualified as morphological or grammatical. Here belongs 
Bloomfield’s well-known definition of the word as a minimal free form. This 
definition has the advantage, lacking in so many others, of being operational. 
Unfortunately it leads to results not at all like the traditional notion, although it 
was manifestly intended to correspond at least roughly to ordinary conceptions 
of the word. For example, ‘the’ in English would not be a word, but ‘the king of 
England’s’ in the sentence ‘the king of England’s realm includes land on several 
continents’ would. This is not in itself a fatal objection to its acceptance as 
defining some unit but it cannot be considered an adequate explication of the 
ordinary usage. Nida, for example, who adopts it, finds it necessary to supple-
ment it with additional criteria, an indication of its unsatisfactory status.²⁶


3.3.1. Criteria for ‘Word’ Units 

Before proceeding with the definition to be proposed, we must ask what re- 
quirements must be fulfilled by a definition for it to be considered satisfactory. 
The popular conception of the word as indicated by the use of space in orthog-
raphies of various languages is not in itself sufficiently consistent to make a
definition possible which will justify every word division in every existing orthog-
raphy. This would be an unfair, and one might add, an impossible, requirement.
As generally in problems of scientific explication, we take the popular non-scien-
tific use as a point of departure, and one to which our results must, in general,
conform. We require of our definition that it involve procedures that can actually 
be carried out (i.e., that it be operational), be free of logical contradiction, and 
give results in general agreement with the popular notion of what a word is. 

Among the requirements that must be satisfied for the word to correspond to 
the usual notions regarding it, are the following: it should consist of a continuous 
sequence of phonemes such that every utterance in a language may be divided
into a finite number of words exhaustively (i.e., with nothing left over) and un-
ambiguously (every phoneme should belong to only one word). Otherwise stated,

25 Joseph H. Greenberg.

26 See Eugene Nida, Morphology: the descriptive analysis of words (Ann Arbor, 1949).
For a convenient review of the history of the subject, not discussed here, see Knud Togeby,
Qu'est-ce qu'un mot? in Travaux du Cercle Linguistique de Copenhague 5. 97-111 (1949).




the division of an utterance into words should involve the assignment of each 
phoneme to one of a set of mutually exclusive classes which exhaust the universe 
of the particular utterance. It would also be expected that every word boundary 
should be a morph boundary, that is, the constituent phonemes of a morph, the 
minimal meaningful sequence, should never be divided between two or more 
words. On the other hand, it would be expected that many morph boundaries 
would not be word boundaries. 
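
The requirements just listed can be stated as a mechanical check on any proposed division of an utterance. The sketch below is an illustration under stated assumptions: phonemes are indexed 0 to n, words and morphs are given as (start, end) spans, and all function names and the sample utterance are hypothetical.

```python
def boundaries(spans):
    """Return the set of positions at which a span ends."""
    return {end for _, end in spans}


def is_partition(spans, n):
    """True if the spans are continuous, non-overlapping, and cover 0..n exactly."""
    position = 0
    for start, end in sorted(spans):
        if start != position or end <= start:
            return False
        position = end
    return position == n


def valid_word_division(word_spans, morph_spans, n):
    """Check exhaustiveness, unambiguity, and that word boundaries are morph boundaries."""
    return (
        is_partition(word_spans, n)
        and is_partition(morph_spans, n)
        and boundaries(word_spans) <= boundaries(morph_spans)
    )


# Letters stand in for phonemes in the invented utterance 'schools open'.
utterance = "schoolsopen"
morphs = [(0, 6), (6, 7), (7, 11)]        # school | s | open
words_ok = [(0, 7), (7, 11)]              # schools | open
words_split_morph = [(0, 5), (5, 11)]     # boundary falls inside 'school'
words_with_gap = [(0, 7), (8, 11)]        # one phoneme left unassigned

print(valid_word_division(words_ok, morphs, len(utterance)))
print(valid_word_division(words_split_morph, morphs, len(utterance)))
print(valid_word_division(words_with_gap, morphs, len(utterance)))
```

The check says nothing about which morph boundaries should be word boundaries; it only rejects divisions that violate the stated requirements.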


3.3.2. Overview of this Analysis 

To aid in clarifying procedures, which must otherwise seem obscure at many 
points without some knowledge of the end-result and the major difficulties to be 
overcome, an informal account of the nature of the solution attempted here will 
first be given. The continuity, or non-interruptibility of the word, has been 
mentioned above as a desideratum of a successful definition. This might suggest 
immediately that a word be defined simply as a sequence within which another
sequence cannot be inserted. However, it will soon appear that while in general 
this is true, it does not constitute an adequate definition. For example, we can 
insert ‘r’ in ‘gate’ to get ‘grate,’ but we wish ‘gate’ to be a word in English. We 
can insert ‘house’ in ‘schools’ to get ‘schoolhouses’ but we would certainly want 
‘schools’ to be a word. The first example shows the necessity of eliminating inser- 
tions between non-meaningful elements (i.e., between ‘g’ and ‘ate’). The second 
example shows that even this is not enough, for here the insertion takes place 
between ‘school’ and ‘s,’ in other words, at a morph boundary. Much of the pro- 
cedure is motivated by the attempt to discover a unit which permits only certain 
specifiable insertions. The result is the determination of a unit here called the 
‘nucleus,’ intermediate between the morph and the word in length. For any 
utterance, m ≥ n ≥ w where ‘m’ is the number of morphs, ‘n’ the number of
nuclei and ‘w’ the number of words. Having defined the nucleus, we test all
nucleus boundaries to see if they are word boundaries. Unlimited possible inser-
tion of nuclei at a nucleus boundary makes it a word boundary. Since our pro-
cedure gives us word boundaries, and words are defined simply as the stretches be-
tween boundaries, the requirement of continuity is necessarily fulfilled.

Another feature of the procedure which perhaps requires some preliminary
explanation is that it is entirely contextual in the sense that it provides a method
for dividing a particular utterance into word units. We do not ask, as is usually
done, whether ‘hand’ is a word in English but whether, in the utterance ‘the hand
is quicker than the eye,’ the sequence ‘hand’ constitutes a word. This is because
in many instances we want a sequence (e.g., Latin ‘trans’) sometimes to be a
word, as the preposition meaning ‘across’ in ‘coelum non mentem mutant qui
trans mare vehunt’ but sometimes to be part of a word, as when compounded with
a verb in ‘sic transit gloria mundi.’ 


3.3.3. Definition and Clarification of Terms

The first unit to be considered is the morph substitution class (MSC), in terms of
which it will be possible to define the key nucleus unit referred to above. A morph
substitution class is a set of single morphs (minimal units with a meaning) which
in a given context may substitute for each other. For example, in the sentence 
‘the singer broke the contract’ the morph ‘sing-’ in ‘singer’ belongs to an MSC 
which contains ‘sing-,’ ‘play-,’ ‘min-’ and other members, since ‘the player broke 
the contract’ and ‘the miner broke the contract’ are possible utterances; ‘reform-’ 
does not belong to the class since ‘re-form’ consists of two morphs. It might be 
thought that the use of the concept of the MSC in defining the word involves a 
vicious circularity in that the definition of morph implies the comparison of 
word units in order to isolate minimal components. In fact, however, the notion 
of the word is not necessary here, and Harris and others have specified procedures 
for defining the morph while ignoring the word as a unit.” 
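
As a rough illustration of the substitution test just described, the following Python sketch collects the single morphs that can fill one slot of a fixed context. The small set of acceptable utterances, the hyphen notation for morphs, and the function name `morph_substitution_class` are assumptions made for the example. They stand in for informant judgments and are not part of the procedure as the text states it.

```python
def morph_substitution_class(frame, acceptable, candidates):
    """Return the candidate morphs that give an acceptable utterance in the frame.

    frame      -- (left, right): tuples of morphs before and after the slot
    acceptable -- set of utterances, each a tuple of morphs (assumed informant data)
    candidates -- single morphs to be tried in the slot
    """
    left, right = frame
    return {m for m in candidates if left + (m,) + right in acceptable}


# Hypothetical informant data: utterances already cut into morphs.
ACCEPTABLE = {
    ("the", "sing-", "-er", "broke", "the", "contract"),
    ("the", "play-", "-er", "broke", "the", "contract"),
    ("the", "min-", "-er", "broke", "the", "contract"),
    ("the", "re-", "form-", "-er", "broke", "the", "contract"),
}

# The slot is the position occupied by 'sing-' in 'the singer broke the contract'.
FRAME = (("the",), ("-er", "broke", "the", "contract"))

msc = morph_substitution_class(
    FRAME, ACCEPTABLE, ["sing-", "play-", "min-", "re-", "form-"]
)
```

Under these assumed data `msc` contains ‘sing-,’ ‘play-’ and ‘min-’ only. ‘Reformer’ is accepted as an utterance, but it fills the slot with two morphs, so neither ‘re-’ nor ‘form-’ enters the class by itself.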


The varying methods of defining morphs will almost all turn out to have no effect on 
our end result. The only exception is that the type of discontinuous morphemes described 
by Harris in his ‘Discontinuous Morphemes’’ is naturally excluded, since such discon- 
tinuous elements are known to belong to different ‘words’ before we begin. We do not 
allow discontinuous morphs except such as have constantly numbered sequences of phonemes
in their gaps. For example, in classical Arabic we have a morph q—t—l ‘kill’ in
‘qatala zaydan’ ‘He killed Zaid,’ but the number of dashes is restricted. Most disputed 
cases of morph division involve combinations such as ‘receive’ or ‘huckleberry,’ in which 
each of the elements belongs to such a small and unique MSC that nothing can be inserted 
anyway and either solution, as one morph or two morphs, leads to the same result. 

Another proviso must be made: sometimes a substitution can apparently be made but 
the two morphs are not members of the same class. 

One further limitation is necessary regarding what may be accepted as a morph. Some- 
times intonational and other features extending over phrases or sentences are considered 
as morphs. Only prosodic elements simultaneous with a single segmental element, for example,
tone in Chinese or stress in English, are accepted here. It is self-evident that a unit
which extends over a whole sequence such as a sentence cannot be relevant to the problem 
of its internal subdivision into words. 


The next notion to be defined is that of a thematic sequence. In the example of 
‘sing-er’ above we saw that ‘re-form,’ although a sequence of two morphs and 
representing two MSC’s, behaved in the construction ‘reform-er’ like a single
MSC, that containing ‘sing-,’ ‘play-,’ etc. A sequence of two or more MSC’s will
be said to constitute a thematic sequence (1) if there is some single MSC for which
it may always substitute and yield a grammatical utterance and (2) if none of the
MSC’s of the sequence is equivalent to, that is, has exactly the same membership
as, this single MSC for which the sequence may substitute. The thematic sequence
may be said to form a theme and to be an expansion of the single MSC for
which it may substitute. 
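
The two conditions of this definition can be written out as a small check. The Python sketch below is illustrative only: each MSC is modelled as a frozen set of morphs, and condition (1), which rests on informant judgments of grammaticality, is passed in as a ready-made truth value. The name `is_thematic_sequence` and the class memberships shown are assumptions for the example.

```python
def is_thematic_sequence(sequence, single_msc, always_substitutes):
    """Apply the two conditions for a thematic sequence.

    sequence           -- list of MSC's, each a frozenset of morphs
    single_msc         -- the single MSC the sequence is compared with
    always_substitutes -- condition (1), assumed to be settled by informant test
    """
    if len(sequence) < 2:
        return False  # the definition speaks of two or more MSC's
    if not always_substitutes:
        return False  # condition (1) fails
    # Condition (2): no MSC of the sequence has the same membership as single_msc.
    return all(msc != single_msc for msc in sequence)


# Assumed memberships, abbreviated for the example.
DUCK = frozenset({"duck-", "gos-"})
LING = frozenset({"-ling"})
HEN = frozenset({"hen", "chicken", "goose"})

duckling_is_theme = is_thematic_sequence([DUCK, LING], HEN, always_substitutes=True)
# A hypothetical sequence whose first MSC is the very class it would replace.
same_class_case = is_thematic_sequence([HEN, LING], HEN, always_substitutes=True)
```

With these inputs `duckling_is_theme` is true, while `same_class_case` is false because it violates condition (2).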


Thematic expansion includes both what is usually called derivation and what is called
compounding. Thus ‘duck-ling’ is a sequence of two morphs which is called a derivational
construction. It consists of the MSC containing ‘duck-,’ ‘gos-,’ etc. and the MSC containing
‘-ling’ as its only member. It may substitute for the single MSC containing ‘hen,’
‘chicken,’ ‘goose,’ etc. among its members, and neither of its constituent MSC’s is equivalent
to this latter class since both contain members (‘gos-,’ ‘-ling’) not found in the MSC of
‘hen,’ ‘chicken,’ etc.


7 Z. Harris, From morpheme to utterance, Language 22. 161-83 (1946).
15 Z. Harris, Discontinuous morphemes, Language 21. 121-7 (1945).


We are now ready to define nucleus. A nucleus is either (1) a single MSC which 
is not part of a thematic sequence or (2) a thematic sequence of MSC’s. Among 
single MSC’s are some which are expandible into thematic sequences but are not 
expanded in the particular construction analyzed, and some which are not. In the 
sentence ‘the farmer killed the ugly duckling’ there are nine morphs: (1) the (2) 
farm- (3) -er (4) kill- (5) -ed (6) the (7) ugly (8) duck- (9) -ling. There are seven 
nuclei: (1) the (a nonexpandible MSC) (2) farm-er (a thematic expansion containing
two MSC’s) (3) kill- (a single MSC expandible e.g. into ‘un-hook-’) (4) -ed
(a nonexpandible MSC) (5) the (as above) (6) ugly (a single MSC expandible 
into ‘un-god-ly’) (7) duck-ling (a thematic expansion consisting of two MSC's). 

There remains finally the distinction between nucleus boundaries which are also 
word boundaries and those which are not. There are a number of ways of stating 
the distinction which give practically the same results. The one adopted here is 
as follows: a nucleus boundary is an intraword boundary if and only if a fixed
number of nuclei may be inserted including those with zero members. Often 
nothing may be inserted. (Zero can be considered a limiting instance of a fixed 
and finite number.) It is a word boundary in the excluded instance, that is, when 
insertions are possible and they are not fixed in number, e.g., if both three and 
five are possible. Usually an indefinitely increasing number of insertions is 
possible, that is, there is ‘infinite’ insertion at word boundaries. In the above 
sentence no nucleus can be inserted between (3) kill- and (4) -ed and therefore it 
is not a word boundary. Between all the others, sequences of nuclei may be 
inserted of varying length and, in fact, without limit. Thus between (1) ‘the’ 
and (2) ‘farm-er’ we can insert ‘very, headstrong, cruel, unloveable,’ etc.; between
(2) ‘farm-er’ and (3) ‘kill-’ can be inserted ‘who lives in the house which is on the
road that leads into the highway,’ etc.
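
The boundary test can be sketched in a few lines of Python for the example sentence. The nuclei are those listed above. The sets of insertion lengths are invented for illustration and stand in for informant judgments: each set gives the lengths, counted in nuclei, of the insertions accepted at one boundary, with an empty set meaning that nothing can be inserted. Insertions of the repetitive kind excluded by the special rule are assumed to have been removed from these data already, and the function names are hypothetical.

```python
def is_word_boundary(insertion_lengths):
    """Word boundary: insertions are possible and are not fixed in number."""
    return len(set(insertion_lengths)) > 1


def group_into_words(nuclei, lengths_at_boundaries):
    """Group a non-empty list of nuclei into words, one length set per boundary."""
    if not nuclei:
        raise ValueError("at least one nucleus is required")
    if len(lengths_at_boundaries) != len(nuclei) - 1:
        raise ValueError("need one set of insertion lengths per nucleus boundary")
    words = [[nuclei[0]]]
    for nucleus, lengths in zip(nuclei[1:], lengths_at_boundaries):
        if is_word_boundary(lengths):
            words.append([nucleus])    # a new word starts here
        else:
            words[-1].append(nucleus)  # intraword boundary: same word continues
    return words


# 'the farmer killed the ugly duckling', each nucleus given as a tuple of morphs.
NUCLEI = [("the",), ("farm-", "-er"), ("kill-",), ("-ed",),
          ("the",), ("ugly",), ("duck-", "-ling")]

# Assumed informant data for the six boundaries between the seven nuclei.
INSERTIONS = [{1, 2, 3, 4}, {3, 5, 14}, set(), {1, 2}, {1, 2, 3}, {1, 2}]

words = group_into_words(NUCLEI, INSERTIONS)
m = sum(len(nucleus) for nucleus in NUCLEI)  # number of morphs
n = len(NUCLEI)                              # number of nuclei
w = len(words)                               # number of words
```

On these data the sentence has nine morphs, seven nuclei and six words, and ‘kill-’ and ‘-ed’ fall together in one word because nothing can be inserted between them.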

There is one kind of insertion which must be forbidden by a special rule since it can be
carried out at any nucleus boundary whatever. This consists of one whose initial nucleus is
the same as the nucleus after the boundary and whose final nucleus is the same as the nucleus
before the boundary. In the above sentence, we might insert between (4) kill- and (5) -ed
‘-ed and slaughter-,’ producing ‘the farmer killed and slaughtered the ugly duckling.’ Here
‘-ed,’ the initial morph of the insertion, belongs to the same nucleus as (5) -ed and ‘slaughter-’
is a member of the same nucleus as (4) kill-. An indefinite number of such insertions
of varying lengths is always possible.

Phoneme modifications at word boundaries, often known as word sandhi, make no difference
to the analysis if they are regular. If the modification can be stated in
terms of the occurrence of phonemes, that is, is phonologically regular, the result is merely
to restrict the insertion at the boundary to the subclass which begins with a particular
set of phonemes. But by a well-known theorem in set theory, an indefinite enumerable
set subtracted from an indefinite enumerable set still leaves an indefinite enumerable
set. For example, the exclusion of all odd numbers still leaves an indefinite enumerable set.

There is one rare type of occurrence in which sandhi gives rise to a single phoneme in place
of the final of one nucleus and the initial of the next. In Sanskrit if a nucleus ends in
basic /-n/ and the next begins with /l-/, the result
is a single phoneme /l̃/, a nasalized lateral. In this case the number of words is determinate,
but the ascription of /l̃/ to the former or latter is arbitrary. If we changed our phonemic
analysis to make /~/ a supra-segmental phoneme we could divide /l̃/ into two phonemes
and assign /~/ to the former word and /l/ to the latter. A similar argument applies to junctural
phonemes. In the example from English used here the junctures have not been written.
They would not affect the analysis.


The present definition of nucleus resolves the contradiction between phono- 
logical and grammatical definitions of words. In the former, it is not the presence 
of stress or some other marker which demarcates the word, but the existence of 
stress or other variation or shift which produces different classes whose analysis 
by the present distributional (grammatical) method generally justifies such an 
apparently phonological procedure. 


For example, in Latin, what is usually called a word is stressed on the penultimate
syllable if this is long, on the antepenultimate if it is short. This suggests a phonological
definition of the word unit on the basis of this rule of stress. The enclitic -que (‘and’) is
reckoned as a syllable with any preceding sequence in locating the stress which serves as
a marker of word boundaries under this definition. Thus, traditionally dóminus (‘lord’)
and dominúsque (‘and the lord’) are both single words. Under the present purely distributional
analysis likewise, dominúsque will be one word, and not two. Dominús- belongs to
the same nucleus as legatús-, puér- and all other stress-shifted nominative singular masculine
substantives which may be substituted for it. Since no nucleus can be interposed between
the nucleus of dominús- and that of -que, -ve and other enclitics, dominúsque is a
single word. Even in monosyllables where there is no stress shift, mús (‘mouse’) and the
mús- of músque (‘and the mouse’) are members of different nuclei since the former can
only be substituted by dóminus, puér, etc. and the latter by dominús-, puér-.
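
The stress rule cited here is simple enough to state as a function. In the Python sketch below each form is given as a list of truth values, one per syllable, marking the syllable as long or short. The division into syllables and the length values are supplied by hand and are assumptions of the example, as is the function name. The treatment of forms of one or two syllables follows the usual school rule and is not stated in the text.

```python
def stressed_syllable(syllable_is_long):
    """Return the index (from 0) of the stressed syllable of a Latin word.

    syllable_is_long -- list of booleans, one per syllable, True if long
    """
    count = len(syllable_is_long)
    if count == 0:
        raise ValueError("a word needs at least one syllable")
    if count <= 2:
        return 0              # assumption: short forms are stressed on the first syllable
    if syllable_is_long[-2]:
        return count - 2      # long penultimate carries the stress
    return count - 3          # otherwise the antepenultimate does


# do-mi-nus: the penultimate 'mi' is short, so stress falls on 'do'.
dominus = stressed_syllable([False, False, False])
# do-mi-nus-que: with the enclitic counted in, the penultimate 'nus' is long.
dominusque = stressed_syllable([False, False, True, False])
```

The two results, index 0 for ‘dominus’ and index 2 for ‘dominusque,’ reproduce the shift from dóminus to dominúsque on which the argument turns.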


3.3.4. Some Psycholinguistic Implications 

The concept of nucleus as defined here is essentially a unit of which there is 
always a single fixed number in the class of words which are mutually substitutable
in the same construction. As such it corresponds to the notion of ‘positions’
in the word as developed by Boas in connection with the description of American 
Indian languages. It may find application beyond that of its utility in the present 
definition. For example, it might well be investigated psychologically as a possible 
fundamental encoding or decoding unit. 

It has been seen that intraword nucleus boundaries and those which coincide 
with word boundaries are different in the choices presented to the speaker. In the 
former, the next nucleus is determined, or passed over if it has a zero member, by 
the next one not represented by zero in the context. At a word boundary, on the 
contrary, the speaker has a choice among a number of different nuclei. It has been 
noted elsewhere in this report that pauses tend to occur at word boundaries 
rather than within the word. Indeed, it may be proposed that the presence of 
potential pause be employed as an independent definition of the word-unit (cf.
section 5). This phenomenon is probably connected with the greater latency which
occurs in psychological experimental situations when a subject is faced with
choices referring to different bases of judgment, even where the number of alternatives
is the same as in a set all involving the same basis of judgment (cf. section
5). At every boundary in speech we must make choices, but within a word we
choose a particular member of a determined class. At word boundaries we must,
in addition, make a semantic selection, the choosing among alternative nuclei.
Indeed, it seems quite possible that the nucleus corresponds to the minimal
semantic unit.

Finally, a possible application of the present analysis to the development of 
child language may be pointed out. It has been remarked as paradoxical that, 
in the child’s speech development, syntactic constructions, supposedly on a 
higher level, occur at a period when morphological distinctions are not yet de- 
veloped. Thus the child says ‘boy run,’ which involves an actor-goal construction 
but ignores the morphological distinction between singular and plural. At this
period of the child's development, however, all utterances have a maximum of 
two ‘words,’ later on three ‘words.’ In accordance with our definition, however, 
‘boy’ and ‘run’ are not words since there is no possibility of indefinite—in fact, of 
any—insertion at their boundary. Hence at this stage the child does not have 
syntax since he does not have word sequences. What he has, since he may substitute
‘girl’ for ‘boy’ or ‘eat’ for ‘run,’ are fixed sequences of nuclei whose rules
of combination are therefore analogous to those within the adult word. When he
learns boundary expansion, his former morphology becomes syntax and a new 
morphology of intraword constructions appears. Hence the paradox is only 
apparent. The child develops a morphology before he handles syntactic construc- 
tions. 


3.4. Hierarchies of Psycholinguistic Units²⁹


The various research techniques for getting at psycholinguistic units suggested 
both here and elsewhere in this report will probably yield evidence for a number
of different types of units. The larger will include clusters of the smaller in the
same way that function classes include morphemes and morphemes include
phonemes, but they will also overlap to varying degrees in all probability. These
units will be found to be related to certain levels of organization within the human
nervous system, which may be tentatively identified as motivational, semantic,
sequential, and integrational. The first of these levels, motivational, is discussed
further in section 7.1. The other three, semantic, sequential and integrational,
are discussed in some detail in section 6.1 in connection with the development of
language behavior.

In general, the question we ask is this: how much of the message depends upon
decisions or choices made at each of these levels of organization and what features of
messages serve as boundary markers of these units? In the case of encoding, we want
to know what segments of messages (output) depend upon intentional decisions
made at motivational and semantic levels as well as what segments depend upon
sequential and integrational organization of vocal skills. In the case of decoding,
we want to know what segments of messages (input) determine
decisions about both emotional state and meaning as well as what segments contribute
to sequential and integrational organization.
There is no requirement that the units of messages discovered be the same for
both encoding and decoding, and the evidence already presented at least implies
that they are not.


29 Charles E. Osgood.

The motivational level, as we are using the term here, is concerned with decisions
of a gross nature—whether to speak or not to speak, and if the former, whether 
to make a statement, answer a question just received, ask a question, give a 
command, or so forth, and within these decisions whether to use an active or 
passive form of address, what to emphasize, and so on. The functional unit here 
would seem to be the ‘sentence’ in a broad, non-grammarian sense or the ‘con- 
struction’ in the linguistic sense. There are features which mark these units as 
being wholes at a gross level, including intonation pattern, stress pattern, and
certain construction markers. These three types of features tend to be somewhat 
redundant with respect to one another, which would be expected if they depend 
upon the same decisions; for example, construction markers like ‘who,’ ‘when,’ 
‘do,’ ‘have,’ ‘will’ at the beginning of an utterance signal that the encoder has 
already selected a question form, and the usual rising intonation is redundantly 
related to this selection to a considerable degree. Motivation obviously influences 
the location of primary stress, but it also modulates relative stress throughout a 
construction. (There are undoubtedly effects which go beyond the bounds of the 
single construction or sentence, but we have enough complexity to worry about 
within this unit!) On the reciprocal decoding side, units for interpreting motiva- 
tional significances are probably the same as above for intonation and stress 
patterns, since decoding here requires the complete utterance (e.g., ‘The boy 
walked down the street alone’ can be suddenly shifted from statement to question 
by a rising intonation between, roughly, ‘a-’ and ‘-lone’). On the other hand, con- 
struction markers that occur at the beginnings of utterances, like ‘How... ,’ 
can function as sufficient segments for decoding motivational significance in 
themselves. 

The semantic level is concerned with discriminations among possible meanings 
(or among alternative representational mediators, if one prefers this less mentalis- 
tic language). What segments of messages as produced by an encoder correspond 
to decisions on this level? We suggest the function class as the encoding unit here. 
This would mean that ‘the new car’ would be a single unit in encoding, not two or 
three units. Some languages provide evidence for such functional units, e.g., when 
the Spanish speaker encodes ‘las bonitas casas’ he must have selected the head of 
the phrase at the time of initiating ‘las.’ It seems likely that the semantic unit for 
the decoder, on the other hand, will be much smaller. We suggest Greenberg’s 
nucleus as a candidate here. Unlike the encoder, who ‘knows’ he is going to
say ‘the little girl with the red hair’ when he starts, the decoder must react 
sequentially to the sound material as it is unreeled, modifying his interpretation 
as new material comes along—‘the’ must be discriminated from other possibili- 
ties, such as ‘a,’ ‘some,’ ‘all’ and so on, ‘little’ must set up a process different 
than what would be started by receiving ‘big,’ ‘hairy,’ ‘green,’ and so forth. The
same is true for grammatical tags: the /-t/ in ‘walked’ must be distinguished
from ‘-s,’ ‘-ing,’ and even ‘zero’ endings.

The sequential level, as we have called it here, concerns the tying together of
either input or output events on the basis of their redundancy and frequency, e.g.,
their transitional dependencies. It would seem that it is here that the word 
appears as a unit in both encoding and decoding. On the encoding side the features 
characterizing the word as a unit would be backward-working skill modifications 
(e.g., the fact that the terminal phone in /haws/ is changed across morph boun- 
daries to make /hawz-/ in the plural)—these seem to operate clearly within word 
units but not beyond, except in trite phrases—and length of junctures (we expect 
that detailed analysis will show intervals between words to be significantly longer 
on the average than junctures within words, even at morph boundaries). On the 
decoding side, the significant feature is probably length of juncture, which cor- 
responds to spacing between words in orthography. There are also grammatical 
sequencing mechanisms that work over larger segments for both encoder and 
decoder.

At the integrational level we are dealing with the smallest building blocks of 
language which, because of their extremely high internal redundancy, high
frequency of occurrence, and limited number, become very tightly welded and 
indivisible units. Again, we feel reason to believe these units are different for 
encoding than for decoding. In encoding this minimal building block seems to be 
the syllable, i.e., these are the minimal motor skill components which are variously 
compounded into words and utterances. Only by considerable effort, if at all, can 
the native speaker produce separate phones—witness his way of ‘saying the 
alphabet,’ in which every ‘letter’ (with the possible exception of the vowels) is 
produced as a syllable (e.g., /ey/, /biy/, /siy/, /diy/, etc.). In decoding, on the 
other hand, the phoneme seems to be the minimal unit. As we have already seen, 
allophones are typically not perceived by the native speaker, but he does make 
decoding discriminations in terms of minimal phonemic contrasts, as between 
/haws/ and /maws/. 

What goes on in the rapid interplay of conversation between an encoder and 
decoder must be tremendously complicated, since it involves operations on all 
these levels simultaneously and in relation to all of these types of units and their 
distinguishing features. In the process of encoding, for example, a speaker may
be motivated toward obtaining some butter for his bread, which influences his
selection of a ‘command’ construction; the automatisms associated with this
intention select the verb form first, and ‘Pass me,’ ‘Gimme,’ ‘Hand me,’ or some
other is encoded; this is followed by the encoding of ‘the butter,’ that member of
the form class which is associated with the representational process established
in butter-using situations. Mechanisms at lower levels in the motor system are
presumably concerned with the calling forth and [...]
which includes one or more syllabic components [...].
The decoding process is equally complex. It should be noted that the hierarchical
analysis suggested here, and particularly the identifications of units and
correlated features, is entirely tentative in nature. A great deal more empirical
evidence is needed.




4. SYNCHRONIC PSYCHOLINGUISTICS I: MICROSTRUCTURE 


Speech communities are knit into systems of social organization by the transfer 
of messages over interpersonal communication channels. These channels are 
made up of a number of different bands over which messages can move synchron- 
ously. There is, of course, the vocal-auditory band which couples movements of 
vocal muscles with stimulation of auditory receptors. It is axiomatic that speech 
is independent of a light source, which is one of its great advantages over most
other avenues of communication. There is also a gestural-visual band which 
couples movements of facial and bodily muscles with stimulation of visual re- 
ceptors. Interpersonal messages in everyday communication travel simultane- 
ously over these auditory and visual avenues, typically reinforcing one another 
but occasionally being in contrast for certain purposes. Other sensory modalities 
(such as smell, touch, taste, and temperature) may participate in communication 
—they certainly do with other species, and the remarkable feats of Helen Keller 
show that they can be highly discriminating even in the human—but they usually 
contribute in limited and unintentional ways, since they are seldom under the 
voluntary control of the encoder. There is finally what we may call the manipu- 
lational-situational band, which via the mediation of ‘things’ that the encoder 
manipulates and the decoder observes also couples the two. In this chapter we 
explore both organization within these bands and interaction between them. The 
result is essentially the outline of a broad area and serves to etch the gaps that 
exist in our empirical knowledge. 


4.1. Within Band Organization 


Each of the bands in the interpersonal channel can be studied internally to 
determine its organization. To what extent is its coding discrete or continuous? To 
what extent arbitrary (in terms of learned social agreements) or natural (in terms 
of innate biological necessities)? What is the organization or structure of the code? 
What classes of events have common significance and reflect common intentions 
(like allophones)? How do the continuously coded signals interact with the discretely
coded signals? Questions of this sort have been asked and answered in
some detail for the vocal-auditory band, since this is the central area of operation 
of the structural linguist. A little work has been done in the gestural-visual 
band, as we shall see, but practically nothing in other bands. We shall therefore 
use work that has been done on the vocal-auditory band as a model for potential 
application elsewhere. 


4.1.1. The Vocal-auditory Band

The study of synchronous bands as a psycholinguistic problem cannot fruitfully
begin on the global level depicted above. Here we restrict our attention to
information carried within the auditory band. The organization of synchronous 
bands within the auditory channel is not itself well understood. Some of the 
variables are, of course, linguistic in the narrow sense: bundles of distinctive
features and hierarchies of configurational features which contribute to the
formal aspects of the message. The auditory channel also includes variables which 
convey information as to the code being used, social relations between the com- 
municators, their geographic origin and physiological states, and their evanescent 
emotional attitudes. These variables are sometimes called ‘voice qualifiers.’

4.1.1.1. Non-linguistic organization. The discretely coded signals in this band
have been explored by linguists and should not concern us here, except as they 
contribute to communication in a fashion which is not subsumed under their 
purely linguistic function. This distinction between linguistic and non-linguistic 
variables is unfortunately not so clear theoretically as it might appear to be from 
the particular scope of the work in which linguists engage. It seems to be possible 
to distinguish linguistic features as discrete and quantized in contrast to the 
continuity of the non-linguistic. But this may reduce in the last analysis to the 
fact that the former have been studied systematically by linguists from a par- 
ticular point of view, e.g., the discreteness may be imposed on the material, while 
the others have not been so treated. Some of the problems discussed here may 
eventually be subsumed under linguistic methodology proper. 

(1) A variety of views. There are several ways in which the non-linguistic 
features can be categorized. Sapir, in an article on “Speech as a Personality 
Trait” suggested two interrelated analyses differentiating the individual from 
the social aspects on the one hand, and distinguishing levels of speech on the
other. The particular levels which he found relevant are the following: 1) Voice 
is the lowest and most fundamental level. 2) On the next level is voice dynamics, 
which subsumes intonation, rhythm, continuity, and speed. 3) Pronunciation 
concerns those variations, individual or social, made upon the phonemes of a 
language. 4) Vocabulary involves the particular selection made by speakers or 
groups of speakers from the lexical pool of a language. 5) Finally, on the highest 
level, style characterizes those typical arrangements that are made of the vocabu- 
lary elements. This classification suggests a number of the variables which may 
be treated.

Sebeok, following, in part, Lotz, has approached this analysis from a somewhat
different point of view. Tentatively, he has suggested the following features as
particularly relevant: 1) Manner of speaking. This is a constant feature of the
individual speaker and may be shared with either the entire speech community
(Japanese is spoken in a higher pitch than, say, English) or with a
social group. 2) Speech organ characteristics. These may be long range, as in the
case of a speaker with cleft palate, or short range, as when the speaker has a full
mouth or a cold. 3) Pragmatic (emotive or expressive) features. These may be
broken down into a) statements about codification used to bring people to
implicit agreement as to the meaning of their messages and, of course, the
code they are using, and b) statements about interpersonal relationships, describing
the emotional relationship of the speakers, their mutual status and role, and
the felt success of their communicative efforts.


Thomas A. Sebeok, Donald E. Walker, and Joseph H. Greenberg.




Another categorization which could be used has been suggested by Greenberg
and Walker. This involves a series of binary distinctions between learned vs.
unlearned, voluntary vs. involuntary, and constant vs. intermittent features.
The unlearned features are by definition not susceptible to the voluntary-involuntary
differentiation.

1) Unlearned:
   a) Constant: this includes such factors as voice quality, the effects of cleft
      palate, deviated nasal septum, and the absolute range of vocalization.
   b) Intermittent: the effects of cough, fatigue, full mouth, and of colds and
      other temporary physiological conditions.
2) Learned:
   a) Involuntary: the following features are those not usually varied voluntarily
      by the individual for specific vocal effects:
      (i) constant: average tempo of speech, vigor of articulation, normal range
          of speaking, normal distribution of allophonic features;
      (ii) intermittent: variations in the above introduced by moods or emotions.
   b) Voluntary: features introduced into the message specifically as vocal
      modifications:
      (i) constant: some characteristic referring to success of communication,
          interpersonal relationships, and statements about codification;
      (ii) intermittent: variations induced by emphasis, intonations for sarcasm,
          encouragement, irony, etc.
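
The scheme of Greenberg and Walker amounts to three binary distinctions with one restriction, and may be set down as a small data structure. The Python sketch below is one possible rendering, not a form the authors give. The class name `FeatureClass`, its field names, and the choice of `None` to mark the distinction that does not apply to unlearned features are all assumptions. The four sample features are taken from the scheme itself.

```python
from dataclasses import dataclass
from itertools import product
from typing import Optional


@dataclass(frozen=True)
class FeatureClass:
    learned: bool
    constant: bool
    voluntary: Optional[bool] = None  # None: distinction does not apply

    def __post_init__(self):
        # Unlearned features are not open to the voluntary-involuntary distinction.
        if not self.learned and self.voluntary is not None:
            raise ValueError("unlearned features take no voluntary/involuntary value")
        if self.learned and self.voluntary is None:
            raise ValueError("learned features must be voluntary or involuntary")

    def label(self):
        parts = ["learned" if self.learned else "unlearned"]
        if self.voluntary is not None:
            parts.append("voluntary" if self.voluntary else "involuntary")
        parts.append("constant" if self.constant else "intermittent")
        return ", ".join(parts)


def all_classes():
    """List every combination of the three distinctions that the scheme allows."""
    allowed = []
    for learned, constant, voluntary in product(
        [False, True], [True, False], [None, False, True]
    ):
        try:
            allowed.append(FeatureClass(learned, constant, voluntary))
        except ValueError:
            continue  # combination excluded by the restriction
    return allowed


EXAMPLES = {
    "voice quality": FeatureClass(learned=False, constant=True),
    "cough": FeatureClass(learned=False, constant=False),
    "average tempo of speech": FeatureClass(learned=True, constant=True, voluntary=False),
    "intonation for sarcasm": FeatureClass(learned=True, constant=False, voluntary=True),
}
```

Because of the restriction on unlearned features, the three distinctions yield six classes rather than eight, which matches the six headings of the outline.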

Given some such scheme as the three presented above, two problems must be 
considered. The first involves the utility of the classification itself, but, indepen- 
dent of any particular means of categorization, it should be possible to specify 
relevant variables by the consistency of their identification in experimental situa- 
tions. The second problem involves specification of the particular phenomena in 
the sound material which represent these variables and determination of the 
ranges of variation permitted. Once such identifications have been made, it 
should be possible to study relations of linguistic to non-linguistic features. 


(2) Sketches of specific systems. In Hungarian, there is a distinctive feature of length.
Expressively, it is possible to distort this feature so that long is substituted for short, and
over-long for long. There is no phonemic stress. Expressively, also, it is possible to stress a
syllable other than the first, usually the third. This illustrates the fact that expressive
features can be superimposed on distinctive features, or, again, a distinction can be
introduced which is not phonemic. A third possibility consists of the substitution of a
contrast already in the language in a position where it does not ordinarily occur. Fourth, it is
possible to introduce an entirely new (from a phonemic point of view) phone into the
language for expressive purposes. The above are all carried on the vocal-auditory band
but are not phonemic. Spanish. One might consider these bands as consisting of levels of
information. For example, the information ‘relative social position’ is usually expressed
in Spanish by the morphemic contrast between ‘tú’ and ‘usted,’ indicating the categories
of familiarity and politeness. However, in addition, this same information may be carried
on another level, namely by the use of the diminutive morpheme ‘-ito’ under certain
conditions. This morpheme may be used in any conversation with the meaning ‘diminutive’ or
‘endearment.’ When added to a word like ‘Adios,’ however, the information carried is
merely that there is relative familiarity between the speakers. In terms of the problem
being considered here, one may consider this use of the morpheme as an introduction of a
new contrast into the code, a contrast expressed not by any morphemic distinction, but
expressed by use of a certain morpheme in special environments. English. Similarly, in
English, the use of a particular allophone in a special environment may be considered as
giving additional information. The distribution of the voiceless aspirated stop, [tʰ],
is usually limited to initial position. It is used, however, by some speakers in final position


PSYCHOLINGUISTICS: A SURVEY OF THEORY AND RESEARCH PROBLEMS 77 


instead of the customary unreleased stop. The usual effect of such usage is an unfavorable 
one on the listener, which is usually expressed as ‘an attempt at putting it on’ or ‘over- 
careful enunciation,’ etc. This then may be considered as the use of a non-phonemic con- 
trast as an expressive feature. 


(3) Research proposal: Determination of non-linguistic features and correlated 
variations in the sound material. The experimental situation here requires elimina- 
tion of all communication bands other than the vocal-auditory. This can be 
accomplished by having subjects speak through screens, in the dark, over the 
radio, or onto tape recordings. The latter recommends itself as permitting the 
most control, delayed uses of the material, and sampling of the most natural 
situations. The experimental situation also requires deliberate variation in the
physiological, emotional, social and other characteristics of speakers that are 
assumed to be transmitted as information by non-linguistic features. The general 
proposal below is to (A) obtain judgments as to these characteristics of speakers 
from representative hearers, based on tape recordings, and correlate them with 
observable features in the sound material, and then (B) experimentally vary 
what seem to be the relevant features on a series of tapes and see if in fact these 
variations are accompanied by changes in the judgments presumably dependent
on them.


(A) Record the conversation of two individuals in natural encounter (as in someone’s 
office, in role playing situations, in therapy sessions, and so forth). Play the consecutive, 
uninterrupted remarks of one speaker to a group of subjects and ask for spontaneous 
comments about the characteristics of the individual. Then ask for specifications of age, 
sex, physical condition, emotional state, the apparent audience, social status, etc. Then
request the same subjects to indicate as best they can the basis, in the sound material,
for each of these judgments. Check communality of judgment and correlate with both the 
original speaker’s judgments about himself and those of independent judges who have 
witnessed the original communication situation. This experiment could be varied by 
using conversational material in which both speakers are heard. Another variation would 
be to use artificially structured situations, with participants instructed to act out particular
relationships under particular assumed emotions.

(B) Having obtained evidence in (A) as to what variables in the sound material seem
to function as cues for such judgments, it should then be possible to introduce electronic
modifications in samples of the same speech which alter certain of these variables
systematically (at least those variables which can be so modified). If our analysis is
valid, then the consistency and extremeness of judgments about, say, anger should be
continuously variable by modifying, say, pitch, amplitude, and rate of speaking in
combination. There is a considerable body of research already available³¹ which would
guide and sharpen experiments on this problem.

4.1.1.2. Linguistic organization.³² Early students of language described speech
sounds as similar if they gave the ‘impression’ of similarity, i.e., if they were
perceived as similar. Such judgments are influenced by the physical characteristics
of sound, but also by the perceptual characteristics of hearers. More recently,
phoneticians have defined speech sound similarity in terms of either spectrographic
analysis (acoustical phonetics) or positions of the articulatory organs
(motor phonetics). In this section we suggest a general logic and procedure that
may be applicable to analyzing the internal structure of any band of human
communication, even though it is discussed here in relation to linguistic sound
material.

31 See particularly Cantril and Allport, The psychology of radio, and the
series of research papers by Grant Fairbanks on quantitative vocal correlates.
32 Kellogg Wilson and Sol Saporta.

(1) Phonetic, phonemic and psychological spaces. While the exact nature and 
number of variables needed for adequate description are not agreed upon, it 
seems reasonable to suppose that speech sounds can be regarded as occupying 
positions in a multidimensional space in which each of the variables used in 
describing the sounds corresponds to a dimension of the space. The dimensions 
of such a space are defined in physical terms and may correspond to either
discrete or continuous variables, or a combination of both; however, the categories
of a discrete variable should be ordered in such a way (e.g., four degrees of
increasing length) that they can be regarded as a quantized continuous variable.
We shall regard the position of a sound in this phonetic space as constituting its
phonetic reality. The phonetic space is invariable in the sense that the language of
the speaker or hearer of a sound does not affect it. The phonetic space is con- 
tinuous in an operational sense, since no sound could conceivably be assigned to 
the same position as another if measurement were sufficiently refined. The 
physical similarity of speech sounds is measured by the distance between their 
positions in this multi-dimensional phonetic space. 
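The distance notion just described can be sketched in a few lines. This is only an illustration under stated assumptions: the two dimensions (first and second formant, in Hz) and every numerical value below are hypothetical placeholders, not measurements from this survey, and the choice of unscaled Euclidean distance is ours.

```python
import math

# Hypothetical positions of three sounds in a two-dimensional phonetic
# space (first and second formant, in Hz). Values are illustrative only.
phonetic_space = {
    "i": (280.0, 2250.0),
    "e": (400.0, 2000.0),
    "a": (750.0, 1200.0),
}

def phonetic_distance(a, b, space=phonetic_space):
    """Euclidean distance between two sounds in the phonetic space."""
    return math.dist(space[a], space[b])
```

With these placeholder values, `phonetic_distance("i", "e")` comes out smaller than `phonetic_distance("i", "a")`, i.e., the first pair is physically more similar. In practice the dimensions would need to be put on comparable scales before distances are compared.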

Phonemic analysis results in another—and more convenient—way of describ- 
ing the speech sounds of a language. We can regard the analysis of a language into 
a set of k phonemes as defining a space consisting of k mutually exclusive regions 
which correspond to the phoneme classes. The position of a sound in this phonemic 
space will be regarded as constituting its phonemic reality. The phonemic space is 
variable in the sense that the position of any sound depends on the divisions 
imposed by the language code of its users. Also, the phonemic space is discrete and 
unordered in the sense that two sounds are either in the same or different regions 
(i.e., in the same or different phoneme classes), and a statement that one pair of 
sounds is ‘more alike’ than another is meaningless. Thus, two sounds are either 
phonemically the same or phonemically different. 
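The contrast with the phonetic space can be made concrete with a small sketch. The phone-to-phoneme assignments for the two languages below are hypothetical and serve only to show that the phonemic relation is a yes-or-no matter and that it depends on the code of the language.

```python
# Hypothetical phoneme assignments for two languages: the same phones
# fall into different regions depending on the language code.
language_a = {"p": "/p/", "pʰ": "/p/", "b": "/b/"}
language_b = {"p": "/p/", "pʰ": "/pʰ/", "b": "/b/"}

def phonemically_same(x, y, classes):
    """True if both phones lie in the same region of the phonemic space.

    The result is only same/different; no degree of likeness is defined.
    """
    return classes[x] == classes[y]
```

Under these assumed assignments, `phonemically_same("p", "pʰ", language_a)` is true while `phonemically_same("p", "pʰ", language_b)` is false, although the phonetic positions of the two sounds are unchanged.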

Conceptually, the simplest possible relation between the phonetic and phonemic
realities would be one in which the regions of the phonemic space correspond
to clusters of sounds in the phonetic space. Thus, phonetically similar sounds would
be phonemically identical. However, we would not necessarily find such a simple 
relationship, as the following example indicates: let us consider the vowel [ʌ] in
the word ‘buzz,’ the unstressed vowel [ə] of ‘Rosa’s,’ and the unstressed vowel
[ɨ] of ‘roses’ in the dialect of speakers who distinguish between the last two.
Laboratory measurement would probably indicate that the last two are more 


33 The technique of phonemic analysis used by Jakobson and his associates, in which
phoneme classes are determined by sets of binary distinctive features, gives a discrete
ordered phonetic space, since similarity of phones may be regarded as varying with the
number of distinctive features shared by their phoneme classes.


phonetically similar than the first two. On the other hand, a phonemic analysis 
is very likely to assign the first two to /ə/ and the third to /ɨ/.³⁴

The lack of exact correlation between the phonetic and phonemic similarity of 
sounds can be at least partially attributed to the perceptual habits of the speakers 
of a language. These habits permit their possessors to respond differentially to 
some phonetic differences and to ignore others. The effects of these habits are 
most evident in the accents and misinterpretations of people who are learning 
a foreign language. The use of impressionistic judgments of ‘similarity’ by the 
linguist is justified if such judgments approximate these perceptual habits. 
Nevertheless, we need an objective technique of describing these habits which 
is not affected by the results of a phonetic or phonemic analysis of the speech 
sounds of a language. Our purpose here is to outline such a technique and to 
suggest experimental procedures needed to apply it. 

The end result of the technique to be proposed below will be to generate a 
psychological space containing a set of speech sounds. A measure of psychological
similarity, which indicates the degree to which a pair of speech sounds are per- 
ceived as similar by a group of subjects, will form the basis of this psychological 
space, which will be continuous like the phonetic space but variable like the pho- 
nemic space. The dimensions of this space should indicate the bases for discrimina- 
tion between the speech sounds employed by the subjects, these dimensions 
constituting a minimum set of ‘distinctive features’ needed to make the dis- 
criminations involved in the ordering of the speech sounds. 


While determination of the psychological space is independent of the associated phonetic 
and phonemic spaces, we can expect the results expressed in the psychological space to be 
dependent on the results expressed in the phonetic and phonemic spaces. The psychological 
space must be related directly to at least some sub-space of the phonetic space, since at
least some of the dimensions of the phonetic space must correspond to the differential
stimuli to which the subjects respond. The psychological space must also be related to
the phonemic space, since two sounds cannot contrast and be used to indicate differential
‘meaning’ in the same phonetic environment if they are not discriminated. Thus, the
difference between the phonetic and psychological spaces represents a transformation in
the ordering of speech sounds produced by the perceptual habits of a set of speakers of a
given language; it may be regarded as the result of a sort of phonemic analysis in which
each cluster is a group of psychologically similar sounds sharing distinctive features of
similar values, but where distributional criteria are ignored. The difference between the
psychological and phonemic spaces represents a transformation in ordering produced by
considering distributional criteria alone. 


34 Cf. the vowel phonemes as presented by Smith and Trager in Outline of English
structure (1951). The phonetic and phonemic symbolism used here is theirs.

(2) Experimental proposal. The human perceptual apparatus operates so that
the same speech sound does not always produce the same perception. Thus, it can
be said to behave like a communication channel with some degree of noise, since
the distribution of output events is not perfectly predictable for each input event.
Following this analogy, it seems reasonable to say that two input events (speech
sounds) are similar to the extent that they produce similar conditional distributions
of output events (perceptual discriminations). The first experimental
problem discussed below (A) concerns a method of determining similarity in
perceptual judgments of speech sound, a modification of the psychophysical
method of paired comparisons being finally selected. Experimental procedure and
selection of materials are then described under (B); for demonstration purposes
the suggested analysis is limited to the cardinal vowels. Finally, under (C), we
suggest a possible way of treating such data, essentially a computation of ‘distances’
between speech sounds as perceived, based upon the conditional distributions
of forced-choice judgments.
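The channel analogy can be illustrated with a minimal sketch. The response counts are invented for illustration, and total variation distance is our own stand-in for "similarity of conditional distributions"; it is not the measure the authors go on to define under (C).

```python
def conditional_distribution(counts):
    """Turn response counts for one input sound into p(output | input)."""
    total = sum(counts.values())
    return {response: n / total for response, n in counts.items()}

def total_variation(p, q):
    """Half the summed absolute difference between two distributions."""
    keys = set(p) | set(q)
    return 0.5 * sum(abs(p.get(k, 0.0) - q.get(k, 0.0)) for k in keys)

# Hypothetical counts: for each sound presented (outer key), how often
# each sound was reported as heard (inner key).
confusions = {
    "i": {"i": 18, "e": 2, "a": 0},
    "e": {"i": 3, "e": 16, "a": 1},
    "a": {"i": 0, "e": 1, "a": 19},
}
rows = {sound: conditional_distribution(c) for sound, c in confusions.items()}
```

With these invented counts, `total_variation(rows["i"], rows["e"])` is smaller than `total_variation(rows["i"], rows["a"])`: the first two inputs produce more similar output distributions and would count as the more similar pair.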


(A) Method for determining psychological similarity. We need to select some differential
response pattern which is indicative of our subjects’ perceptions. The obvious approach
is to simply ask them what they hear when a particular sound is presented, but there is no
reason to suppose that untrained native speakers could make a coherent report. On the
other hand, phonetic training would probably change the similarities the subjects have
learned to perceive as native speakers and hence destroy the very condition we wish to
study. Articulation tests—in which subjects select a spoken word from several
alternatives—can be used with naive subjects, but they have several disadvantages for
our purposes:

(a) It is necessary to use a different set of words for each group speaking a different
language or dialect, making it impossible to compare the pattern of
similarities of different language or dialect groups under identical conditions.

(b) Since all possible phonetic environments are not found in the words of a given
language, our results would be confounded by the dissimilar environments in which
the sounds we are studying occur.

These objections could be avoided by use of one of the psychophysical techniques. It
would be possible to present the same group of stimuli to any group of subjects and to
present each sound in a particular phonetic environment or set of environments.

The most applicable of the conventional psychophysical techniques is the method of
paired comparisons, in which the subject is presented with a pair of speech sounds and
asked to state whether they are the ‘same’ or ‘different.’ However, it is a common observation
that subjects do not have the same criteria of ‘sameness,’ so we might well have some
subjects saying that two sounds are the same because they appear to be ‘similar’ and
others who would say that two sounds are ‘different’ because they are only similar. We
may avoid this limitation of the method by presenting our subjects with sequences of
three instead of two sounds. The subjects would be given response sheets with the letters
a b c opposite the number corresponding to each sequence, and they would be asked to
cross out the position of the sound least like the other two. Thus, the responses of our
subjects will be based on a simple and relatively unequivocal forced choice rather than
on the rather complex judgment of ‘sameness.’

The main disadvantage of this technique is the large number of sequences which it is
necessary to use. If we use all possible orders, i.e., all permutations, of n speech sounds in
sequences of three different sounds and in sequences where two sounds are the same,³⁵
there would be

\[ \frac{n!}{(n-3)!} + \frac{2(n!)}{(n-2)!} \]


35 The reasons for using each sound paired with itself will be made clear later when the
measure of similarity is introduced.



sequences. For n = 14, this quantity is 2,548. If we use only all possible combinations,
each combination being represented by but one of its possible orders, there would be

\[ \frac{n!}{3!\,(n-3)!} + \frac{2(n!)}{2!\,(n-2)!} \]


sequences. For n = 14, this quantity is 686. Obviously, there would be considerable saving
of time if it were feasible to use only all combinations. Therefore, it would be well to run
at least one pilot study using all possible permutations of a small number of speech sounds 
to determine if the order of presentation has any effect on the patterns of judgments made 
by the subjects. 
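The figure of 2,548 ordered sequences quoted for n = 14 can be checked with a short sketch. It assumes the count is n!/(n-3)! sequences of three different sounds plus 2(n!)/(n-2)! sequences in which two of the sounds are the same, which is the reading that reproduces the stated total; the function name is ours.

```python
import math

def ordered_sequence_count(n):
    """Number of ordered three-sound sequences for n speech sounds.

    math.perm(n, 3) is n!/(n-3)!, the sequences of three different sounds;
    2 * math.perm(n, 2) is 2(n!)/(n-2)!, the sequences with a repeated sound.
    """
    return math.perm(n, 3) + 2 * math.perm(n, 2)

print(ordered_sequence_count(14))  # 2184 + 364 = 2548
```

The count grows roughly as the cube of n, which is why a pilot study on a small set of sounds is a sensible way to test for order effects before committing to the full design.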

(B) Procedure and materials. The temporal order of events given below for the presenta- 
tion of each sequence seems to be adequate: 


1 sec.    Announcement of no. of sequence
1 sec.    Silence
½ sec.    First speech sound
½ sec.    Silence
½ sec.    Second speech sound
½ sec.    Silence
½ sec.    Third speech sound
3 sec.    Recording of judgment


At this rate, it would be possible to complete 686 sequences in 85.75 minutes and 2548 se- 
quences in 318.50 minutes. In order to preserve uniformity of the sounds employed, it 
would be desirable to record each sound but once and ‘assemble’ sounds for experimental 
presentation by re-recording them on magnetic tape.
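The running times quoted above can be reproduced with a brief sketch. It assumes that each speech sound and each silence between sounds lasts half a second, which is the reading of the schedule that agrees with the stated totals; the variable and function names are ours.

```python
from fractions import Fraction

half = Fraction(1, 2)

# Durations in seconds, in the order of the schedule: announcement,
# silence, first sound, silence, second sound, silence, third sound,
# recording of judgment.
durations = [1, 1, half, half, half, half, half, 3]
seconds_per_sequence = sum(durations)  # 7.5 seconds

def total_minutes(n_sequences):
    """Total presentation time in minutes for a given number of sequences."""
    return float(n_sequences * seconds_per_sequence / 60)

print(total_minutes(686))   # 85.75
print(total_minutes(2548))  # 318.5
```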

Because there is more agreement concerning the articulatory position, the distinctive 
features, and the role of the formants in the production and reception of the vowel sounds, 
it would be advisable to begin this type of analysis with a set of cardinal vowels. The use 
of cardinal vowels has the additional advantage that this material may be used with speak- 
ers of various languages to determine the effect of language on perception of phonetic 
similarity. In order to be sure of their exact acoustic qualities it would be well to have 
these sounds produced by some electronic apparatus. 

(C) Treatment of data. Let i, j and k represent any of the set of n speech sounds and
let p(i; j/k) represent the estimated probability that k will be judged the most dissimilar
member of the sequence i j k.

Let p(i; j) = Σ_k p(i; j/k). The measure p(i; j) appears to be related to the joint
probability of the production of sound i and the perception of sound j, p(i, j). However,
it differs from p(i, j) in that:

(a) p(i; j) = p(j; i) while p(i, j) does not necessarily equal p(j, i)
(b) p(i; j) is relative to the choice of sounds with which i and j are compared.

Point (a) is not an overly serious objection since a relation of similarity should be
symmetric; i.e., a should be just as similar to b as b is similar to a. Point (b) merely
indicates that p(i; j) is relative to the situation in which it is determined—a limitation
equally true of any estimate of p(i, j), although the precise nature of the [...].

We shall define the distance between sounds i and j, D(i, j),³⁶ as

\[ D(i, j) = \sqrt{\sum_{r} \left[ p(i; r) - p(j; r) \right]^{2}} \]

where r is any of the complete set of speech sounds of which the sequences are composed.
If two sounds are similar, they should be judged as similar to other sounds to the same
degree. In this case, all of the differences, p(i; r) − p(j; r), should be zero or near zero
so that D(i, j) is small. If i and j are usually perceived as dissimilar we would expect all
of these differences to be large so that D(i, j) would be large. If three sounds, i, j and k,
are ordered along the same dimension, the distances between the three possible pairs will
be such that D(i, j) + D(j, k) will be equal to D(i, k) within the limits of sampling error.

36 Cf. Osgood and Suci, A measure of relation determined by both profile and mean
difference information, Psychological Bulletin 49 (1952).
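A minimal sketch of this treatment of data follows. The trial format, the function names, and all judgment counts are hypothetical; the sketch also assumes that order of presentation within a sequence can be ignored, which the text proposes to check in a pilot study. It estimates p(i; j/k) as the proportion of presentations of a given three-sound set in which k was crossed out, sums over k to obtain p(i; j), and then computes D(i, j).

```python
import math
from collections import Counter

def pair_similarity(trials, sounds):
    """Estimate p(i; j) = sum over k of p(i; j/k) from forced-choice trials.

    Each trial is (triple, odd): the three sounds presented and the index
    (0, 1 or 2) of the sound crossed out as least like the other two.
    """
    shown = Counter()   # presentations of each set of three sounds
    chosen = Counter()  # times k was crossed out, leaving the pair (i, j)
    for triple, odd in trials:
        shown[tuple(sorted(triple))] += 1
        i, j = sorted(s for pos, s in enumerate(triple) if pos != odd)
        chosen[(i, j, triple[odd])] += 1
    p = {(a, b): 0.0 for a in sounds for b in sounds}
    for (i, j, k), n in chosen.items():
        share = n / shown[tuple(sorted((i, j, k)))]
        p[(i, j)] += share
        if i != j:
            p[(j, i)] += share  # similarity is symmetric
    return p

def distance(p, i, j, sounds):
    """D(i, j): root of summed squared differences p(i; r) - p(j; r)."""
    return math.sqrt(sum((p[(i, r)] - p[(j, r)]) ** 2 for r in sounds))

def repeat(triple, outcomes):
    """Expand {odd_position: count} into a list of individual trials."""
    return [(triple, odd) for odd, n in outcomes.items() for _ in range(n)]

# Hypothetical judgments: a and b are often confused, c stands apart.
sounds = ["a", "b", "c"]
trials = (
    repeat(("a", "b", "c"), {2: 8, 0: 1, 1: 1})
    + repeat(("a", "a", "b"), {2: 6, 0: 4})
    + repeat(("a", "a", "c"), {2: 10})
    + repeat(("b", "b", "a"), {2: 6, 0: 4})
    + repeat(("b", "b", "c"), {2: 10})
    + repeat(("c", "c", "a"), {2: 10})
    + repeat(("c", "c", "b"), {2: 10})
)
p = pair_similarity(trials, sounds)
```

With these invented judgments, `distance(p, "a", "b", sounds)` is close to zero while `distance(p, "a", "c", sounds)` is large. The sequences in which a sound is paired with itself matter here: without them p(i; i) would be zero and the profile of a sound could not resemble the profile of its nearest neighbor.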

If the number of dimensions in the psychological space is three or less we should be 
able to simply construct a physical model which preserves the proportionality of the dis- 
tance measures. The nature of the dimensions could be determined from the clusterings of 
the sounds along the dimensions or at their end-points. If the number of dimensions is 
greater than three, it would be necessary to apply some factor analytic procedure. Suci has 
developed a technique which can be directly applied to distance measures, and other factor 
analytic techniques could be applied to correlation matrices of p(i; r) with p(j; r) for all 
pairs of i and j.³⁷
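Two of the checks mentioned above lend themselves to a short sketch: the additivity test for three sounds lying on one dimension, and the correlation of p(i; r) with p(j; r) that would feed a factor analysis. The tolerance standing in for sampling error and the small table of p values are assumptions made for illustration, and this is not Suci's technique, which is not described in the text.

```python
import math

def on_one_dimension(d_ij, d_jk, d_ik, tolerance=0.05):
    """True if D(i, j) + D(j, k) equals D(i, k) within a tolerance.

    The tolerance (5% of D(i, k) by default) is an arbitrary stand-in
    for the limits of sampling error.
    """
    return abs(d_ij + d_jk - d_ik) <= tolerance * d_ik

def profile_correlation(p, i, j, sounds):
    """Pearson correlation of p(i; r) with p(j; r) over all sounds r.

    Undefined (division by zero) if either profile is constant.
    """
    x = [p[(i, r)] for r in sounds]
    y = [p[(j, r)] for r in sounds]
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical similarity values p(i; r) for three sounds.
sounds = ["a", "b", "c"]
p = {
    ("a", "a"): 1.6, ("a", "b"): 1.6, ("a", "c"): 0.1,
    ("b", "a"): 1.6, ("b", "b"): 1.6, ("b", "c"): 0.1,
    ("c", "a"): 0.1, ("c", "b"): 0.1, ("c", "c"): 2.0,
}
```

Here the profiles of a and b correlate perfectly, while each correlates negatively with the profile of c; a matrix of such correlations for all pairs is the kind of input the factor analytic techniques mentioned above would take.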


(3) Some applications of psychological space. The technique indicated above is 
tedious and can be applied only to small sets of sounds at one time. However, we 
were unable to devise any simpler technique which is compatible with the demands
of scientific rigor. Even with these limitations, it can be a valuable research
tool.

The psychological space serves as a sort of transition stage between the pho- 
netic and phonemic ordering of speech sounds and can serve to clarify the nature 
of phonemic analysis. It also might provide a more objective measure of the 
perceived similarity of speech sounds than the impressionistic judgment of 
even an expert linguist. The ordering of a language’s sounds in a psychological 
space could be used as a standard to select between two equally simple, exhaus- 
tive and non-contradictory phonemic analyses. Furthermore, we may use this 
technique to test Jakobson’s hypothesis concerning the binary nature of the dis- 
tinctive feature. If he is correct, we would expect the sounds to form clusters in 
the psychological space such that each cluster marks the end of one of the dimen- 
sions. 

There are other potential applications to linguistic theory. Consider, for ex- 
ample, the contrast between voiced and voiceless consonants in Spanish and 
English. In Spanish, the contrast between voiced and voiceless is phonemic 
between [p] and [b] and [t] and [d], but allophonic in [s] and [z]. Therefore, we
would expect that the psychological space for a group of Spanish speakers would 


37 Cf. an unpublished paper by Suci; and R. B. Cattell, Factor analysis. After the statistical
technique above had been devised, we found that a similar technique had been devised
by Warren S. Torgerson (Psychometrika, 17. 401-19 [1952]). Torgerson introduces some
refinements not present in our technique which require additional assumptions about the
nature of his measures and their distribution and which lead to lengthy and laborious
computation. It should be noted that Torgerson seems concerned with developing a psychometric
measuring device while we are concerned with the less demanding task of determining
the ordering of speech sounds in an exploratory fashion.

38 In the discussion of the seminar group it was suggested that the sounds may have to
be put in specific phonetic environments to obtain the desired relation to the phoneme
space. There is nothing in the nature of the technique or the basic theory to prevent this
being done, but it is evident that the use of particular environments would severely limit
the generality of the results. Therefore, the more general technique was suggested in the
hope that phonetic environments will have generally slight effects.



reflect this linguistic fact by placing the allophonic pair closer together than the 
phonemic pairs. Another situation arises when Spanish speakers are asked to 
distinguish between [f] and [v], since the latter sound does not occur in their 
language. The most likely outcome here would seem to be that [v] will be psycho- 
logically similar to another sound in Spanish so that the relation between [f] and 
[v] will correspond to the relation between [f] and [b]. This effect is suggested by
the errors made by Spanish speakers in learning English. It is possible that the 
nature of the psychological space may indicate the effect of morphophonemic 
relations. For example, the fact that in English /t/ and /d/ are alternates of a 
very common ‘past tense’ morpheme should make them more psychologically 
similar than corresponding pairs such as /p/ and /b/ or /k/ and /g/. 

Another set of hypotheses may be explored by obtaining psychological spaces 
for the same set of speech sounds from speakers of various languages. As mentioned
above, it seems likely that speakers of different languages will show differences
in their psychological spaces which correspond to differences in the phonemic
spaces of the languages. The effect of learning a second language, or of
bilingualism, on the psychological space could also be investigated; our example 
concerning Spanish speakers would imply that a relatively greater distance 
between /f/ and /v/, indicative of a phonemic distinction, should be associated 
with Spanish-English bilingualism. Finally, it would be of interest to obtain a 
sort of ‘asymptotic’ psychological space, using subjects highly trained in distin- 
guishing between speech sounds. Such a space should indicate the complete set 
of discriminations which the human perceptual apparatus is capable of making. 

4.1.1.3. Levels of awareness of linguistic differences.³⁹ Utterances differ at many
different levels—phonetically, phonemically, in word order, stress, intonation
pattern, and grammatical construction. It is usually assumed that native speakers
can identify some of these differences but not others. In particular, it is said that 
they cannot hear allophonic differences. The following analysis is intended to 
describe some procedures for testing whether discrimination has occurred between 
two utterances which are similar in all respects but one. If it can be demonstrated, 
for example, that speakers consistently report no difference between allophones, 
but that their responses to allophones differ in some other way, then we will be 
better able to infer the nature of the decoding processes involved. 


(1) Verbalization about differences. These may be of two varieties: (a) The subject
points out the linguistic feature that is different. (b) The subject reports how he feels
about the difference, what different information it gives him either about the content of
the utterance or about the speaker. Differences in word order and grammatical construction,
for instance, may be recognizable as features for subjects even though [...]
in their report about the information the differences give them. Shifts in [...],
while not specifically identifiable, may by many subjects be taken to indicate that the
speaker has a certain dialect, comes from a certain group, etc.

(2) Indirect verbal indices. In certain cases, subjects may report no difference, or
may report a difference but not know whether or in what way it [...]. Using association
methods, or the semantic differential (cf., section [...]), one could take a phrase
(e.g., ‘strong man’), vary

39 Susan M. Ervin.



the stress or intonation pattern, and test the effects on these indices. Voice qualifiers might 
be studied in this way also. One of the problems here would be that a whole utterance is 
somewhat difficult to use as a stimulus, but subjects might be instructed to respond only 
to the last word, as they have been in some context studies using word clusters. 

(3) Non-verbal responses. The subject would be conditioned to some sound, in a certain
context, and generalization to other sounds or to contexts containing the same sound 
would be measured. Or, conversely, one could determine how easily discrimination is 
learned. PGR and finger movements in response to shock might be appropriate to use here. 
This technique would be particularly useful for phonetic and allophonic discrimination 
studies. For example, it could be used to test generalization between phonetically dis- 
similar allophones which are not similar in sensory features. 


These techniques could be used with several variations, such as varying the 
degree of audibility to see effects on level of awareness. Also, the location of the 
difference in the utterance could be varied. It might be hypothesized that differ- 
ences occurring at points of high transitional entropy are more likely to be 
noticeable. Other variables which should be related to level of awareness of differ- 
ences are the following: age, education, amount of contact with other languages 
and dialects, types of personality (presumably intellectualizers are more aware of 
language differences than repressors), characteristics of the language itself, and 
so on. 
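The hypothesis about points of high transitional entropy presupposes a way of estimating that quantity. As a minimal sketch, assuming a sample of utterances segmented into units (the four toy utterances below are invented), the entropy of what follows each unit can be estimated from observed transitions:

```python
import math
from collections import Counter, defaultdict

def transitional_entropy(sequences):
    """Entropy, in bits, of the next unit given the preceding unit."""
    following = defaultdict(Counter)
    for seq in sequences:
        for prev, nxt in zip(seq, seq[1:]):
            following[prev][nxt] += 1
    entropy = {}
    for prev, counts in following.items():
        total = sum(counts.values())
        entropy[prev] = -sum(
            (n / total) * math.log2(n / total) for n in counts.values()
        )
    return entropy

# Invented sample of utterances, already segmented into words.
utterances = [
    ["the", "man", "ran"],
    ["the", "dog", "ran"],
    ["the", "man", "sat"],
    ["a", "dog", "ran"],
]
h = transitional_entropy(utterances)
```

In this toy sample the position after "man" has one bit of uncertainty (two equally frequent continuations), while the position after "dog" has none; on the hypothesis above, a difference introduced after "man" would be the more likely to be noticed.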


4.1.2. The Gestural-visual Band 


It is apparent from casual observation that distinctive movements of facial and 
bodily musculature are part of the total communication process—one can get a 
considerable amount of information from a completely silent movie, for example. 
This band of the communication channel is strictly equatable with the linguistic 
band: a set of responses on the part of one individual (encoder) produces stimuli 
which can be interpreted by another individual (decoder). This band is capable 
of the same type of analysis that has been given the vocal-auditory system, but 
relatively little has been done.⁴⁰ Such study would require (1) descriptive analysis
of the gestural-visual code itself —which is coming to be known as kinesics—and 
(2) analysis of the relations of these messages to the intentions (encoding) and 
significances (decoding) of communicators—which might be called
psychokinesics.

4.1.2.1. Kinesics. A very promising beginning in the study of gestural communi- 
cation has been made by Birdwhistell in strict analogy with the techniques of 
linguistics. A particular motion or posture of a given part of the organism (facial 
or bodily) is called a kine (equivalent to phone). The first step in the analysis of 
any gestural system would be a complete ‘transcription’ of the kines in their
sequential context of one or more ‘informants’ from a given language-culture
community. Birdwhistell describes a notation system for transcribing or recording
kines which is unfortunately (but perhaps necessarily) very complex and cumbersome.
In just the same sense that phoneticians require training in objective
listening, so kinesiologists require training in objective looking—the untrained
observer will be likely to perceive only those movements which are significant in
his own ‘language.’

40 See, however, Ruesch and Bateson, Communication, and a series of articles by the
same authors; D. Efron, Gesture and environment (1941); R. L. Birdwhistell, Introduction
to kinesics (Foreign Service Institute, 1952), and the references he cites.

The second step in analysing any gestural ‘language’—again, in parallel with
linguistics—would be to determine what movements are significant in the code, 
i.e., what classes of kines constitute kinemes (equivalent to phonemes) by virtue 
of having the same significance. The movements which constitute such classes 
would be called allokines (cf., allophones), and they would also be characterized 
by either conditioned variation (e.g., types of smiles varying somewhat with 
antecedent facial posture) or free variation (e.g., winking with right or left eyes 
being equivalent in significance and independent of context). Individual members 
of a gestural community would be expected to vary somewhat (cf., idiolects), 
particularly in the features allowing free variation, and to show some constant 
transpositions, e.g., variations in the general amplitude of gestures. The general 
procedures of the kinesiologist, as described by Birdwhistell, would be to try 
out various ‘minimal pairs’ of kine patterns (for example, variations in eyebrow 
position with the rest of the facial pattern constant) and get from ‘informants’ 
judgments of ‘same’ and ‘different’ in meaning. The equivalent of morphemes, or 
perhaps words, in gestural language would be total patterns of facial and bodily 
posture which, as wholes, have distinctive significance but lose this significance 
when broken up. To the best of our knowledge, there has been as yet no complete 
analysis of any gestural language by this method. 

There are, of course, a great many questions that need to be answered about 
kinesics. For one thing, the direct application of linguistic methods implies that 
events in the gestural-visual material are discretely coded at some level, e.g., that 
elevation of the eyebrows is either present or absent and thus either does or does 
not signal something; it seems quite possible, however, that we are dealing here 
with continuously coded materials, e.g., that the degree of judged ‘surprise’ or 
‘horror’ or other kinemorph including this feature will be found to vary con- 
tinuously with the degree of eyebrow elevation. Another question concerns the 
innate vs. learned nature of the signs here. Birdwhistell takes the position that 
all kinemes are learned, but there is considerable evidence for cross-cultural 
similarities of expressions of at least certain intense emotions going back to the 
work of Darwin. And there is, of course, the question of whether or not there is 
any communication via the gestural-visual medium, and whether or not this 
band is completely redundant with respect to linguistic and situational contexts. 
There are the well-known psychological studies on judgment of emotion from 
facial expressions which seem to show that when the situational context is 
removed, accuracy of judgment approaches zero—if you do not see the baby
being pricked with a pin, you're as likely to call his expression ‘joy’ as you are
‘pain.’

4.1.2.2. Psychokinesics. This brings us to the problem of psychokinesics, rela-
tions between the characteristics of communicators and the characteristics of the
gestural-visual messages they exchange. The question raised above about the
validity of the gestural-visual band as a communication medium is actually a


86 PSYCHOLINGUISTICS: A SURVEY OF THEORY AND RESEARCH PROBLEMS 


psychokinesic problem: to what extent are particular gestures, facial and bodily,
conditionally dependent upon ‘intentional’ states of encoders and to what extent are 
‘significance’ states of decoders conditionally dependent upon particular gestures? 
One way of getting at this problem experimentally would be to have the same 
communicator repeatedly produce gestures appropriate to the same intention 
(e.g., repeatedly pose ‘anger,’ ‘consternation,’ ‘boredom,’ and the like); we would 
anticipate certain variable kines and perhaps certain constant kines to appear, 
the latter being critical to encoding. Similarly, we could repeatedly present 
moving or still pictures of the same gestures to the same individuals for interpre- 
tation, to determine the degree of consistency in decoding. The question of 
whether or not we are dealing with a ‘language’ in the interpersonal sense would 
require replicating individuals in the same design above, i.e., do different encoders 
and decoders drawn from the same community agree in the gestures used to 
represent certain intentional states and in the interpretations of certain gestures? 
Questions of this sort apparently have not been considered by Birdwhistell. 

Psychologists have been interested in these problems over a considerable
period,41 but have limited themselves pretty much to facial gestures as ‘expres-
sions of the emotions.’42 The issue has generally been phrased as follows: (1) are
facial expressions valid indices of the actual emotional states of the encoder? In 
other words, can judges accurately infer emotional states from facial gestures? 
The results obtained here are rather discouraging. Although accuracy is reason- 
ably high when facial gestures appear in situational and linguistic contexts (e.g., 
a picture of a woman running from a fire and heard screaming, “Save me! Save
me”), it is very poor when these supports are removed. However, many studies
purporting to get at this question have actually been designed to answer a quite 
different one: (2) is there social agreement on the meaning of facial expressions, 
quite apart from what the ‘real’ emotional state of the encoder may be? That this 
was actually the question being asked is evident from the fact that many studies 
have used professional or amateur actors deliberately posing certain facial ex- 
pressions on demand. Even here, however, results have been inconsistent, partly 
because of difficulties in scoring ‘correctness’ (e.g., should we count a judgment
of ‘scorn’ in the same category with ‘contempt’?) but also because there are still 
two different issues being confused. (3) Do facial expressions validly communicate 
the intended states of the encoder, regardless of his ‘real’ feelings? Here correctness 
of judgment by observers is determined by the instructions given the actors. (4) 
Regardless of what the intention of the encoder may be, do observers in a given 
culture agree on the meanings of the facial gestures they perceive? This final 
question eliminates the skills of the encoder entirely, and we merely look for 
evidence for structure or agreement among decoders.


An experiment on question (4) above provides evidence for a considerable degree of 
communication via facial expressions.43 Numbers of different college student subjects

41 See Woodworth, Experimental psychology (1938).

42 However, see the work of M. Krout on other gestures.

43 Osgood, Suci, and Heyer, The validity of posed facial expressions as gestural signs
in interpersonal communication. Paper delivered at American Psychological Association
meetings, Pennsylvania State College, 1950.




posed 40 different emotional states (from the labels given them) under lighting conditions 
that emphasized the lines and shadows of the face. The labels for these same 40 states 
were written on the blackboard and student observers were instructed to select that one 
label which seemed to best fit each seen facial posture. Each state was posed by five different 
actors and judged by five different groups of observers, orders of presentation being ran- 
domized between groups. Since correlation with the ‘intent’ of the actor was not involved 
at this point, the 40 samples of judgments for the intended states were treated simply as 
reactions to that many independent facial stimulus situations. If the expressor intended 
‘anxiety’ but most observers perceived states like ‘dreaming sadness’ and ‘quiet pleasure,’ 
it made no difference in the computations. The question was: to what degree are variations 
in the use of one label correlated with the use of other labels? If ‘disgust’ and ‘contempt’ are 
similar in meaning—and if facial expressions do have different effects as stimuli—then 
any facial stimulus that calls forth one label should also tend to call forth the other, and 
vice versa.

Coefficients of agreement were computed for each label with every other label, yielding 
a 40 × 40 matrix which was analysed by the difference method and the results represented
in a solid model. The distances between all of these labels were reproducible in only 
three dimensions with a high degree of accuracy, indicating the existence of only three 
major factors. The structure had a roughly pyramidal form: going upward and out from 
one corner at ‘complacency’ was a series of increasingly pleasant expressions terminating 
at another corner with ‘joy;’ going outward and left along the base of the pyramid from 
‘complacency’ was a series of increasingly compressed or grim expressions, running through 
‘contempt’ and ‘cynical bitterness’ and terminating on ‘sullen anger;’ outward from ‘com- 
placency’ and toward the right along the base of the pyramid was a series of increasingly 
open and traumatic expressions, running through ‘expectancy,’ ‘awe,’ and ‘anxiety,’ and 
terminating at the front right corner in ‘horror;’ finally, running across the front face of 
the model was a series of equally traumatic and tense expressions, but from ‘sullen anger’ 
through ‘rage,’ ‘dismay,’ and ‘fear’ over to ‘horror.’ Given this structured character of the 
decoded significance of expressions, it becomes possible to experimentally manipulate 
gestural components (e.g., kines relating to the mouth, eyes, nose, and so forth) and deter-
mine what variations in the encoding correspond to variations in significance. That facial
gestures do have considerable validity as signs in communication is indicated by the ex-
istence of structure in the judgments—only to the extent that the changing stimulus
characteristics of the face did have commonly accepted meanings which restricted judg-
mental categories could anything other than chaos (unplotability) have resulted from
this method. 
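
The computation described in this experiment can be illustrated with a minimal sketch. It assumes a table of counts (facial stimuli by labels) and uses cosine similarity between label columns as a stand-in for the coefficient of agreement, which the text does not specify. The three labels and all counts are invented for illustration and are not data from the experiment.

```python
from math import sqrt

def label_agreement(counts):
    """counts[s][l] = number of observers choosing label l for stimulus s.

    Returns a labels-by-labels matrix of cosine similarities between
    label-usage profiles (an assumed stand-in for the agreement coefficient).
    """
    n_labels = len(counts[0])
    # One usage profile per label: how often it was chosen for each stimulus.
    columns = [[row[j] for row in counts] for j in range(n_labels)]

    def cosine(u, v):
        dot = sum(a * b for a, b in zip(u, v))
        norm_u = sqrt(sum(a * a for a in u))
        norm_v = sqrt(sum(b * b for b in v))
        return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

    return [[cosine(columns[i], columns[j]) for j in range(n_labels)]
            for i in range(n_labels)]

# Hypothetical counts: 4 facial stimuli, labels = disgust, contempt, joy.
counts = [
    [6, 4, 0],
    [5, 5, 0],
    [0, 1, 9],
    [1, 0, 9],
]
agreement = label_agreement(counts)
```

In this toy table ‘disgust’ and ‘contempt’ are called forth by the same stimuli and so agree closely, while each agrees poorly with ‘joy’; a matrix of this kind is what would then be examined for its dimensionality.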


4.1.3. The Manipulational-situational Band


All we can do here is to sketch in the types of communication materials which 
would be included under this rubric. Again, we may divide this band into the 
discretely and arbitrarily coded materials vs. the naturally and continuously 
coded (discreteness and arbitrariness do not necessarily go together in opposition 
to continuousness and naturalness, but we suspect that they usually do). The 
whole field of orthography could be treated in this context—the writer (encoder) 
produces a product via his manipulations and this product, in a letter or a printed 
page, constitutes the object-situation to which the reader (decoder) responds. In 
this case we would be dealing with arbitrary and discrete coding. Somewhat less 
arbitrary and certainly less discrete would be the use of symbolism, as in cartoons, 
the ‘V’ for ‘victory,’ ‘thumbs up,’ the political elephant, and so on. On the same 
continuum is aesthetics—again, encoders (artists, musicians, and the like) produce 


44 Cf., Osgood and Suci, Psychological Bulletin 49 (1952).




certain products via their specialized manipulations and these products serve as 
the source of aesthetic stimulation for decoders (appreciators and critics). Here, 
although there may be a certain arbitrariness or conventionality in the code 
(witness the fact that ‘primitive’ peoples often have great difficulty perceiving 
the objects in drawings that are to us quite realistic), it certainly is continuously 
organized—the mood of ‘excitement,’ say, probably varies continuously with the 
brightness of color, shape of forms, and so forth. Perhaps more obviously manipu- 
lational-situational are many of the acts of everyday communicating—leaving a 
key under the doormat, hanging mistletoe above the archway, moving your castle 
to a position where it confronts your opponent’s queen, and even breaking and 
bending twigs and grass in a way that unintentionally communicates your course 
to a pursuer.


4.2. Between Band Organization45


The notion of sequential redundancy between parts of a message as serially 
unreeled is now a fairly common one, particularly as a result of the work of
Shannon, Miller, and others. The notion that there can also be synchronic redun- 
dancy among simultaneous events within the same band or between bands is less 
familiar but equally reasonable. Both linguists and information theorists have 
taken cognizance of redundancies within the linguistic band per se, the former
observing that phonemes are for the most part overdetermined (in terms of 
clusters of correlated features) and the latter reporting that one can experiment- 
ally cut out 50 per cent or more of the total information in the auditory channel 
without seriously hampering intelligibility. There is also redundancy between 
discretely and continuously coded signals in the vocal-auditory band—witness 
how stress is typically accompanied by lengthening of vowels, how stress and 
raised pitch tend to go together, and so forth. Redundancy between bands, e.g., 
between vocal-auditory and gestural-visual bands, has been for the most part 
neglected, although Ray Birdwhistell and H. L. Smith46 have made some very
interesting observations along these lines. Informal observation indicates at least 
two types of relation between communication bands: (1) synchronic complementa-
tion, the usual situation in which gestural signals have the same significance as
vocal signals and hence complement one another; (2) synchronic contrast, the more 
informational situation in which gestural and vocal signals have different (usu- 
ally opposed) significance and hence change each other in some fashion. 


4.2.1. Synchronic Complementation 


At the lowest level, of course, there is constant between-band complementation 
between the vocal-auditory channel and the visible gestures of the speech appara- 
tus itself—the fact that people can learn to ‘read lips’ with high proficiency testi- 
fies to this. The rest of us do much the same thing in traumatic interpersonal 


45 Charles E. Osgood.

46 See Claude Lévi-Strauss, Roman Jakobson, C. F. Voegelin and Thomas A. Sebeok,
Results of the conference of anthropologists and linguists, Indiana University Publications
in Anthropology and Linguistics, Memoir 8 (Baltimore, 1953).




exchanges in which the speaker, under strong emotion, typically exaggerates the 
speech motions. Less obvious and more in need of experimental verification are 
possible ways in which both facial and bodily gestures may complement those
parts of the linguistic band related to motivational and semantic information.
Are there any facial and gestural concomitants of stress, for example—is there
a tendency toward raising of the eyebrows with rising intonation (e.g., at the end 
of a question)? It should be possible to study these and other possible relations 
by the careful analysis of sound-film recordings. Unquestionably gestures are 
related to semantic events in the sound channel—in fact, this is probably the 
primary correlate. The meaning of negation is synchronously encoded in the 
vocal “No” and the shaking head; the meaning of agreement is synchronously 
encoded in the vocal “Yes” and the nodding head. The meaning of ‘being com- 
pletely at sea’ is often expressed by the shrugging of the shoulders while saying 
“How should I know?” or some related sequence. For the more motor expressive
individual, at least, movements of hands, face and trunk keep up a running com-
mentary on his verbal output—“a big boat” is accompanied by cupped, spreading
hands, “I was shocked” is accompanied, perhaps, by retraction of the head and 
popping of the eyes. 

Similar synchronic redundancies can be observed between the manipulational- 
situational band and the vocal-auditory band. The very common use of ‘dood- 
ling’ and diagramming on a pad as a means of facilitating interpersonal com- 
munication about objects and events is an example. Another illustration, here 
of the intimate redundancy between auditory and orthographic inputs, is the 
following: Once while listening to some recordings of Gilbert and Sullivan, with 
the verbal libretto in hand, the writer noticed that by alternately reading the 
words in parallel with listening to them sung and then just listening, he could 
make the auditory material alternately seem perfectly clear and then perfectly 
ambiguous—without the printed guide the sounds were literally meaningless, 
but with the printed material before him, it seemed that the speech sounds 
suddenly became completely intelligible. This demonstration is rather striking 
when experienced, and it has additional implications for the close relation be-
tween perception and meaning. Other examples of redundancy between situa-
tional cues and verbal decoding are legion and often humorous—in a situation
where a knife is needed and you are handing another person this implement, 
you may actually say, “Here, use this plate,” without his noticing the error at 
all; when entering an elevator in the morning and greeting someone with a tip 
of your hat, you may actually say something quite insulting without its usually 
being noticed. 

The psychological basis for complementation between bands seems to be 
quite simple and apparent. From the encoder’s point of view, both the vocal 
response of saying, “No, ...” for example, and the head-shaking gestural re- 
sponse are in a hierarchy associated with the same mediation process of intention, 
e.g., both reactions have been learned in similar situations and associated with 
the same significances. Since these reactions are not incompatible or competing, 
they will tend to be elicited synchronously by occurrence of the negation semantic 




state. Presumably the stronger the motivation operating, the greater will be the 
tendency to overflow into these parallel reaction pathways. From the decoder’s 
point of view, in his own development of decoding behavior he has been exposed 
to many people who use such gestures, and thus repeatedly the elicitation of the 
negation semantic process by the words ‘no’ and ‘not’ and the like has been
accompanied by the head-shaking visual pattern and thereby associated. Here 
again, the decoder has learned to interpret synchrony of correlated signs in 
several bands as increased intensity of motivation on the part of the encoder—if 
he says “no” and shakes not only his head but his whole body in saying it, he
must really mean it! This analysis, of course, does not explain the origin (in cul-
ture or language community) of this parallelism. In general, then, complementa-
tion between bands is based upon the association of reactions (encoder) and cues
(decoder) in different systems or modalities with the same intentions or signifi-
cances.


4.2.2. Synchronic Conflict


It is possible for the encoder to produce gestural signs incompatible with his
vocal signs. These gestural signs may be in direct contrast, may be unrelated or 
irrelevant, or may be simply suppressed, and quite different effects upon decoders 
seem to be produced. 


both cases above the verbal materials were in conflict with the situational context (the 
flat tire, the obviously inadequate argument). 


verbal encoding. Thus, a person may grimace and repeatedly clench his hands while saying, 
“Oh, we had an interesting trip to New York . . . saw a new show and bought some clothes
we really needed.” To the decoder, this is evidence of conflict in the encoder, as if one set
of meanings were directing one encoding system while another were directing the other. 
And this, of course, is one of the clues used by the psychiatrist in diagnosing dissociation. 
The other effect upon the decoder is probably to dilute or make ambiguous the significance 
of what is being said. 

(c) Suppression. The encoder may completely eliminate information via either the
vocal-auditory or the gestural-visual band. In the former case we say the person is being
‘secretive,’ is ‘daydreaming,’ or ‘has something on his mind’—in other words, we interpret 




his gestural display as indicative of active mediational states and interpret his general 
mood therefrom. In the latter case, we speak of the person as being ‘dead-pan’ or ‘poker- 
faced,’ and in general take the lack of normal complementary gestural behavior as indicative 
of inhibition—which of course it is. The typical effect of suppression of information in 
any one of these bands is to make the decoder question the validity of information in the 
other bands. 


As we have seen above, the association of gestural and vocal signs with com- 
mon mediators in decoding and the association of these common mediators with 
equivalent gestural and vocal acts in encoding provides a psychological basis for 
synchronic complementation as the ‘normal’ situation in interpersonal communi-
cation. Conflict between vocal and gestural bands, whether in the form of contrast,
irrelevance, or suppression, necessarily involves some degree of potential con-
fusion on the part of the decoder. For normal communicators, therefore, produc-
tion and interpretation of such effects as sarcasm and irony, deliberate irrelevance
and band suppression implies a certain degree of intelligence—greater discrimina-
tion among overt responses (encoder) or among mediators (decoder) is required.
In this connection it is interesting that the only ‘coded’ type of dissociation 
between bands is that of direct contrast or opposition, as found in irony and 
sarcasm. It is as though only the complete ‘flip-flop’ from one motor reaction to 
its direct opposite in all-or-nothing fashion can be readily handled—note the 
parallel here with tendencies in languages to select binary oppositions in pho- 
nemic signals. The synchronic conflicts introduced by abnormal psychological 
disturbances may involve irrelevance and suppression (but probably not inten- 
tional contrast) and clearly indicate underlying conflict. 


4.2.3. Research Proposals 


The type of research on synchronic interactions will depend upon whether
encoding or decoding is being studied. (A) Encoding. Here one might study the 
relative difficulties of deliberately ‘acting out’ instructions which involve com-
plementation, contrast, irrelevance and suppression—presumably complementa-
tion would be the easiest, ‘most normal’ task and intentional irrelevance the most
difficult. Another research direction would be to experimentally produce states 
of motivation and emotion in which complementation, contrast, and so on are 
relevant, and study encoding with intelligence, for example, as a variable. (B) 
Decoding. Here one immediately thinks of sound-motion movie recording as the 
basic technique, with cutting, splicing, and elimination of bands as means of 
experimental manipulation. In producing the original materials, one could use 
either trained actors (in which case a specific series of ‘intentional’ states could 
be expressed and recorded, with or without situational context) or set up experi- 
mental situations with untrained and unknowing actors. The general procedure 
might be to present the recorded materials under various experimental conditions 
and record judgments from decoders as to their interpretations of encoder inten- 
tional states. One experimental treatment would be to successively eliminate 
bands of information—how does masking out the situational band (leaving 
gestural and vocal) affect the decoder? Eliminating the vocal band? The gestural




band? Which band by itself carries the most information and what kind of infor- 
mation? Does one get evidence for complementation (i.e., enhancement of effects) 
or mere redundancy? Another experimental treatment, particularly with a series 
of particular emotional states acted out by trained actors, would be to deliber- 
ately change the normal between-band complementation. One could, for example, 
have the words originally accompanying a joyful gestural pattern occur with a 
graded series of other gestural patterns, including that for gloom; or one could 
vary the words accompanying a constant gestural pattern. In both of these 
cases, one would have to take care to use verbal materials whose automatic 
speaking gestures were sufficiently similar to each other. Judgment as to ‘sarcasm,’ 
‘mental disturbance,’ ‘secretiveness’ and the like could be secured from the 
decoding subjects. 
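
The band-elimination treatment proposed above amounts to presenting every combination of the three bands. A minimal sketch of how the presentation conditions might be enumerated follows; the band names are taken from the text, while the function name and the ordering of conditions are assumptions made for illustration.

```python
from itertools import combinations

BANDS = ("situational", "gestural", "vocal")

def band_conditions(bands=BANDS):
    """List every non-empty subset of bands, from all bands down to single bands."""
    conditions = []
    for size in range(len(bands), 0, -1):
        conditions.extend(combinations(bands, size))
    return conditions

# Each tuple names the bands left intact in one experimental condition.
conditions = band_conditions()
```

Comparing decoder judgments across these conditions is one way to ask which band by itself carries the most information and whether combined bands enhance one another or are merely redundant.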




5. SEQUENTIAL PSYCHOLINGUISTICS 


Study of the sequential or transitional structure of language behavior provides
a meeting ground for linguists, information theorists, and learning theorists.
The linguist, applying his own methods of analysis, discovers hierarchies of
more and more inclusive units; the information theorist, usually starting with
lower-level units such as letters or words, finds evidence for rather regular
oscillations in transitional uncertainty in message sequences, the points of
highest uncertainty often corresponding to unit boundaries as linguistically
determined; and the learning theorist, working with notions like the habit-
family hierarchy, finds it possible to make predictions about sequential psycho-
linguistic phenomena that can be tested with information theory techniques.
Here we come back once again to the problem of units in encoding and decoding,
the general notion being that at any given level of selection by speaker or hearer
both the transitional probabilities and the correlated indices of habit strength
will be higher within units appropriate to that level than between such units.
And we again find it necessary to think in terms of interactions between hier-
archical levels in the processes of encoding and decoding, a sort of ‘super-Markov
process’ in which selection of higher-order, more inclusive units results in a re-
loading of the transitional probabilities obtaining among lower-order units.
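
One way to picture the ‘reloading’ idea is as a set of transition tables among lower-order units, one table per higher-order unit. The sketch below is only an illustration under that assumption: the unit names (U1, U2, a, b, c) and every probability are hypothetical, and the text does not commit to this particular formalization.

```python
# Hypothetical transition tables among lower-order units ("a", "b", "c"),
# one table for each higher-order unit ("U1", "U2").
TABLES = {
    "U1": {"a": {"b": 0.9, "c": 0.1}, "b": {"a": 0.5, "c": 0.5}},
    "U2": {"a": {"b": 0.2, "c": 0.8}, "b": {"a": 0.5, "c": 0.5}},
}

def next_unit_probs(higher_unit, previous):
    """Transitional probabilities after `previous`, given the selected higher-order unit."""
    return TABLES[higher_unit][previous]

def sequence_probability(higher_unit, sequence):
    """Probability of the transitions in a lower-order sequence under one higher-order unit."""
    p = 1.0
    for prev, nxt in zip(sequence, sequence[1:]):
        p *= TABLES[higher_unit][prev].get(nxt, 0.0)
    return p

p_under_u1 = sequence_probability("U1", ["a", "b", "c"])
p_under_u2 = sequence_probability("U2", ["a", "b", "c"])
```

The same lower-order sequence is more or less probable depending on which higher-order unit has been selected, which is the sense in which the lower-order probabilities are reloaded.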


5.1. Transitional Probability, Linguistic Structure, and Systems of
Habit-family Hierarchies47

This section offers a general picture of how our three approaches intersect and
facilitate one another in understanding sequential mechanisms. It also provides,
by way of concrete illustration, a discussion of hesitation phenomena in ordinary
conversation and lecturing and some hypotheses about such phenomena which
are capable of empirical testing.


5.1.1. Statistical Structure of Messages


Transitional structural analysis assumes units of a given order (phonemes,
morphemes, words) and seeks to ascertain their transitional probabilities:
“Given an occurrence of the unit x, what is the probability that y will be the
next unit to follow?” “That z will be?” Etc. for first-order probability. Or:
“Given the sequence of units xy, what is the probability that z will be the next?”
“That w will be?” Etc. for second-order probability. Similarly for higher-order
probabilities. Or, stated in information-theory terms: “In the case of the occur-
rence of a sequence xy, what is the ‘amount of information’ in the occurrence of y,
given the previous occurrence of x?” (first order). “In the case of the occurrence
of the sequence xyw, what is the ‘amount of information’ in the occurrence of w,
given the previous occurrence of xy?” (second order).
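
These questions can be made concrete with a short sketch that estimates first- and second-order transitional probabilities from a sequence of units by relative frequency, and converts a probability into an amount of information in bits (the negative base-2 logarithm, as in Shannon's measure). The function names and the toy sequence are assumptions introduced for illustration only.

```python
from collections import Counter
from math import log2

def transition_probs(units, order=1):
    """Estimate P(next unit | preceding `order` units) by relative frequency."""
    context_counts = Counter()
    pair_counts = Counter()
    for i in range(order, len(units)):
        context = tuple(units[i - order:i])
        context_counts[context] += 1
        pair_counts[(context, units[i])] += 1
    return {
        (context, nxt): count / context_counts[context]
        for (context, nxt), count in pair_counts.items()
    }

def information_bits(p):
    """Amount of information, in bits, in an event of probability p."""
    return -log2(p)

# Hypothetical sequence of units (letters stand in for any order of unit).
units = list("abababac")
first_order = transition_probs(units, order=1)
second_order = transition_probs(units, order=2)
```

For example, `first_order[(("a",), "b")]` is the estimated probability that b follows a, and `information_bits` applied to it gives the information in that occurrence of b given a.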

The units with which this type of analysis may operate are various. The most 
readily available units are letters of conventional orthography. This choice may 

47 Floyd G. Lounsbury. 




be purposeful, as when the investigator is concerned with the statistical proper-
ties of telegraph messages (e.g., Shannon), or it may represent some linguistic
naiveté (e.g., Newman's Samoan Bible). Where interest is in speech behavior,
the units chosen should be phonemes rather than orthographic symbols. Mor-
phemes are also possible units for this type of analysis, as are words. Zipf did
some simple counting with these orders of units, but he did not obtain transi-
tional probabilities. To the best of our knowledge no one has yet carried out any
systematic transitional analysis involving morphemes or words.

Transitional probabilities are determined generally from natural data, i.e.,
from records of the normal flow of speech. When the units of analysis are words,
however, recourse can be had to the experimental device of the word-association
test and other short-cut procedures (see sections 5.3, 5.4, 5.5). The transitional
probabilities determined in this manner appear to correspond well with those
which might be determined from the analysis of a necessarily large amount of
natural data, though there is not yet conclusive evidence for this. When the
units of analysis are anything less than minimal free forms of a language, no
such experimental short-cut appears to be possible.


5.1.2. Statistical vs. Linguistic Structure


We must distinguish between ‘transitional structure’ and ‘linguistic structure.’ 
The former is a product of statistical analysis; the latter, of linguistic analysis. 
They reflect important differences in statistical and linguistic procedures. In 
the procedure of contemporary structural linguistic analysis, frequency of 
occurrence (of a given unit in a given context, or of a given contrast) is not a 
relevant criterion. Only the possibility of occurrence—as represented by some 
one instance or by many instances of it—is relevant. The answers which are 
sought from data are of a simple yes-or-no type rather than of a how-much
type. In statistical analysis on the other hand, frequencies are the immediate 
goal of analysis, e.g., the probability of occurrence. Statistical procedure usually 
ignores, however, a matter which is basic to linguistics—the distinguishing of 
levels of structure. Linguistic analysis is directed toward the discovery of these 
and their combinatory and hierarchical arrangements. The structure of particular 
utterances is stated in terms of these, and the structural pattern, or grammar, 
of a whole language consists of generalized summary statements of the same. 
Discovery of the hierarchical structure in a language is by means of ‘immediate- 
constituent’ analysis. The boundaries between constituents on the same level 
of structure are established. A sentence—any one on this page for example—is 
not to be broken down simply into all of the units of a given order, such as 
words or morphemes. Rather, the process of immediate-constituent analysis 
is carried out, proceeding from level to level of structure, so that constructions 
on one level are established as constituting the units of structure on the next 
higher level. The criterion for establishing the boundaries between two different 
units on the same level is generally that of maximum substitutability of possible
replacement parts (Wells, Nida, Harris).

Statistical analysis ignores the differing hierarchical values of these boundaries.
All boundaries, for purposes of statistical procedure, are taken as equivalent.
Thus, in the sentence just preceding this one, the boundary between are and
taken, that between taken and as, and that between as and equivalent are not
accorded the different statuses which linguistic analysis would ascribe to them.
They are lumped together as cases of the same sort of thing. The differences
between them which are initially ignored do, however, get reflected in a certain
fashion in the statistical results. These different ‘transitions’ will often be found
to have different probabilities of occurrence. The different transitional proba-
bilities are in a way indexical of the different linguistic statuses of the boundaries
between the words of each pair. The correspondence is only rough, however, and
many other factors besides the linguistic hierarchical statuses of the boundaries
affect the transitional probabilities. The former cannot be derived from the
latter, nor vice versa.
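
A small sketch may help show how word boundaries that are treated alike in the counting can still end up with different statistical values. It computes, for each word in a toy corpus, the uncertainty (Shannon entropy, in bits) about the word that follows it. The corpus and the function name are invented for illustration, and, as the text cautions, such figures are at most a rough index of linguistic status.

```python
from collections import Counter, defaultdict
from math import log2

def transition_uncertainty(words):
    """Entropy, in bits, of the distribution of words following each word."""
    followers = defaultdict(Counter)
    for prev, nxt in zip(words, words[1:]):
        followers[prev][nxt] += 1
    uncertainty = {}
    for prev, counts in followers.items():
        total = sum(counts.values())
        uncertainty[prev] = -sum(
            (c / total) * log2(c / total) for c in counts.values()
        )
    return uncertainty

# Hypothetical miniature corpus; every word boundary is counted the same way.
words = "the man saw the dog and the man saw the cat".split()
uncertainty = transition_uncertainty(words)
```

In this corpus the transition after ‘the’ is much less predictable than the one after ‘saw’, although the counting procedure itself made no distinction between the two boundaries.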

‘Statistical structure,’ then, is to be understood as denoting the system of
transitional-probability relationships between the units of a given order in a
language. (Care should be taken not to confuse the terms ‘order’ and ‘level.’
Words, morphemes, and phonemes are different orders of units. Between units
of the same order strung along in sequence there are boundaries, or ‘transitions,’
belonging to different levels of construction.) ‘Linguistic structure,’ on the other
hand, may be understood as the system of hierarchical combinatory possibilities
between the units of a given order.


5.1.3. Behavioral Levels in Encoding and Decoding 


There are behavioral data of various kinds which support the inference of at 
least three psychological levels of organization of linguistic responses (see section 
6.1). Osgood has distinguished a ‘representational level,’ an ‘integrational level,’ 
and a ‘skill level.’ The triggering of linguistic responses appears to be accomplished 
by a complex of internal stimuli deriving from each of these levels. (It should 
be noted that the use of the word ‘level’ in the present context is independent 
of its use in the different context of the preceding paragraphs.) Stimuli from the 
representational level derive from the meanings or significances of incoming 
stimuli and have been labeled with the roughly characteristic term, ‘intentions.’ 
(Meanings and significances of incoming stimuli, of course, derive not only 
from the external sources, but also from the internal emotive and evaluative 
systems which in turn derive from past experience and learning.) Intentions 
are probably more synthetic than their relatively analytic expression in speech. 
The process of selecting the larger semantic units of language for the expression 
of intentions has been called ‘semantic encoding.’ 

Much of the triggering of linguistic responses, however, is accomplished at a 
lower organizational level of greater automaticity and less conscious awareness. 
Ordering of semantic units, concrete-relational classification of these, concordal 
agreement and certain other relational phenomena appear to belong to the 
‘integrational level.’ This process has been called ‘grammatical encoding.’ The 
final triggering of the motor acts which produce the sounds of speech appears 
to be accomplished on the still lower level of motor skill organization. The 
sequenced triggering of the individual motor acts in speech is accomplished at 
a rate of speed which Lashley showed to be, like the individual motor acts in 
piano playing, too great for each such act to be under specific cortical control 
via feedback mechanisms. This process may be called ‘motor encoding.’ Speech 
pathology, particularly aphasia, shows examples of disturbances in each of the 
above described systems. 


5.1.4. Habit-family Hierarchies and Transitional Probabilities 


Whenever a variety of stimuli terminate in a common response, we have a 
convergent habit-family hierarchy; whenever a given stimulus is associated 
with a variety of responses, we have a divergent habit-family hierarchy. In 
other words, a habit-family hierarchy is a cluster of associations in which one 
of the members, S or R, is common. Associations (habits) vary in strength, 
and variations in habit strength are known to correlate with probability of oc- 
currence of responses (as well as with other indices, such as latency and ampli- 
tude). Habit strength, in turn, is known to depend upon variations in both the 
frequency and contiguity of S-R associations. Information theory measurements 
deal with the probability of occurrence of one event among the class of possible 
events of the same order. If we conceive of an antecedent message event (of any 
order or size of unit) as constituting or indexing a stimulus situation and the 
subsequent message event (of the same order or size of unit) as constituting a 
response, then the transitional probability measurements of information theory 
can be viewed as reflections of the systems of encoding or decoding habit 
strengths. 

Since the linguistic structure of the language and the ‘semantic structure’ 
of the culture is such that certain message events co-occur more often than others 
(frequency of S-R) and certain message events appear closer together in the 
temporal sequence than others (contiguity of S-R), it must follow that at each 
level of organization hierarchies of habits of varying strength will be developed, 
and these will correspond to sets of transitional probabilities. Assuming a constant 
and limited number of alternative events of a given order (phonemes, morphemes, 
words, constructions, etc.), transitions characterized by convergent 
hierarchies should correspond to points of relatively low transitional entropy 
or uncertainty (e.g., where a wide variety of stem morphemes converge upon a 
limited number of suffixes) and transitions characterized by divergent hierarchies 
should correspond to points of relatively high transitional entropy or uncer- 
tainty (e.g., initial phonemes of words following junctures). 
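The contrast drawn above between convergent and divergent transitions can be illustrated with a minimal sketch. It assumes a small invented set of (antecedent, subsequent) pairs; the function name `transitional_entropy`, the labels, and the counts are all hypothetical and are not data from this survey.

```python
import math
from collections import Counter, defaultdict

def transitional_entropy(pairs):
    """Return {antecedent: entropy in bits of the subsequent event}."""
    followers = defaultdict(Counter)
    for antecedent, subsequent in pairs:
        followers[antecedent][subsequent] += 1
    result = {}
    for antecedent, counts in followers.items():
        total = sum(counts.values())
        result[antecedent] = -sum(
            (n / total) * math.log2(n / total) for n in counts.values()
        )
    return result

# Hypothetical observations (assumption, for illustration only):
# stems converge on two suffixes; a juncture '#' is followed by
# eight equally frequent initial phonemes.
pairs = (
    [("stem", "-s")] * 6 + [("stem", "-ed")] * 2
    + [("#", phoneme) for phoneme in "ptkbdgsm"]
)

H = transitional_entropy(pairs)
for antecedent, bits in H.items():
    print(antecedent, round(bits, 3))
```

Under these invented counts the stem-to-suffix transition carries well under one bit of uncertainty, while the post-juncture transition carries three bits, which is the direction of difference the text predicts.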


Beyond these general determinants, the habit strengths of associations within hierarchies 
will vary with frequency and contiguity factors, and so therefore will vary the entropy 
characteristics of sequential sets of message events. If frequency and contiguity factors 
are such that all of the alternatives following a given event are of about equal habit 
strength, uncertainty will be maximal for that number of alternatives; if frequency and 
contiguity factors are such that one event is highly associated with another and other 
events only remotely, a relatively low degree of uncertainty will exist. A number of observations 
of behavioral stereotypy, including the masses of highly regular data about languages 
assembled by Zipf, lead one to the hypothesis that habit-family hierarchies tend toward 
a structure such that habit strengths of the member associations decrease according to a 
logarithmic function of their rank in strength. Further discussion of transitional entropy 
measurements and entropy profiles will be found in section 5.3. 
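The dependence of uncertainty on the distribution of habit strengths can be sketched numerically. The three sets of weights below are assumptions chosen for illustration: equal strengths, one dominant association, and a Zipf-type set in which strength is taken to be proportional to 1/rank (one common reading of a Zipf function, not a form given in the text).

```python
import math

def entropy_bits(weights):
    """Entropy in bits of the distribution obtained by normalizing weights."""
    total = sum(weights)
    return -sum((w / total) * math.log2(w / total) for w in weights if w > 0)

n = 8  # number of alternatives in the hierarchy (hypothetical)

equal = [1.0] * n                                 # all habits equally strong
dominant = [0.93] + [0.01] * (n - 1)              # one strong habit, others remote
zipf = [1.0 / rank for rank in range(1, n + 1)]   # assumed Zipf-type weights

h_equal = entropy_bits(equal)
h_dominant = entropy_bits(dominant)
h_zipf = entropy_bits(zipf)

print(round(h_equal, 3), round(h_zipf, 3), round(h_dominant, 3))
```

With eight alternatives the equal-strength case reaches the maximum of three bits, the dominant-habit case falls far below it, and the assumed Zipf-type case lies between the two.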

As was pointed out above, it seems necessary to view language behavior as organized 
simultaneously on at least three levels, a semantic (representational) level, a grammatical 
(integrational) level, and a receptive-expressive (sensory-motor skill) level. Each of these 
levels is assumed to deal with units of decreasing size, hierarchically arranged such that 
the units at a higher level include units of the next lower level. We assume also that habit- 
family hierarchies of the sort we have been discussing operate at each of these levels. A 
given antecedent event at the semantic level (e.g., the meaning of a stimulus word in free 
association tests) will tend to elicit a hierarchy of subsequent semantic events (e.g., mean- 
ings of associates of variable strength), as indexed by the hierarchical frequencies of overt 
responses (arranged, interestingly enough, according to a Zipf-type function). Similarly, 
reception or production of an antecedent syntactical unit (e.g., a nominal phrase, such as 
the little red schoolhouse . . .) will set up readinesses, based on past redundancies, for a 
variety of subsequent syntactical units (such as . . . sat on a hill, or . . . I love is still there, 
or . . . and barn were painted), and these alternative constructions constitute syntactical 
hierarchies which, although there is no evidence available, probably have a Zipf-type 
distribution. Similar hierarchical arrangements have been demonstrated for phoneme 
sequences. 


Finally, mention should be made of the conditioning or restricting effect of 
context upon selection within hierarchies and hence upon transitional prob- 
abilities. Given only knowledge of the immediately antecedent event at any 
one of these levels, uncertainty as to the subsequent event is maximal (within 
limits imposed by the structure of the hierarchy). As we increase our knowledge 
by taking into account more and more of the sequence of antecedent events—as 
well as subsequent events, in the case of decoding—uncertainty as to the sub- 
sequent event decreases. Psychologically, this is due to stimulus patterning, 
e.g., the stimulus, including traces from past events, becomes more specific 
and hence more precisely associated with a given response than with the others. 
The association of a subject to the single stimulus word, BLUE, is less predictable 
than to the sequence, I’M ALL BLACK AND BLUE. The way in which events 
at superordinate levels reshuffle transitional probabilities at subordinate levels 
(see particularly section 5.3.) can also be understood in terms of the effects of 
contextual stimuli upon modulating the ‘average’ structure of hierarchies. Thus 
the cue effects of a given semantic decision persist through some period of time 
and serve to modify the actual eliciting stimulus pattern at each of a series of 
hierarchical choice points at some lower level of encoding or decoding. This 
conception helps explain an apparent paradox: the fact that a speaker’s sequencing 
is almost perfectly dependable, e.g., he ‘says what he meant to say 
and it always makes sense,’ despite the uncertainty present from the point of 
view of the observer with his entropy measurement. The point is that, from the 
speaker’s point of view, selection at each hierarchy is a simultaneous function of 
all of the preceding sequence and of regulating inputs from all levels of organization, 
whereas, from the entropy estimator’s point of view, selection at each 
hierarchy is being predicted from only first and second order segmental probabilities 
(usually) and takes only a single level of organization into account 
(usually). 
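The restricting effect of context on uncertainty can be sketched by estimating the entropy of an event given its k antecedents for increasing k. The word sequence below is invented for the purpose, and the function name `conditional_entropy` is hypothetical; the same positions are used for every k so that the estimates are comparable.

```python
import math
from collections import Counter

def conditional_entropy(tokens, k, start):
    """Entropy (bits) of a token given its k antecedents,
    estimated from positions start .. len(tokens) - 1."""
    joint = Counter()
    context = Counter()
    for i in range(start, len(tokens)):
        ctx = tuple(tokens[i - k:i])   # empty tuple when k == 0
        joint[(ctx, tokens[i])] += 1
        context[ctx] += 1
    total = sum(joint.values())
    return -sum(
        (n / total) * math.log2(n / context[ctx])
        for (ctx, _), n in joint.items()
    )

# Invented sample text (assumption, for illustration only).
text = (
    "i am all black and blue the sky is blue the sea is blue "
    "and green i am all black and blue"
).split()

max_k = 2
profile = [conditional_entropy(text, k, max_k) for k in range(max_k + 1)]
print([round(h, 3) for h in profile])
```

Because each estimate is computed from the same empirical joint distribution, the values cannot increase as more antecedent events are taken into account, which mirrors the decrease in uncertainty described above. An observer limited to k = 1 or k = 2 will therefore report more uncertainty than a speaker whose selection depends on the whole preceding sequence.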


5.1.5. Pausal, Juncture, and Hesitation Phenomena 


Encoding and decoding processes being as complex as they are, it is always 
difficult to discover easy checks on the type of model described above. The fact 
that habit strength is inversely correlated with the latency between S and R seems 
to offer one avenue of approach, however. At any level of the model just de- 
scribed, the stronger the transitional habits, and hence the lower the transitional 
entropy or uncertainty, the shorter should be the pausal durations separating 
sequential events. This means that within syllables, within familiar morphemes, 
and even within familiar words and phrases the durations of pauses (latencies) 
separating successive events should be minimal, if measurable at all. On the 
other hand, pauses should be somewhat longer, and hence measurable, at 
boundaries between units where transitional habits are weak, the number of 
alternatives large, and hence the transitional probabilities low. The boundaries 
of constructive units might be of this sort. As will be discussed below, what we are 
calling ‘hesitation phenomena’ seem to reflect transitions of low probability 
at the semantic level, and these do not seem to correspond in any simple fashion 
to standard linguistic boundaries. 

Hesitations which interrupt the continuous flow of speech are anything 
from very brief pauses to extended periods of halting, often filled with ‘hemming 
and hawing.’ The phenomena we are speaking of are not to be identified with 
linguistic ‘junctures.’ A variety of phonetic phenomena, including such things 
as brief pauses, ritardando effect, slight articulatory shifts, and even morpho- 
phonemic alternations have at one time or another, or by one writer or another, 
been set up as ‘juncture phonemes.’ But we are not referring to these. Even if 
‘junctures’ sometimes consist of short pauses, the pauses under consideration 
here are not the same. For one thing, there is a difference in duration. Juncture 
pauses which we have seen in spectrographic analysis of speech were in the order 
of a hundredth of a second or less in length. The pauses referred to here, however, 
are appreciably longer. We are not sure of the lower limit in duration of these 
pauses, for measurements have not been made, but in general, certainly, they are 
longer. They often, of course, may last as much as several seconds. Another and 
more important difference is that they do not characteristically fall at the points 
in a sentence where junctures are presumed to fall. 


This last point may be made clearer by means of an illustration. Consider the speech of 
a man lecturing or speaking on a difficult and not too familiar subject and, as we say, 
‘thinking on his feet.’ There are pauses and perhaps quite a bit of hemming and hawing 
as he ‘organizes his thoughts’ or ‘gropes for the right expression.’ Compare his output 
under these conditions with his output if he is reading a prepared and rehearsed typescript 
on a familiar subject, or if he is delivering it after having committed it to memory. In 
the latter case the pauses which we note are those which fall at the boundaries of syntactic 
units, the so-called syntactic junctures. They may be fleetingly brief and few in number, or 
they may be exaggerated, longer, and more frequent for emphasis and stylistic effect, but 
in any case they are distributed systematically in some sort of conformity with the lin- 
guistic structure of the sentence as revealed by immediate-constituent analysis. This is 
not so in the first case where the man was thinking on his feet. To be sure, the syntactic 
junctures appear also here. But in addition there are frequent hesitation pauses. These 
would vary considerably in length, some would be dead-ends from which the speaker retreats 
to start over, some might be filled with hem-and-haw to mark time, etc. But the 
significant thing about these is that the majority of them do not fall at syntactic juncture 
points. Instead of occurring at the boundaries of major syntactic units, they typically fall 
at minor structural boundaries and within, rather than at the ends or beginnings, of larger 
syntactic constructions. Whereas juncture pauses are an aid to the hearer and help to put 
across the structure of a sentence, these hesitation pauses are often an annoyance to the 
hearer and interfere with rather than aid in grasping the sentence as a whole. Reading the 
material after sufficient rehearsal, or speaking it after memorization, would eliminate most 
of these hesitation pauses. Even the practised and fluent lecturer, however, apparently 
cannot entirely eliminate these in unrehearsed discourse. He may reduce them to such a 
point that neither he nor his hearers are aware of them, but a listener who is concentrating 
upon these rather than on the content of the lecture will find them very marked though 
brief. 

Hesitation pauses have figured very little in linguistic analysis. Probably one reason 
for this is the way in which the informant technique has been made use of in the past. 
Whether the informant be an American Indian speaking a strange language to an inquiring 
linguist, or whether he be a linguist speaking his own language to himself, the time-con- 
suming task of committing the observations to paper has necessitated a great many repeti- 
tions of stretches of speech a sentence or less in length. The repetitions demanded of the 
informant amount to rehearsal and result in his memorization of the phrase or sentence, and 
thus the hesitation pauses are weeded out. Only nowadays, with the advent of easy-to-use 
recording machines, are records of speech possible which preserve these little ‘defects’ 
for the investigator. One group of linguists has recently given particular attention to these 
pauses, preserving them in their transcriptions. But they have not been clearly enough 
distinguished from junctures. In some cases they have in fact been regarded as junctures. 


Hesitation pauses in speech need much more study. We have hunches as 
to some of the results which such a study might show. These may be put in the 
form of hypotheses to be tested. The hypotheses have to do with the suspected 
relationships between hesitation pauses, transitional structure, and units of 
encoding. More conjectural are some which have to do with linguistic structure 
and units of decoding. 

Hypothesis 1: Hesitation pauses correspond to the points of highest statistical 
uncertainty in the sequencing of units of any given order. (High statistical un- 
certainty = high transitional entropy.) The observations which lead us to 
formulate this hypothesis have been focused on the sequencing of words. We 
are relatively hopeful for the substantiation of the hypothesis when the units are 
of this order. Whether the same may hold true for some sort of hesitation or 
tempo phenomenon when the units are morphemes or phonemes, or perhaps 
some higher-order phrase units is a completely open question. 

Testing of this hypothesis will require accumulation of two sorts of data: measurements 
of hesitation pauses, and transitional probabilities. It should be done for a single speaker, 
since the values of these would vary considerably with the speaker and his familiarity 
with various possible subjects of discourse. Our observations suggest to us that magnetic 
recordings of the class performances of a good lecturer would make excellent material 
for the identification and measurement of hesitation pauses. The measurement of transi- 
tional probabilities, on the other hand, would be more laborious. There are two theoretically 
possible methods. The one might make use of a large amount of natural data, e.g., a semester’s 
lectures in a particular course. The calculation of all transitional probabilities for 
every pertinent word in its various contexts, or for every pertinent context and the various 
words which may follow, would be an impossible task. A limited sampling could be done, 
however. A more practical short-cut in establishing transitional probabilities would be to 
administer word-association tests to the speaker of the recorded material. An interesting 
experiment could be worked out combining these two methods of getting at transitional 
probabilities. The ‘Cloze’ procedure being developed by Wilson Taylor should also be useful 
here, in this case given to the speaker himself. 
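One way the two sorts of data called for under Hypothesis 1 might be brought together is a rank correlation between pause duration and statistical uncertainty at each word transition. The sketch below is only an assumption about how such a test could be set up: the probabilities and pause durations are invented, uncertainty is expressed as −log2 of the transitional probability, and the Spearman coefficient is a swapped-in statistic not named in the text.

```python
import math

def ranks(values):
    """Average ranks (1-based); tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result

def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    rx, ry = ranks(x), ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = math.sqrt(sum((a - mx) ** 2 for a in rx))
    sy = math.sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy)

# Hypothetical measurements at six word transitions of one speaker.
transitional_prob = [0.90, 0.05, 0.60, 0.02, 0.75, 0.10]
pause_seconds = [0.00, 0.80, 0.05, 1.50, 0.02, 0.40]

uncertainty = [-math.log2(p) for p in transitional_prob]
rho = spearman(uncertainty, pause_seconds)
print(round(rho, 3))
```

A coefficient near +1 on real data would be in line with the hypothesis; the invented figures here were chosen to be perfectly monotone and so show nothing beyond the mechanics of the computation.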


Hypothesis 2: Hesitation pauses and points of high statistical uncertainty cor- 
respond to the beginning of units of encoding. Evidence on this point will be of an 
indirect sort, since the encoding process is not open to observation. The psycho- 
logical theory would have a unit of encoding begin with semantic encoding in a 
higher mediational system and set off a train of more automatic responses in the 
lower dispositional and motor skill systems. Automaticity of response is a product 
of frequent repetitions. A response which originally is consciously directed is 
transformed with sufficient repetition into an automatic unconscious response. 
(To understand the point, one need only think of a person learning to drive an 
automobile or to type or to execute immediately and ‘without thinking’ any 
prescribed response to a given stimulus.) If it should be shown that the stretch 
of speech from one hesitation pause to the next is a convergent one, i.e., one 
characterized by decreasing statistical uncertainty (increasing transitional 
probabilities), then we would have strong support for claiming this as a unit of 
encoding. 
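The notion of a ‘convergent’ stretch between hesitation pauses can be made concrete with a small sketch. It assumes that an uncertainty value (in bits) is already available for each transition and that pause locations have been marked; the figures, the function names, and the use of a least-squares slope as the criterion of decreasing uncertainty are all assumptions made for illustration.

```python
def slope(values):
    """Least-squares slope of values against their position index."""
    n = len(values)
    mean_x = (n - 1) / 2
    mean_y = sum(values) / n
    num = sum((i - mean_x) * (v - mean_y) for i, v in enumerate(values))
    den = sum((i - mean_x) ** 2 for i in range(n))
    return num / den

def split_at_pauses(uncertainty, pause_before):
    """Cut the sequence so that each segment begins at a hesitation pause."""
    segments, current = [], []
    for u, paused in zip(uncertainty, pause_before):
        if paused and current:
            segments.append(current)
            current = []
        current.append(u)
    if current:
        segments.append(current)
    return segments

# Hypothetical uncertainty (bits) per transition, and pause markers.
uncertainty = [4.1, 2.0, 1.2, 0.5, 3.8, 2.5, 0.9]
pause_before = [True, False, False, False, True, False, False]

segments = split_at_pauses(uncertainty, pause_before)
slopes = [slope(s) for s in segments if len(s) > 1]
convergent = all(m < 0 for m in slopes)
print(segments, convergent)
```

A negative slope within every pause-to-pause stretch would be the pattern the hypothesis anticipates; segments of a single transition are skipped because no slope can be computed for them.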

Hypothesis 3: Hesitation pauses and points of high statistical uncertainty frequently 
do not fall at the points where immediate-constituent analysis would establish 
boundaries between higher-order linguistic units or where syntactic junctures or 
‘facultative pauses’ would occur. Evidence on this question would be relatively 
easy to assemble, given the recordings and analysis of data proposed under Hyp. 
1. It would be necessary only to add linguistic analysis of the same material. 

Hypothesis 4: The units given by immediate-constituent analysis, and especially 
those bounded by facultative pause points, do correspond to units of decoding, how- 
ever. (These do not necessarily coincide with units of encoding: see Hyp. 5.) A 
definition of ‘unit of decoding’ would have to be given in terms of speech com- 
prehension. It is a common English-class dogma that carefully phrased speech, 
with pauses, etc. ‘for expression,’ is more comprehensible than either ‘slurred’ 
or ‘chopped-up’ speech. The phrasing pauses here referred to are characteristically 
inserted at points which immediate-constituent analysis establishes as the 
boundaries between larger units. Conceivably an experiment might be designed 
to test the facilitation or hindrance of comprehension with different distributions 
of pauses in speech material. Among the experimental distributions would be 
included the two which we have discussed and which we suspect correspond to 
units of encoding and to units of decoding, respectively. 

Hypothesis 5: Units of encoding for easy oft-repeated combinations approach 
coincidence with those of decoding. In such material (e.g., the favorite oft-repeated 
assertions of a professor in his university lectures) hesitation pauses will tend 
to be eliminated. The frequent repetitions increase very highly the transitional 
probabilities between the units of which it is composed and reduce the statistical 
uncertainty at all points within it. The pauses which remain in the delivery of 
such material are those which fall at major syntactic juncture points and which 
may be magnified for stylistic effect or diminished for speed and economy of 
effort, depending on the content of utterance. A test of this hypothesis could 
be achieved fairly simply by choosing from a large collection of recordings a 
number of the most frequently repeated sentences, series of sentences, or parts 
of sentences, and examining these and comparing them with other less frequent 
sentences. 


5.2. Certain Characteristics of Phoneme Sequences⁴⁸ 


A few applications of entropy measures have already been made on the level 
of phonemic analysis. The probabilities of English phonemes and of all possible 
sequences of two such phonemes have been estimated from a text of 20,000 
phonemes, and appropriate entropy measures have been computed.⁴⁹ Similar 
analyses will probably be carried out in the near future on other languages. 
Such studies would be of great value in describing and comparing the structures 
of various languages. However, since the factors governing the choice of phonemes 
extend over long sequences of phonemes, and even morphemes, these entropy 
measures can at best be regarded as averages over a large set of conditions and 
so only partial descriptions. Jakobson and his co-workers⁵⁰ go at this descriptive 
problem in a different fashion. Here the phoneme is treated as a class of sounds 
defined by a set of distinctive features. This sort of analysis permits one to es- 
timate the degree to which all of the potential combinations of distinctive 
features are used. Both of these approaches are utilized in the following analysis. 

Whereas the descriptive linguist has usually limited his interest to those 
combinations which can occur in a language, it appears that analysis of relative 
frequencies of combinations may reveal data which can be more meaningfully 
interpreted and lead to more fruitful hypotheses. One such hypothesis is based 
on the assumption that a message will tend to be produced in such a way as to 
take into consideration the effort of both the speaker and the hearer (cf., Zipf). 
For example, in any cluster of consonant phonemes, the minimum effort for the 
encoder would be the one in which any two successive phonemes would be most 
similar; this, however, would cause a maximum effort on the part of the listener, 
who would be forced to make a series of very fine distinctions. For the decoder, 
the simplest situation is the one in which two succeeding phonemes differ as 
much as possible, thus making the distinction easy to make. If speech does reflect 
both factors, then we would expect low frequencies of both extremely similar 
clusters and extremely different clusters, i.e., a normal distribution curve, where 
frequency of a cluster is a function of the difference between the two phonemes 
in the cluster. 

48 Sol Saporta. 

49 This tabulation was carried out under the direction of John B. Carroll at the Summer 
Seminar on Psychology and Linguistics held at Cornell University in 1951. Only a privately 
distributed mimeographed summary of the results is available so far. 

50 Jakobson, Fant, and Halle, Preliminaries to speech analysis (Cambridge, 1952). Cherry, 
Halle, and Jakobson, Toward a logical description of languages in their phonemic aspect, 
Language 29. 34-46 (1953). 

Roman Jakobson’s distinctive feature analysis offers a meaningful measure 
of differences between phonemes. If we compare the English phonemes /p/, 
/t/, and /θ/, we can establish that /t/ and /θ/ have the same distinctive features, 
except for their contrast as to continuant/interrupted. This − vs. + 
contrast is here counted as being a difference of 2 units. On the other hand, while 
/t/ and /p/ contrast as to grave/acute, which is two units of difference, they 
also differ in that the feature of strident/mellow is irrelevant in /p/ but is − 
in /t/. This kind of a difference is here counted as one unit of difference, so that 
/p/ and /t/ differ by a total of 3 units. In this way, the units of difference be- 
tween any given phoneme and all other phonemes may be established. The 
series of 20,000 phonemes analysed by Carroll, showing the frequency with which 
each phoneme is followed by every other phoneme, provides data for a test of 
our hypothesis. We would predict that lowest frequencies of clustering would be 
between phonemes maximally similar or maximally different. The results for 
845 consonant clusters are as follows: 


Difference between phonemes     Average frequency of
in number of units              clustering

         0                      0.0 (by definition)
         1                      0.0
         2                      0.2
         3                      1.2
         4                      5.1
         5                      5.4
         6                      0.4
         7                      1.9
         8                      0.7
         9                      0.2
        10                      0.0

It is apparent, then, that clustering does tend to follow a normal curve, ex- 
cept for the disproportionately low occurrence of clusters differing by 6 units. 
This is based on an analysis whereby /č/ and /ǰ/ are considered unit phonemes. 
If one accepts a phonemic analysis whereby these affricates are considered to be 
clusters of /tš/ and /dž/ respectively, each occurrence of /č/ and /ǰ/ becomes 
a cluster. The resulting analysis into distinctive features indicates that these 
clusters differ by a total of 6 units. The average frequency of clusters 
differing by 6 units would then be 3.6, which is quite in keeping with the normal 
curve. It seems justified, then, to assume that at least in consonant clusters, 
maximum efforts for either encoder or decoder are avoided in favor of those 
situations where the effort is more or less equally divided. If our hypothesis is 
correct, it should apply to all languages. Phonemic transcriptions for diverse 
languages must be analyzed in terms of transitional frequencies, as well as in 


e 


Bir 


——— oO -—LPy——— CR 


PSYCHOLINGUISTICS: A SURVEY OF THEORY AND RESEARCH PROBLEMS 108 


terms of distinctive features, to determine whether or not this is a general prin- 
ciple. 
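The unit-difference measure used in this analysis can be sketched as follows. The scoring rule (2 units for a + vs. − contrast, 1 unit where a feature is relevant in one phoneme and irrelevant in the other) is taken from the text; the feature table is a partial, assumed one covering only the three phonemes and three features mentioned there, and the names `FEATURES` and `unit_difference` are hypothetical.

```python
# '+' and '-' are the two poles of a feature; None marks a feature
# that is irrelevant for the phoneme. Partial table, for illustration only.
FEATURES = {
    "p": {"grave": "+", "continuant": "-", "strident": None},
    "t": {"grave": "-", "continuant": "-", "strident": "-"},
    "θ": {"grave": "-", "continuant": "+", "strident": "-"},
}

def unit_difference(a, b, features=FEATURES):
    """Count units of difference between two phonemes:
    2 for opposite poles, 1 for relevant vs. irrelevant, 0 if alike."""
    total = 0
    for name in features[a]:
        va, vb = features[a][name], features[b][name]
        if va == vb:
            continue
        total += 1 if (va is None or vb is None) else 2
    return total

print(unit_difference("t", "θ"))  # continuant/interrupted contrast only
print(unit_difference("p", "t"))  # grave/acute contrast plus strident relevance
```

The two calls reproduce the differences of 2 and 3 units worked out in the text for /t/ : /θ/ and /p/ : /t/. Extending the table to the full inventory would require the complete distinctive-feature assignments, which are not reproduced here.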

The above is merely one particular analysis. The investigation of factors 
which may determine transitional frequencies, however, can be extended to 
cover all types of data. A further examination of the same material used above 
indicates other possible fields for investigation. For example, a significantly 
higher percentage of voiceless stops is found to occur before juncture than the 
corresponding voiced stop. The fact that this is due to a well-known historical 
change in Germanic is of no relevance here. Precisely the same psychological 
factor which might tend to cause this on the synchronic level would also operate 
in affecting change. Exactly what this factor is, of course, is difficult to determine. 
One possible explanation is the seemingly reasonable assumption that less in- 
formation need be given in final position of word units. This would then assume 
that voiceless gives less information than the corresponding voiced. It has been 
suggested by Zipf that the voiceless is easier than the corresponding voiced, so 
that if the information value is not a factor, the system might tend to choose the 
unit requiring the least effort. This is, of course, merely a hypothesis which 
would have to be tested in various languages. 

Leopold has suggested that in child language there seems to be a tendency 
for a stop to be replaced by the corresponding fricative in word final position 
but not in word initial position. This immediately suggests that a correlation 
might be found in adult speech. The available corpus of transitional frequencies 
for English does indicate a tendency for this to be true, but it is not significant 
with this amount of data. In any case, it seems that this method may have many 
fruitful applications. The immediate need is for similar data in diverse languages, 
so that general principles, if any, may be disclosed. 


5.3. Applications of Entropy Measures to Problems of Sequential Structure⁵¹ 


Because of the frequently observed effects of antecedent events on the choice 
of subsequent events in language, the Markov Process has been regarded as an 
ideal conceptual tool in the study of linguistic structure. The Whorf and Harris 
models of the English monosyllable could be readily used in such an analysis 
and lack only the conditional probabilities of passing from one state to another 
to be complete Markov processes. Likewise, knowledge of syntactical structure 
can guide us in applying entropy measures to morphemes or words and setting 
up appropriate Markov processes. However, it seems that existing knowledge 
cannot do more than provide us with guides for describing such relatively simple 
phenomena, leaving the more complex and less well understood aspects of 
linguistic structure untouched. 

The most obvious way to approach these more complex aspects would be 
to apply entropy measures to extended sequences of phonemes taken from a 
large sample of texts. By increasing the length, r, of the sequences of phonemes 
in A, the class of all possible sequences of antecedent phonemes, we should be 
able to find a minimum sequence length, say n, for which H_A(S) ceases to decrease 
significantly, S being the class of subsequent events. The set of joint and 
conditional probabilities obtained for all sequences of length n or less should 
enable us to set up a Markov Process which completely represents linguistic 
structure within the limits of sampling error. While such a procedure is feasible 
in theory, it is hardly practical because of the enormous effort needed to sample 
and tabulate the very large number of sequences in A.⁵² Moreover, the results 
of such an analysis would be of such a bewildering complexity that they would 
be practically unusable. 

51 Kellogg Wilson and John B. Carroll. 


5.3.1. Higher-order Markov Processes 


A more practical conceptual scheme can be devised using the concept of a 
higher-order Markov Process—a Markov Process such that each of its states 
is itself a Markov Process. Such a scheme can allow incorporation of the existing 
units of linguistic analysis. For example, the states of a higher-order Markov 
Process could be morpheme classes, each of which is represented by a Markov 
Process whose states are phonemes. Such a representation has additional advan- 
tages in clarifying the entropy analysis of phonemes. It is easily demonstrated 
that the probability of a phoneme may be expressed as a sum, over all the mor- 
pheme classes, of the probability of a morpheme class times the probability of 
the phoneme in that morpheme class.⁵³ In other words, the probability of a 
phoneme is a weighted average of its probability within each of the morpheme 
classes. Thus, a highly probable phoneme could be highly probable in just a 
few morpheme classes or moderately probable in nearly all morpheme classes. 
In English, the phoneme /ð/ is highly probable only in words including a definite 
article morpheme (e.g., ‘the,’ ‘these,’ ‘those’) while the high probability of the 
vowel phonemes is most likely due to their moderate probability in a large 
number of morpheme classes. A simple count of phonemes over a large sample 
of texts would be incapable of indicating these phenomena whereas the analysis 
suggested above should. Thus, the more complex analysis proposed here is 
potentially more capable of indicating the details of linguistic structure. 
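The weighted-average relation described above, in which the probability of a phoneme is the sum over morpheme classes of the class probability times the probability of the phoneme within the class, can be sketched numerically. The classes, phonemes, and all probabilities below are invented for illustration and are not estimates for English.

```python
# Hypothetical probabilities p(B) of three morpheme classes (sum to 1).
p_class = {"article": 0.10, "noun": 0.50, "verb": 0.40}

# Hypothetical probabilities p_B(a) of each phoneme within each class.
p_within = {
    "article": {"ð": 0.45, "ə": 0.55},
    "noun": {"ð": 0.00, "ə": 0.30, "t": 0.70},
    "verb": {"ð": 0.01, "ə": 0.29, "t": 0.70},
}

def phoneme_probability(a):
    """p(a) as the sum over classes B of p(B) * p_B(a)."""
    return sum(p_b * p_within[b].get(a, 0.0) for b, p_b in p_class.items())

def class_contributions(a):
    """Share of p(a) contributed by each morpheme class."""
    return {b: p_b * p_within[b].get(a, 0.0) for b, p_b in p_class.items()}

for phoneme in ("ð", "ə", "t"):
    print(phoneme, round(phoneme_probability(phoneme), 3), class_contributions(phoneme))
```

In this invented case nearly all of the probability of /ð/ comes from the article class, whereas /ə/ draws moderate contributions from every class. A simple over-all count would report only the marginal values and so would conceal that difference.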


Eventually, it will probably be necessary to establish a hierarchy of Markov Processes 
where each level contains some of the processes of the next lower level as states. For the 
time being, however, it would probably be best to confine the setting up of such a hierarchy 
to linguistic units which are relatively well understood, such as the phoneme and some 
classes of morphemes. It should be realized that the choice of levels in the hierarchy of 
Markov Processes is largely a matter of convenience in conceptualizing linguistic structure 


52 This number is relatively small for very small values of r, say 1 or 2, but increases 
very rapidly as r increases. For example, if our phonemic transcription used 34 phonemes, 
the number of sequences in A for r = 4 would be 34^4 = 1,336,336. This state of 
affairs would be much worse if a full phonetic transcription were used. 

53 Using mathematical symbolism, if p(a) is the probability of phoneme a, p(B) the 
probability of a morpheme class B which is any of a set of morpheme classes, and p_B(a) 
the probability of a in B, it is easily shown that 

$$p(a) = \sum_{B} p(B)\, p_B(a).$$




and that there can be no serious objections to using any particular set of categories so long 
as the categories on each level provide a probability space with mutually exclusive divisions. 
In fact, the use of categories established by various schools of linguists may well indicate a 
hitherto unrealized agreement in the nature of linguistic structure. 


5.3.2. Entropy Analysis of a Small Scale Artificial Language54


The analysis of a small scale artificial language given here is meant as a demon- 
stration of a potentially valuable technique. The demonstration hinges on the 
fact that, given the rules of its construction and the number of its interchangeable 
states, the total number of messages that can be transmitted is known and 
finite. This is, of course, not true of natural languages. However, artificial 
languages can be so designed as to incorporate any particular aspects of language 
in which the investigator is interested without contamination with the many 
complexities of natural languages. Moreover, such languages permit the study 
of phenomena which may not be found in any natural language. While it is cor- 
rect to point out that we can ‘only get out what we put in’ such an artificial 
language, this technique allows us to explore the implications of certain aspects 
of linguistic structure which we may not have been aware of previously. 


Structure of small scale language. 

(1) Phonology: 

    Vowels:     a, i 
    Consonants: b, d 

(2) Morpheme classes and exhaustive lists thereof: 

    Nouns (N):  bab ‘man’      dab ‘dog’ 
                bad ‘woman’    dad ‘cat’ 
                bib ‘boy’      dib ‘wolf’ 
                bid ‘girl’     did ‘bird’ 
    [Each of these may be used as either subject (Ns) or object (No)] 

    Verbs:      Transitive (Vt)    Intransitive (Vi) 
                aba ‘see’          iba ‘go’ 
                abi ‘kill’         ibi ‘come’ 
                ada ‘find’         ida ‘walk’ 
                adi ‘lose’         idi ‘fly’ 

    Truth-value markers (T): 
                ba ‘yes’ (approximate translation) 
                bi ‘no, not’ (approximate translation) 


54 This analysis has been described elsewhere by John B. Carroll and is included here 
because of the remarkably clear way in which it illustrates many of the problems with 
which this section is concerned. 




(3) Syntax: 
The following is an exhaustive classification of the sentences of this language: 

    Statements:                                 No. of possible sentences 
      Ns Vi T      (with intrans. vb.)          8 × 4 × 2     =   64 
      Ns Vt No T   (with trans. vb. and obj.)   8 × 4 × 8 × 2 =  512 

    Questions (parallel to above): 
      Vi Ns T                                   4 × 8 × 2     =   64 
      Vt Ns No T                                4 × 8 × 8 × 2 =  512 

    Total no. of possible sentences:                             1152 


(4) Juncture, stress, intonation, etc.: there is no stress or intonation in this language, at 
least not phonemically. There are no junctures between ‘words,’ so that all sentences are 
simply strings of phonemes. 


Sample sentence: dadabididbi. ‘Cat kill bird not.’ 


(5) Non-linguistic considerations. We will assume that all of the 1152 sentences in this 
language are equiprobable, and that there are no dependencies between sentences in a 
string. 
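
The sentence counts given above can be checked by brute-force enumeration. The following is a minimal sketch, not part of the original analysis; the variable and function names are our own, and it assumes only the morpheme lists and the four sentence patterns stated in (2) and (3).

```python
from itertools import product
from math import log2

# Morpheme lists as given in the description of the language.
NOUNS = ["bab", "bad", "bib", "bid", "dab", "dad", "dib", "did"]
VT = ["aba", "abi", "ada", "adi"]      # transitive verbs
VI = ["iba", "ibi", "ida", "idi"]      # intransitive verbs
T = ["ba", "bi"]                       # truth-value markers

# The four sentence patterns of the syntax.
PATTERNS = [
    (NOUNS, VI, T),          # statement, intransitive: Ns Vi T
    (NOUNS, VT, NOUNS, T),   # statement, transitive:   Ns Vt No T
    (VI, NOUNS, T),          # question, intransitive:  Vi Ns T
    (VT, NOUNS, NOUNS, T),   # question, transitive:    Vt Ns No T
]


def all_sentences():
    """Every sentence of the language as an unbroken string of phonemes."""
    return ["".join(parts)
            for pattern in PATTERNS
            for parts in product(*pattern)]


SENTENCES = all_sentences()
print(len(SENTENCES))                      # number of possible sentences
print(round(log2(len(SENTENCES)), 2))      # entropy of one sentence, in bits
```

Since all sentences are assumed equiprobable, the entropy of a single sentence is simply the base-2 logarithm of the number of sentences.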


Application of entropy measurement 


(1) Entropy of a single sentence: H_S = log2 1152 = 10.17. 

(2) Entropy reduction of each phoneme, considered with respect to its position in the 
sentence, is H_P = H_S − H_R, where H_R is the entropy of the statements which are still 
possible after the transmission of phoneme P (and where, after the first phoneme, H_S is 
taken as the entropy remaining just before P is transmitted). 

To illustrate: Consider the successive phonemes of the sentence ababibbabbi, ‘see boy man 
not’ or ‘does not the boy see the man?’ (Free translation.) 

Phoneme  Remarks                                            No. possible      H       H_P 
                                                            sentences 
                                                            remaining 
-        Any sentence possible before transmission begins       1152       10.17      - 
a        Must be question with trans. vb.                        512        9.00     1.17 
b        Verb either see or kill                                 256        8.00     1.00 
a        Verb must be see                                        128        7.00     1.00 
b        Ns must be either man, woman, boy, or girl               64        6.00     1.00 
i        Must be either boy or girl                               32        5.00     1.00 
b        Must be boy                                              16        4.00     1.00 
b        No is man, woman, boy, or girl                            8        3.00     1.00 
a        Man or woman                                              4        2.00     1.00 
b        Man; sentence either pos. or neg.                         2        1.00     1.00 
b        Redundant; gives no information                           2        1.00     0.00 
i        Sentence is negative                                      1        0.00     1.00 
                                                                            Total   10.17 

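
The phoneme-by-phoneme tabulation above can be reproduced mechanically by counting, after each phoneme, the sentences that still begin with the string transmitted so far. This is a minimal sketch under the equiprobability assumption of (5); the function names are our own.

```python
from itertools import product
from math import log2

NOUNS = ["bab", "bad", "bib", "bid", "dab", "dad", "dib", "did"]
VT = ["aba", "abi", "ada", "adi"]
VI = ["iba", "ibi", "ida", "idi"]
T = ["ba", "bi"]
PATTERNS = [(NOUNS, VI, T), (NOUNS, VT, NOUNS, T),
            (VI, NOUNS, T), (VT, NOUNS, NOUNS, T)]
SENTENCES = ["".join(p) for pat in PATTERNS for p in product(*pat)]


def phoneme_profile(sentence):
    """For each phoneme: (phoneme, sentences remaining, H remaining, reduction)."""
    rows = []
    before = len(SENTENCES)
    for i, phoneme in enumerate(sentence, start=1):
        prefix = sentence[:i]
        after = sum(s.startswith(prefix) for s in SENTENCES)
        # Reduction is the entropy before this phoneme minus the entropy after it.
        rows.append((phoneme, after, log2(after), log2(before) - log2(after)))
        before = after
    return rows


for phoneme, n, h, reduction in phoneme_profile("ababibbabbi"):
    print(f"{phoneme}  {n:5d}  {h:5.2f}  {reduction:4.2f}")
```

The reductions necessarily sum to the entropy of the whole sentence, since each step divides the set of remaining sentences until only one is left.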
(3) Entropy reduction of each morpheme, also considered in relation to its position in 
the sentence, is H_M = H_S − H_R, where H_S and H_R are as defined previously. The same 
sentence is used here as in the example above. 




Morpheme  Remarks                                   N_R       H      H_M 
-         Any sentence possible initially           1152    10.17     - 
aba       Verb see; sentence is question with Vt     128     7.00    3.17 
bib       This is noun boy                            16     4.00    3.00 
bab       This is noun man                             2     1.00    3.00 
bi        Sentence is negative                         1     0.00    1.00 
                                                            Total   10.17 


Note that this analysis can be obtained from that in (2) above by summing across 
the phonemes in each morpheme. 

Sometimes variations in structure allow for a greater entropy reduction for particular 
morphemes. For example, consider the sentence ibibadbi, translated ‘Come woman not’: 


Morpheme  Remarks                     N_R       H      H_M 
-         Any sentence possible       1152    10.17     - 
ibi       Question with Vi come         16     4.00    6.17 
bad       This is noun woman             2     1.00    3.00 
bi        Sentence is negative           1     0.00    1.00 
                                              Total   10.17 


Note that the entropy reduction of the first phoneme here is 4.17. 
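
The morpheme-level tables, and the figure just quoted for the first phoneme, can be obtained with the same counting procedure applied to units of any size. The sketch below is illustrative only; the helper `unit_profile` is a name of our own choosing, and the segmentation into morphemes is supplied by hand.

```python
from itertools import product
from math import log2

NOUNS = ["bab", "bad", "bib", "bid", "dab", "dad", "dib", "did"]
VT = ["aba", "abi", "ada", "adi"]
VI = ["iba", "ibi", "ida", "idi"]
T = ["ba", "bi"]
PATTERNS = [(NOUNS, VI, T), (NOUNS, VT, NOUNS, T),
            (VI, NOUNS, T), (VT, NOUNS, NOUNS, T)]
SENTENCES = ["".join(p) for pat in PATTERNS for p in product(*pat)]


def unit_profile(units):
    """Rows of (unit, sentences remaining, H remaining, entropy reduction).

    `units` is a sentence already segmented into phonemes or morphemes.
    """
    rows, prefix, before = [], "", len(SENTENCES)
    for unit in units:
        prefix += unit
        after = sum(s.startswith(prefix) for s in SENTENCES)
        rows.append((unit, after,
                     round(log2(after), 2), round(log2(before / after), 2)))
        before = after
    return rows


print(unit_profile(["aba", "bib", "bab", "bi"]))   # morphemes of ababibbabbi
print(unit_profile(["ibi", "bad", "bi"]))          # morphemes of ibibadbi
print(unit_profile(list("ibibadbi"))[0])           # first phoneme only
```

Because the total entropy is fixed by the ensemble of sentences, coarser units simply collect the reductions of the phonemes they contain.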


The entropy analysis of the language described above seems to justify the 
following conclusions: 

(1) The amount of entropy of any message in the language is constant regardless 
of what type of units are being analysed. However, the amount of 
redundancy depends upon the characteristics of the symbols in which the message 
is coded. 

(2) The amount of entropy of a message with specified structural boundaries 
is a function of the ensemble of all possible messages within these boundaries. 
This ensemble is determinable from the grammar of the language and the inventory 
of its form classes. It would be possible to test the validity of this conclusion 
for English sentences of a limited structural type, say of the form Noun-Verb-Noun. 
This also could be done for similarly limited classes of words, but in both 
cases we would have to account for the differential probabilities of the units 
within the language. 

(3) The entropy of a given symbol at a given point in a message is a function 
of the extent to which its transmission narrows the range of possible messages. 
The average amount of entropy per symbol is an average of such measures. 


5.3.3. Entropy Profiles 


The analysis of the amount of entropy reduction for every unit in the model 
language above seems to be closely related to the entropy profile analysis to be 
described here. However, there are two important differences which should be 
noted: 

(a) An entropy reduction analysis presupposes that the number of possible 
messages is finite and that the probability of each of the messages is known. An 
entropy profile analysis involves no assumption concerning the number of possible 
messages or their probabilities, but requires only that the various component 
units which can occur in the environment of the message and their probabilities 
be known. Thus, it appears that entropy reduction analysis could be applied 
only to limited classes of natural language messages since the number of messages 
in nearly all languages is indefinitely large. 

(b) An entropy reduction analysis of the type above presupposes that the 
structural units of the language are known. Such an analysis indicates the con- 
tribution of these units to entropy reduction. An entropy profile analysis involves 
no such assumption about the higher order units of a language, but serves in the 
selection of the most appropriate higher order units. 


Let 1, 2, 3, 4, ..., n represent a set of sequentially ordered phonemes in a text of n units 
length. Let x and y be any pair of antecedent and subsequent phonemes in this sequence. 
Let A be the class of antecedent phonemes which may be selected before any y and let S 
be the class of subsequent phonemes which may be selected after any x. Let a be any member 
of A and s be any member of S. Finally, let p_x(s) and p_y(a) represent the conditional 
probabilities of the s's and a's for particular x's and y's. 

It is possible to measure the entropy of class S after any x and the entropy of the class 
A before any y by means of the following equations:55 

$$H_x(S) = -\sum_{s} p_x(s) \log_2 p_x(s), \qquad H_y(A) = -\sum_{a} p_y(a) \log_2 p_y(a).$$

The total amount of entropy between any x and y, E(x, y), is given by 

$$E(x, y) = H_x(S) + H_y(A).$$

Let us examine the behavior of E(x, y) in four extreme cases. 


(1) Only one phoneme follows x and only one phoneme precedes y. In other words, x 
and y always occur together. 

Obviously, H_x(S) = H_y(A) = 0, so E(x, y) = 0. 

(2) (a) Only one phoneme, y, follows x but k different phonemes can precede y 
equiprobably. 

Obviously, H_x(S) = 0 and H_y(A) = log2 k, so E(x, y) = log2 k. 

(b) l different phonemes follow x equiprobably but only one phoneme, x, can precede y. 

Obviously, H_x(S) = log2 l and H_y(A) = 0, so E(x, y) = log2 l. 

(3) l different phonemes can follow x equiprobably and k different phonemes can precede 
y equiprobably. 

Obviously, H_x(S) = log2 l and H_y(A) = log2 k, so that E(x, y) = log2 l + log2 k.56 


55 It should be noted that H_x(S) and H_y(A) are measures of entropy in particular conditions 
and so correspond to the measure H_i(J) discussed in section 2.3. They are not 
measures of conditional entropy which average measures corresponding to H_i(J) over a 
number of conditions. 


Once E(x, y) has been computed for all pairs (x, y) for the sequentially ordered 
phonemes 1, 2, 3...n, an entropy profile may be plotted from these values. 
We would expect this profile to be near zero for instances of high redundancy 
like case 1, to be moderately high for instances of partial redundancy like cases 
2 (a) and (b), and to be maximally high for instances of minimal redundancy 
like case 3. We may further distinguish between the two types of case 2 instances 
by plotting H_x(S) on the same graph as E(x, y), since H_x(S) will be low for instances 
like case 2 (a) and high for instances like 2 (b). The resulting profile will 
appear as in Fig. 11. The two data points between phonemes 1 and 2 represent 
E(1, 2) and H_1(S); the two points between phonemes 2 and 3 represent E(2, 3) 
and H_2(S), etc. The underlined numbers beneath the points on the graph indicate 
the class type most nearly represented by the entropy relations. 
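
The computation of such a profile can be sketched as follows. This is an illustration rather than a prescribed procedure: it assumes that the conditional probabilities are estimated from adjacent-pair counts in a sample of texts, that each text is a sequence of phoneme symbols (here single characters), and that every adjacent pair of the analysed text also occurs in the sample. All function names are our own.

```python
from collections import Counter, defaultdict
from math import log2


def conditional_entropies(corpus):
    """Estimate H_x(S) for every x and H_y(A) for every y from adjacent pairs."""
    followers = defaultdict(Counter)   # followers[x][s]: s observed after x
    preceders = defaultdict(Counter)   # preceders[y][a]: a observed before y
    for text in corpus:
        for x, y in zip(text, text[1:]):
            followers[x][y] += 1
            preceders[y][x] += 1

    def entropy(counts):
        total = sum(counts.values())
        # Sum of p * log2(1/p) over the observed alternatives.
        return sum(c / total * log2(total / c) for c in counts.values())

    h_s = {x: entropy(c) for x, c in followers.items()}
    h_a = {y: entropy(c) for y, c in preceders.items()}
    return h_s, h_a


def entropy_profile(text, h_s, h_a):
    """Rows of (x, y, E(x, y), H_x(S)) for each adjacent pair in the text."""
    return [(x, y, h_s[x] + h_a[y], h_s[x]) for x, y in zip(text, text[1:])]


# Toy sample approximating case 2 (a): only y follows x, but four
# different phonemes precede y equally often.
h_s, h_a = conditional_entropies(["xy", "py", "qy", "ry"])
print(entropy_profile("xy", h_s, h_a))
```

Plotting the third and fourth elements of each row against position in the text gives the two curves described above.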

So far, we have not mentioned the most tedious part of the computation of 
entropy profiles: the estimation of the p_x(s)'s and the p_y(a)'s. This estimation 
can be carried out in at least two distinctly different ways which will naturally 
lead to somewhat different interpretations: 

(1) Estimation from sample texts. Such estimation would require a large sample 
of texts in a given language with a variety of semantic contents. Since all the 
phonemes of a language will be included in any sizeable sample, one such estima- 
tion should suffice for the computation of entropy profiles for any text in the 
sampled language. However, this conclusion would not necessarily apply to 
morphemes or any larger linguistic units. An entropy profile based on such an 
estimate should indicate only the effect of the formal structure of the language 
and should be relatively independent of the semantic content of the text. 

(2) Estimation from subjects’ anticipations. This technique would require a 
group of homogeneous subjects, all speakers of the language of the text, to 
anticipate the phonemes of the text. In obtaining the p_x(s)'s, the text would 
be given in a forward direction and the subjects would be asked to anticipate 
what the next phoneme would be. In obtaining the p_y(a)'s, the text would be 
given in a reverse direction and the subjects would be asked to anticipate the 
preceding phoneme. In both cases, it would probably be necessary to repeat the 
portion of the message already given to control for differential memory effects. 
Also, sufficient instruction concerning the semantic content of the message 
should be given before the first anticipations are made to insure that the effect 
of semantic content is relatively constant throughout. This method is rather 
cumbersome if the units are phonemes, since it could be used only for short 
texts in one experimental session and because of the difficulty in recording the 
responses of linguistically naive subjects. Regardless of the units used, it would 
be necessary to make a new estimation for every new text analysed. Nevertheless, 
this method of estimation should reflect both the effect of the structure of the 
language (assuming that the subjects respond in terms of this structure) and the 
semantic content of the message. While this sort of analysis is of little importance 
to linguistics, per se, it is of great potential value to the determination of the 
psycholinguistic units of decoding. 


56 Naturally, we can expect to find such extreme cases only rarely in actual texts. Nevertheless, 
these cases are pure examples of the four general kinds of relationships we can 
expect to find among sequentially ordered message events and so we can expect that they 
will be approximated by empirical data. 


[Figure 11. Entropy profile: E(x, y) and H_x(S) plotted between successive phonemes 1 to 7, 
with the class type of each pair marked beneath the points.] 

Once entropy profiles have been computed for a variety of texts, it would 
be of interest to determine the degree to which the points of high entropy in 
the texts coincide with the morpheme boundaries. If such a correspondence is 
found, it would be possible to define morphemes objectively in terms of entropy 
relations and perhaps it would be possible to distinguish various types of mor- 
phemes in terms of these entropy relations. If such a correspondence does not 
occur, we can only hope that the points of high entropy have enough common 
characteristics to permit the identification of new linguistic units. The discussion 
of entropy profiles has been mainly concerned with the isolation of morphemic 
or morpheme-like units as states of some higher order Markov Process. Once 
these units have been determined, we could transcribe our texts in terms of 
these known units and repeat the analysis, with the aim of determining yet 
higher order units. 


5.3.4. The ‘Cloze’ Procedure 


While not strictly an application of entropy measurements, a new method 
of measuring ‘comprehensibility’ of relatively large scale texts being developed57 
by Wilson Taylor is certainly relevant here and could be translated into information 
theory statistics. The underlying logic of the method is as follows: In the 
process of encoding, transitional dependencies among semantic events, among 
grammatical and syntactical regularities, and also (although less importantly 
here) within skill sequences are simultaneously contributing to a rather precise 
selection among hierarchies of alternatives at each choice point. If the encoder 
producing a message and the decoder receiving it happen to have highly similar 
semantic and grammatical habit systems, the decoder ought to be able to predict 
or anticipate what the encoder will produce at each moment with considerable 
accuracy. In other words, if both members of the communication act share 
common associations and common constructive tendencies, they should be able 
to anticipate each other’s verbalization. 


57 Cf. W. Taylor, Cloze procedure: A new tool for measuring readability, Journalism 
Quarterly 30.415-33 (1953). 

The term ‘cloze’ is derived from the gestalt notion of closure, e.g., the tendency 
to fill in a missing gap in a well-structured whole. Given the sequence ‘Chickens 
cackle and————quack,’ almost anyone would immediately supply the missing 
‘ducks.’ Similarly, given ‘The old man————along the dusty road,’ almost 
everyone will supply some verb form (grammatical disposition) and most will 
be affected by the ‘old’ element semantically and choose an appropriate verb, 
such as ‘hobbled,’ ‘crept,’ or ‘limped.’ As the actual procedure has been worked 
out, the experimenter deletes every nth word in a text (it has been shown that 
this automatic procedure works as well as or better than either deleting specific 
categories of words or words at random, providing one is using a text of sufficient 
size), leaving equal sized blanks in their places, and decoding subjects 
read through the passage filling in the missing words. The more closely the 
totality of sequential cues in the passage elicits at each test point the same word 
selection as the original author’s, the higher will be the decoder’s ‘cloze’ score 
(only absolutely correct fill-ins are counted, judging synonyms proving too 
difficult and not materially affecting results). 
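
The mechanical part of the procedure just described, deleting every nth word and counting exact fill-ins, may be sketched as follows. The function names, the blank string, and the `start` parameter are our own illustrative choices and are not taken from Taylor's report; words are matched exactly, in keeping with the scoring rule stated above.

```python
def make_cloze(words, n, start=None, blank="_____"):
    """Delete words start, start + n, start + 2n, ... (counting from 1).

    By default start = n, i.e. every nth word is deleted. Returns the
    mutilated word list and a dict mapping deleted positions to the originals.
    """
    start = n if start is None else start
    deleted = {i: w for i, w in enumerate(words) if (i + 1 - start) % n == 0}
    mutilated = [blank if i in deleted else w for i, w in enumerate(words)]
    return mutilated, deleted


def cloze_score(deleted, answers):
    """Proportion of blanks filled with exactly the original word."""
    if not deleted:
        return 0.0
    hits = sum(answers.get(i) == w for i, w in deleted.items())
    return hits / len(deleted)


words = "The old man hobbled along the dusty road".split()
mutilated, deleted = make_cloze(words, 4)
print(" ".join(mutilated))
print(cloze_score(deleted, {3: "hobbled", 7: "path"}))
```

A subject who supplies ‘hobbled’ but writes ‘path’ for ‘road’ thus receives credit for one of the two blanks, since synonyms are not counted.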

Taylor has demonstrated the feasibility of this technique as an index of ‘read- 
ability’—in fact, it behaves much more satisfactorily than either the Flesch or 
Dale-Chall formulas. Not only does it order the same materials used as demon- 
strations by the authors of these standard formulas in the same way, but on 
some special test materials it alone yields sensible results. For example, both 
Flesch and Dale-Chall indicate a passage from Gertrude Stein as being very 
‘easy’! Taylor’s ‘Cloze’ score shows Stein, more appropriately, as very difficult. 
In other words, this method takes into account the highly unpredictable semantic 
and grammatical sequencing characteristic of Stein. Taylor has also tested the 
assumption that his method is essentially a measure of degree of ‘comprehension.’ 
In a very carefully designed experiment using Air Force training materials for 
which comprehension tests were already available, he showed that ‘Cloze’ 
scores correlated very highly with initial comprehension scores (pre-message) 
and also predicted terminal comprehension (post-message). 

There are many possible applications of this technique to psycholinguistic 
problems. For example, it is possible to construct alternatively coded messages 
on the same topic and use the ‘Cloze’ method to determine which form produces 
the most information transfer (cf. section 7.3 for discussion of an entropy measure 
of information transfer which could be combined with the Taylor procedure). 
Along similar lines, one may construct messages which vary in the transitional 
dependency of either their semantic or grammatical sequencing, or both, and 
use the ‘Cloze’ procedure to measure the effects produced on decoders (cf., 
section 5.4 for discussion of a method for constructing such messages). Using 
the same message and deleting every, say, fifth word, but using five equated 
groups to cover the entire message (e.g., group I having words 1, 6, 11, etc. 
deleted, group II having words 2, 7, 12, etc. deleted, and so forth) it should be 
possible to use this method to construct an entropy profile at the word unit level. 
The significant advantage of Taylor’s ‘Cloze’ procedure is that it taps simul- 
taneously all of the complex determinants affecting word choice, both at various 
levels of organization and through long stretches of sequencing; it is applicable 
to comparing encoders (e.g., readability), messages (comprehensibility), and 
decoders (e.g., individual differences in reading skills, second language mastery, 
information about topic, etc.). 
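
The suggestion of five equated groups can be made concrete with a short sketch. The assignment of deletions to groups follows the text; the use of the entropy of the subjects' responses at each word position as the profile value is our own assumption about how the results might be translated into information theory statistics, and the function names are hypothetical.

```python
from collections import Counter
from math import log2


def deletion_groups(n_words, n_groups=5):
    """Word positions (0-based) deleted for each group, numbered from 1.

    Group 1 loses words 1, 6, 11, ...; group 2 loses words 2, 7, 12, ...; etc.
    """
    return {g: list(range(g - 1, n_words, n_groups))
            for g in range(1, n_groups + 1)}


def response_entropy(responses):
    """Entropy in bits of the words supplied by subjects at one blank."""
    counts = Counter(responses)
    total = len(responses)
    return sum(c / total * log2(total / c) for c in counts.values())


def word_profile(responses_by_position):
    """One entropy value per word position, in text order."""
    return [response_entropy(r) for r in responses_by_position]


print(deletion_groups(12))
print(word_profile([["ducks"] * 4, ["crept", "limped", "hobbled", "walked"]]))
```

Pooling the five groups gives one set of responses for every word in the message, so that a profile can be drawn across the whole text.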


5.4. Transitional Organization: Association Techniques58 


In any empirical analysis of verbal behavior as it occurs ‘naturally’ (in a 
conversation, an interview, a letter, a book, an oration or what have you) the 
investigator is likely to feel overwhelmed with problems of multiple causation 
affecting the production of utterances. In an effort to simplify the analysis, 
one might (following Skinner) divide the ‘causes’ into four major groups: (1) 
States of the speaker. Here one might study such variables as drives or needs, 
attitudes, beliefs, fatigue, etc. (2) Audience variables. The language or sub- 
language spoken or understood by the audience, the stimuli from the audience 
indicating approval or disapproval, the ease with which the audience can hear 
the speaker, etc., are important considerations within this category. (3) Verbal 
and non-verbal referential stimuli. Under this heading one might investigate the 
effects of presence or absence of things being talked about, past experiences in 
the presence of given stimuli, discriminative reinforcement histories, etc. (4) 
Intraverbal connections. In this category one might study the tendencies of a 
speaker’s responses to influence his future responses; i.e., the tendency of the 
choice of one word to lead to the choice of a related word later, the choice of one 
form of utterance to lead to the choice of a particular subsequent one, etc. Since 
we are concerned here with transitional organization of language behavior, the 
fourth category is the one to which we may turn our attention. 

The general assumption being made here is that emission of any antecedent 
response increases the probability of occurrence of a hierarchy of interrelated 
subsequent responses. It is also assumed, of course, that these intraverbal con- 
nections arise in the same manner in which any skill sequence arises, through 
repetition, contiguity, differential reinforcement. It should be recognized that 
this analysis does not lead immediately to a tool for breaking down contextual 
effects. Any utterance (especially a single word) may be thought of as belonging 
to a large number of response hierarchies, sound classes, form classes, sequence 
classes, frequency classes, etc. The analysis does suggest, however, experimental 
techniques for dealing with fragments of context in simple situations in which 
their specific influences may be more precisely studied. 


58 James J. Jenkins. 


5.4.1. The Word Association Technique 


A first approach to the examination of interrelationships between word units 
may be found in the classic word association test. In this kind of test the subject 
is instructed to respond to a stimulus word with the first word (other than the 
stimulus word) that occurs to him. Substantial amounts of data have been 
collected on the responses of groups of people to small sets of stimulus words. 
For stimulus words occurring with high frequency in a given culture hierarchies 
of response words have been observed. For a given stimulus word a large number 
of subjects (sometimes as high as 80 per cent) may give the same response word; 
a much smaller number of subjects give a second response word; a slightly 
smaller number gives a third word, and so on down to responses which are made 
only by individual subjects. Thus, for a stimulus word the probabilities of given 
responses may be specified for cultural groups. While there is some evidence 
that the response hierarchies obtained from a group of subjects are related to 
individual hierarchies of response, this has not been clearly established. 
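
A response hierarchy of the kind described can be tabulated directly from the pooled responses of a group. The sketch below uses invented responses to a single stimulus word purely for illustration; it does not reproduce any published association norms.

```python
from collections import Counter


def response_hierarchy(responses):
    """Response words ranked by the proportion of subjects giving each."""
    counts = Counter(responses)
    total = sum(counts.values())
    return [(word, count / total) for word, count in counts.most_common()]


# Invented responses of ten subjects to one stimulus word.
responses = ["chair"] * 8 + ["food"] + ["desk"]
for word, proportion in response_hierarchy(responses):
    print(word, proportion)
```

The proportions serve as estimates of the probability of each response for the group sampled; whether they also describe the hierarchy of any individual subject is, as noted above, not established.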

It is possible, then, with this technique to ascertain the transitional probability 
between stimulus words and response words for a given group under these 
restricted conditions. This amounts to specifying a divergent hierarchy of re- 
sponses to each given stimulus word. 


In addition, it has recently been shown that these probabilities are directly related to 
the transitional probabilities between the same words when they are both produced by the 
subject himself in a restricted recall situation. If S-R words from the association test are 
scrambled in a list and read to subjects who are asked to recall them, it can be observed 
that in recalling the list, the subjects tend markedly to recall the words of the pairs together. 
It appears that recalling one word of a pair acts as a stimulus for the recall of the second 
word of the pair. As the strength of the word pairs on the association test norms is increased, 
the amount of pairing in recall increases. To a considerable extent the order (apparently 
freely determined by the subject) is predictable from a knowledge of the cultural S-R 
pairs. Our information is thus extended from a knowledge of responses made to outside 
stimuli to a prediction of responses made to previous responses. This is an important step, 
since it suggests that we may use the word association test data in constructing experi- 
ments to examine the effect of high and low transitional probabilities on the performance 
of a variety of verbal tasks. 

Past experiments in free association have also demonstrated the importance of instruc- 
tions given the subject in the determination of the response words made to the stimuli. 
If, for example, the suggestion is made that opposites may be given or even more directly 
the subjects are requested to respond with opposites, the variety of response words de- 
creases markedly and the frequencies of a few responses rise correspondingly. In this 
situation also the responses are in general more rapid. It is as if a major portion of the 
response hierarchy were removed and only the specific subportion designated by dual class 
membership (related to stimulus word and opposition) were available. The existence of 
this phenomenon illustrates the possibility of determining transitional probabilities under 
special limiting conditions. 




It has also been shown that speed of response to a stimulus word in free association is 
an index to the rarity of the response word (although it may also indicate emotional in- 
volvement or competition of response words). In like manner, speed of response is also a 
function of the familiarity or rarity of the stimulus word. This is additional evidence 
supporting the notion that the free association test measures transitional probabilities 
in a manner which should be useful in experimentation which moves closer to ‘real life’ 
situations and the problems of context. 


5.4.2. Word Association in the Study of Language Structure 


The above characteristics of the word association test suggested a major 
experiment designed to evaluate the effect of varied transitional probabilities 
measured in this manner. In brief, the experiment would consist of three stages: 
(1) building up networks of high and low transitional probabilities by word 
association techniques, (2) using these networks to construct stories or essays 
of very high and very low average transitions, and (3) testing these stories against 
each other for differences in comprehension, reading or speaking ease, ability 
to withstand mutilation (‘Cloze’ procedure), etc. This would constitute a full- 
scale test of the efficacy of this approach to transitional probabilities. 


Stage one could be accomplished by capitalizing on the control of set and the measure- 
ment of association strength which have been pointed out above. A group of subjects could 
be asked to respond with the first verb they think of when a particular noun is given; the 
first noun they think of for a given adjective, the first adverb for a given verb, etc. The most 
popular and most rapidly given words would be paired with the stimulus words to con- 
struct high probability sentences. The very infrequent and delayed responses would be 
used for the low probability sentences. While the stories or essays resulting from the ma- 
nipulation of these rather sizeable amounts of data might not be great literature, it seems 
likely that fairly parallel texts (in content) could be developed. In stage two these texts 
would be assembled and tested to exclude extraneous variables, such as differences in the 
basic frequency of occurrence of words in the culture, the sentence constructions, order of 
presentation of material, etc. In stage three the texts would be presented to new groups 
of subjects in controlled reading situations. It would be predicted that the high transitional 
probability text would be read faster (both silently and aloud), would require fewer eye 
fixations, would be more completely understood (as determined by a comprehension test), 
would be more accurately recalled after a lapse of time, and would be more easily read 
after mutilation (i.e., when every fifth or tenth word is deleted). All of these predictions 
can be readily tested. 


It might be of further interest and lend verification to this study if another 
group of subjects were simply given the lists of words and asked to construct 
stories. It would be predicted that these subjects would use the combinations 
found to be of high transitional probability and avoid the low transitional 
probability combinations. This would constitute further evidence for the simi- 
larity of the stimulus-response and response-response conditions and would 
increase our information concerning these phenomena. If these predictions are 
borne out, the word association test would appear to be an instrument par 
excellence for the study and examination of transitional probabilities as they 
affect context. 

Other suggestions appear relevant both for the understanding of context 
and for the understanding of the phenomenon of word association itself. The 



linguist looks at free association data or serial associations and notes that stimuli 
and responses often fall in the same form classes. The data presently available 
on free associations are not sufficient to determine if this is the case, since for 
the most part stimuli have been nouns and adjectives with a very few verbs. 
It is proposed that a large body of associative data be built up, systematically 
sampling grammatical classes, grammatical ‘tags’ and various lexical units. 
Pronouns, verbs, adverbs, prepositions, conjunctions, relative pronouns, etc. 
must be studied. Various changes in the stimuli (from singular to plural, present 
to past tense, etc.) need to be explored. This simple kind of experimentation 
may contribute markedly to our understanding of language habits which are 
essential to mature language behavior. Cross linguistic studies would be of 
interest and may further embody suggestions for second language learning. 

A straightforward linguistic analysis of free association also may contribute 
to a clarification of ‘normal’ response categories. In spite of the long history of 
use of free association tests, a satisfactory method of classification has not been 
found. The most common attempts have been an unsystematic mixture of 
semantic, psychological and linguistic criteria. The inadequacy of these measures 
is indicated by the following example: in one system the response ‘length’ to the 
stimulus ‘long’ is classified as an example of ‘compounding;’ however, the pair
‘height-high’ is an example of ‘phonetic similarity’ and is the same as the response 
‘able’ to ‘table.’ Perhaps purely linguistic criteria may be found which will 
classify possible responses. While it is unlikely that any system will ‘explain’ 
all of the responses, a suggestion for classification is presented here. It is intended 
to apply it to the broad collection of associations proposed above. It seems prob- 
able that refinements may be included as the work progresses. 

Word associations may be interpreted as a result of relative distribution of 
the stimuli and responses. The similarity between any two words can be con- 
ceived linguistically as the degree of similarity in distribution. However, it 
seems apparent that this similarity may be profitably divided into two classes, 
paradigmatic and syntagmatic. Two words are considered paradigmatically 
similar to the extent that they are substitutable in the identical frame (this 
corresponds rather closely to Zellig Harris’ use of the term ‘selection’) and 
syntagmatic to the extent that they follow one another in utterances. 

For example, if we were to measure the paradigmatic similarity between ‘table’ and its 
most common response in word association tests, i.e., ‘chair,’ we would investigate to what 
extent they occurred in the same frame. ‘Table’ and ‘chair,’ as well as almost any other 
member of the noun class, such as man, woman, dog, cat, etc., occur, for example, in the
frame, ‘I saw a ____.’ If we then consider the frame, ‘I bought a ____,’ we have
eliminated ‘man’ and ‘woman,’ but our class still includes ‘table,’ ‘chair,’ ‘cat,’ ‘dog,’
etc. At the other extreme is the frame ‘My favorite piece of furniture is a ____,’ in which
‘table’ and ‘chair’ occur, but not the others. Furthermore, ‘chair’ and not ‘table’ occurs in
the frame ‘I like to sit in an easy ____.’ We would then hypothesize that one factor in
any word association test would be the relative paradigmatic similarity of the hierarchy 
of responses, so that frequency of responses would be a function of paradigmatic similarity. 
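
The frame comparison just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the frame inventory simply transcribes the judgments given in the example, and the overlap index (the Jaccard ratio of shared frames to all frames accepting either word) is one possible choice of measure, not one proposed in the text. The names `FRAMES`, `frames_of`, and `paradigmatic_similarity` are hypothetical.

```python
# Hypothetical frame inventory: the words that may fill the blank in each frame.
# The entries follow the judgments in the example above; they are not measured data.
FRAMES = {
    "I saw a ____": {"table", "chair", "man", "woman", "dog", "cat"},
    "I bought a ____": {"table", "chair", "dog", "cat"},
    "My favorite piece of furniture is a ____": {"table", "chair"},
    "I like to sit in an easy ____": {"chair"},
}


def frames_of(word):
    """Return the set of frames in which `word` can occur."""
    return {frame for frame, fillers in FRAMES.items() if word in fillers}


def paradigmatic_similarity(a, b):
    """Share of the frames accepting either word that accept both (Jaccard index)."""
    frames_a, frames_b = frames_of(a), frames_of(b)
    union = frames_a | frames_b
    return len(frames_a & frames_b) / len(union) if union else 0.0


if __name__ == "__main__":
    for other in ("chair", "dog", "man"):
        print("table", other, round(paradigmatic_similarity("table", other), 2))
```

With these four frames, ‘table’ comes out closest to ‘chair,’ next closest to ‘dog,’ and least close to ‘man,’ which is the ordering the hypothesis would require of the response hierarchy.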

Another factor in forming word associations is the relative frequency with which words 
follow one another in a sequence. The frame ‘I saw a ____’ may obviously be completed
by a larger number of possibilities than ‘I bought a ____’ or ‘I sat on a ____.’ Conse-
quently, the associative strength of ‘sat-chair’ will be greater than ‘saw-chair.’ We can
then define syntagmatic similarity as the probability with which any one word will be 
followed immediately by the second. It seems reasonable to exclude from this analysis the 
grammatical morphemes, or function words, such as ‘a,’ ‘and,’ etc. 
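
As a rough illustration of this definition, the following Python sketch estimates the probability that one content word is immediately followed by another once function words have been removed. The utterances, the function-word list, and all names in the code are invented for the example; a real estimate would require the kind of extensive count discussed later in this section.

```python
from collections import Counter

# Hypothetical function-word list and toy utterances, invented for illustration only.
FUNCTION_WORDS = {"a", "an", "the", "and", "on", "in"}
UTTERANCES = [
    "I sat on a chair",
    "I sat on a chair",
    "I sat on a bench",
    "I saw a chair",
    "I saw a dog",
    "I saw a man",
    "I saw a table",
]


def content_pairs(utterance):
    """Adjacent word pairs after function words have been dropped."""
    words = [w for w in utterance.lower().split() if w not in FUNCTION_WORDS]
    return list(zip(words, words[1:]))


# Count every adjacent content-word pair, and how often each word is an antecedent.
PAIR_COUNTS = Counter(pair for u in UTTERANCES for pair in content_pairs(u))
FIRST_COUNTS = Counter()
for (first, _second), count in PAIR_COUNTS.items():
    FIRST_COUNTS[first] += count


def syntagmatic_similarity(first, second):
    """Estimated probability that `first` is immediately followed by `second`."""
    if FIRST_COUNTS[first] == 0:
        return 0.0
    return PAIR_COUNTS[(first, second)] / FIRST_COUNTS[first]


if __name__ == "__main__":
    print("sat -> chair", round(syntagmatic_similarity("sat", "chair"), 2))
    print("saw -> chair", round(syntagmatic_similarity("saw", "chair"), 2))
```

In this toy sample ‘sat’ is followed by ‘chair’ more reliably than ‘saw’ is, because ‘saw’ distributes its continuations over more alternatives.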

As presented up to this point, paradigmatic similarity is restricted almost exclusively 
to words of the same form class. Syntagmatic similarity, however, can be extended to in- 
clude both words of the same as well as of different form classes. The example above is one 
of association between verb and noun. If we include the frame ‘I bought a table
and ____,’ we can establish a syntagmatic similarity between words of the same form
class. 


If both paradigmatic and syntagmatic similarity may be factors in strength 
of association, it follows that the highest associative strength will be between 
words of the same form class, insofar as only these words can be similar both 
paradigmatically and syntagmatically. It is not surprising, then, to find that 
the most frequent types of responses among adults are ‘coordination’ (e.g., 
table-chair) and ‘contrast’ (e.g., black-white). Certain related hypotheses 
present themselves regarding what might be expected in word association tests. 
For example, if, as seems likely, the sequence ‘black and white’ is more frequent 
than ‘white and black’ in our culture, this difference should manifest itself in 
word association tests in that ‘black’ would tend to elicit ‘white’ significantly 
more than the reverse. In this light, the Kent-Rosanoff tests® were re-examined 
and yield the following cases (out of 1000 responses): table-chair, 844 vs. chair- 
table, 494; black-white, 706 vs. white-black, 605; hand-foot, 156 vs. foot-hand, 
198; long-short, 758 vs. short-long, 336. The latter is an example of a word, in 
this case ‘short,’ with two competing responses, namely, ‘tall’ and ‘long;’ ‘long,’ 
however, has just one main response. A series of words of this kind might be used 
in experiments to test the validity of this hypothesis. 
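
The comparison of the two orders can be tabulated directly from the figures quoted above. The sketch below computes, for each pair, the difference between the proportion of subjects giving the response in one order and the proportion giving it in the reverse order. The counts are those cited in the text; the function names and the choice of a simple difference of proportions are assumptions made for illustration, and no significance test is attempted.

```python
# Response counts per 1000 subjects, as quoted in the text.
# Each entry maps (stimulus, response) to (count in that order, count in the reverse order).
COUNTS = {
    ("table", "chair"): (844, 494),
    ("black", "white"): (706, 605),
    ("hand", "foot"): (156, 198),
    ("long", "short"): (758, 336),
}
SAMPLE_SIZE = 1000


def asymmetry(forward, reverse, n=SAMPLE_SIZE):
    """Forward proportion minus reverse proportion for one pair."""
    return (forward - reverse) / n


def summarize(counts=COUNTS):
    """Return the asymmetry for every pair in `counts`."""
    return {pair: asymmetry(f, r) for pair, (f, r) in counts.items()}


if __name__ == "__main__":
    for (stimulus, response), value in summarize().items():
        print(f"{stimulus}-{response}: {value:+.3f}")
```

On these figures three of the four pairs favor the first-listed order, while hand-foot comes out slightly in the opposite direction.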

It seems likely that a few special categories may have to be invoked for a
complete analysis of associations. For example, the ‘phonetic similarity’ class 
(e.g., table-able) may be indispensable, although it accounts for a very small per- 
centage of associations. Even here it seems likely that one might get a higher 
percentage of this type of response by selecting words which are either paradig- 
matically or syntagmatically similar as well as phonetically similar. 

A basic problem for this analysis, of course, is the determination of objective
measures of paradigmatic and syntagmatic similarity. As a beginning, paradig- 
matic similarity might be defined simply as common form class membership, 
but a stronger measure of similarity might be developed through judges’ ratings 
or sentence completion techniques. Syntagmatic similarity is even more difficult 
to measure. Ideally, an extensive count of spoken and written English might 
provide it, but the task is too great for practicality. Again, perhaps judges’
ratings or sentence completions will be required. Further work here is sorely
needed.


5° The terms are those used by Miller, Language and communication, based on work done 
by Woodrow and Lowell, Psychological monograph 22. 97 (1916). 
W. A. Russell and J. J. Jenkins, Kent-Rosanoff norms for Minnesota college students
(in press).




5.4.3. Context and Association 


Two experiments which have brought the word association work closer to 
context problems are given here to illustrate the uses of the technique as an 
experimental tool. 

Howes and Osgood“ have made a careful study of word associations in the 
determination of responses to a complex of four stimulus words. Subjects were 
told that they would be given four words and that they were to respond to the 
last word by writing the first other word it made them think of. Control sets 
used nonsense words or numbers preceding the last word. The experimental sets 
were devised to study the influence of adjacent words on the responses to the 
last word. Variables studied were the distance (in time) of an experimental 
word from the last word, the density of the experimental words (whether they 
were all calculated to influence the stimulus word in the same way; whether two 
of them were; whether only one of them was) and the cultural frequency (by 
Thorndike-Lorge count) of the experimental words used. As an example, consider
the stimulus word ‘man.’ Used alone it evokes ‘woman,’ ‘boy,’ ‘child,’
etc. If the word ‘yellow’ is inserted before it, does the response ‘Chinese,’
‘Japanese,’ ‘Jap,’ etc. appear? If so, does this response decrease as the word ‘yellow’ is
moved one or two words away from the stimulus word ‘man?’ If the word ‘alien’ 
or ‘eastern’ is added instead of the other neutral words in addition to ‘yellow’ 
does this increase the number of ‘Chinese’ responses? If in place of ‘yellow’ some 
rare synonym were used, would it achieve the same result? 

This experiment clearly demonstrated that all of these were significant vari- 
ables. The subjects did respond to the compound stimuli. The influence of an 
experimental word decreased as it moved away from the last word. Increasing 
density increased the number of influenced responses. Words of high cultural 
frequency exercised more influence than words of low cultural frequency. This 
experiment is an excellent example of the use of a simple tool to attack this 
complicated problem. Its implications are immediately obvious. 

A second experiment is one undertaken by MacCorquodale and others as 
part of the Minnesota Studies in Verbal Behavior. While the research is not
as yet complete, the influence of associative bonds in context appears to have 
been demonstrated. In an attempt to reveal ‘thematic strengthening,’ alternate 
sets of sentences were constructed to have the ‘same meaning.’ In one pair of 
such sentences a word was changed to a substitutable word, but one which it was 
felt strengthened a different response hierarchy. The sentences were left incomplete
and given to different groups of subjects for completion. For example,
one sentence (in its control form) read, ‘The children noticed that the snow was
beginning to hide the ground as they got out of ____.’ In its experimental form
this sentence read, ‘The children noticed that the snow was beginning to blanket
the ground as they got out of ____.’ The difference between the sentences, then,
lies in the response hierarchies evoked by the two words ‘hide’ and ‘blanket.’ 


*i D. H. Howes and C. E. Osgood, The effect of linguistic context on associative word 
probabilities, American Journal of Psychology (forthcoming). 


The sentence exercises only the control that the children must be getting out of 
something or somewhere. Any difference in the determination of what or where 
must be the result of the associations strengthened by the changed words. In this 
example, the control sentence elicited many references to ‘school,’ ‘the bus,’
‘the house,’ etc., and the experimental sentence in contrast elicited, as hypothesized,
a large number of references to ‘bed’ which were almost totally lacking
in the control group. Further experimentation with this technique should reveal 
in actual context the operation of the significant variables dealt with by Howes 
and Osgood. 

In summary it appears that many important questions regarding ‘language 
in action’ may be attacked with one of the oldest tools in the psychological 
repertoire, the free association test and its derivatives, and that many challeng- 
ing hypotheses are available for research. 


5.5 Channel Capacity in Semantic Decoding™ 


In Shannon’s development of information theory, channel capacity is defined 
as the maximum rate (expressed in bits per second) at which a communication 
channel can transmit messages with a minimal amount of error. When the rate 
at which messages are presented to a channel (i.e., the rate of input) exceeds
its capacity, the amount of random error in transmission increases with the
amount of excessive information. Comparable phenomena seem to occur in
ordinary language communication. When a radio announcer spins through a 
series of baseball scores—Yankees 2, Browns 4; Red Sox 12, Senators 5; Indians 
3, Tigers 7; White Sox 3, Athletics 2—what is a simple task for the encoder may 
be an impossible task for the decoder (who wants to know who played whom 
and with what result). If the decoder does hold onto one particular game and 
its result, he loses both what went before and what followed. In this case, the 
channel capacity of the decoder has been passed. 

Experimentally, it is necessary to deal with the human communicator as a 
single system intervening between manipulatable states of some physical input 
system and recordable states of some physical output system, e.g., as interven- 
ing between observable stimuli and responses. However, this total communicat- 
ing unit is comprised of many sub-systems whose limits, or capacities, may vary 
one from the other. One experimental problem is immediately apparent: in 
order to study the characteristics of any system in a communication chain, 
it is necessary to devise conditions under which the capacities of the other systems
are not limiting factors.


5.5.1 Theoretical Analysis 


For present purposes we shall eliminate by our choice of stimulus materials 
and response categories grammatical aspects of decoding. As shown in Figure 
12, the semantic system is here conceived as a set of mediating processes 
(r₁ . . . rₙ → s₁ . . . sₙ) which, as implicit reactions, are dependent to
variable degrees upon states of the input system (S₁ . . . Sₙ). We shall assume

*! Charles E. Osgood. 




[Figure 12. Diagram in three columns, INPUT SYSTEM, SEMANTIC SYSTEM, and OUTPUT SYSTEM: each input state S is joined by a decoding link to a mediating process (r, s), which is joined by an encoding link to an output state R.]


that, as with any system, mechanical or organic, the semantic system is limited 
in the number of different states which it can assume during any finite time, 
e.g., its rate of shifting from state to state is finite. The capacity of a system 
under optimal conditions will be called its maximum capacity. We shall also 
assume that the output system in human language behavior has a lower maxi- 
mum capacity than the semantic system, e.g., that sequences of ‘ideas’ can 
proceed at a faster rate than the sequences of vocalizations with which they are 
associated. What we shall call the functional capacity of a system—that which it
displays under a given set of conditions—is always equal to or less than its 
maximum capacity. 

What are some of the conditions affecting functional capacity? (1) The greater 
the conditional dependencies of states in a given system upon states in its antecedent 
system, the greater its functional capacity. Conditional dependency in this context 
is assumed to be equivalent to habit strength. Since latency of reaction is an 
inverse function of habit strength, the stronger the decoding habits the more 
rapidly each mediating reaction will follow presentation of the appropriate sign. 
One of Baseball’s Faithful, for whom the signs ‘Athletics’ and ‘Red Sox’ are 
strongly associated with differential mediators, would have less difficulty keeping 
up with the announcer's stream of scores than a casual follower of the sport.


Referring back to Figure 12, in decoding the conditional dependencies are those between
members of the set Sₙ and members of the set rₙ. In ordinary language communication, of
course, these conditional dependencies are very high (e.g., heard or seen words as physical
stimuli are strongly associated with particular significances and not with others); under
these conditions, therefore, functional capacities should tend to approach their maxima.


The question of the effect of transitional dependencies within a given system,
e.g., the tendency for certain states of the system to follow others with non-
chance probabilities, raises a number of complicated problems. Transitional
dependency (or redundancy) is assumed to be equivalent to ‘associations’ when
dealing with sequences in the semantic system (e.g., predictability of subsequent
members of the set rₙ given knowledge of the occurrence of antecedent members
of this set). Certainly, on a practical level, it seems obvious that the rate of
decoding will be faster when the sequence required of the semantic system is one
for which it is already ‘set’ on the basis of past experience; the Lord's Prayer
is presumably easier to decode than a series of baseball scores.


In the first place, it should be noted that the example given in the preceding paragraph 
involves high conditional dependencies as well—the sequence ‘impressed’ on the test system 
by its input happens to correspond to already established transitional probabilities within 
the test system. If the conditional dependencies between input and semantic systems were 
near zero (for example, with a series of low association value nonsense syllables as stimuli), 
only random output could result, and the transitional organization of the semantic system 
would seem to be irrelevant. On the other hand, with high degrees of conditional dependency 
between input and semantic systems the amount of transitional dependency operative 
can be varied independently by manipulating the sequences of input signals (e.g., from a 
random sequence of signs like AGAINST WOULD IT THE FOLLOWED SET to one of 
high transitional predictability like A ROLLING STONE GATHERS NO MOSS). This 
was essentially the variable investigated by Miller and Selfridge in studying the ease of
learning verbal materials chosen with varying approximations to English syntactical 
structure. 


Taking into account conditional dependency, we may then state that (2) 
the more the sequences of states impressed on a system by the input correspond 
to existing transitional dependencies within the system itself, the greater will be its 
functional capacity. Assuming equally strong decoding habits, the rate at which A 
ROLLING STONE GATHERS NO MOSS can be handled by the semantic 
system would be faster than the rate of handling messages like STONE A NO 
MOSS ROLLING GATHERS. 


So far we have assumed that the number of units in messages, as determined objectively 
from the input or output, necessarily corresponds to the number of states assumed by the 
semantic system in any sequence, e.g., A ROLLING STONE GATHERS NO MOSS contains
six units because there are six ‘words’ separated by white spaces. But let us suppose
for the sake of argument that in English ROLLING is always followed by STONE and 
STONE is always preceded by ROLLING, i.e., maximum transitional dependency or 
redundancy—would ROLLING STONE represent one state or two sequential states in the 
semantic system? Conversely, does everything within the brackets defined by either white 
spaces (orthography) or pauses of certain length (speech) constitute a single unit semantically?
Since GATHERS is divisible linguistically into GATHER and S in morphemic
analysis, aren’t there at least two semantic units here? 

We do not as yet have any satisfactory ways for identifying semantic units and correlat- 
ing them with message units (cf., discussion of psycholinguistic units, section 3). It seems 
likely, however, that high orders of transitional dependency within systems will be equiva- 
lent to reduction in numbers of units or states. The 'short circuiting process' envisaged here 
is presumably more feasible in the semantic system than in the motor skill output system. 
If r₁ is highly predictive of rₙ, the required indexing responses can be initiated by s₁ rather
than waiting for sₙ; but just because R₁ (e.g., saying A ROLLING STONE ...) is highly
predictive of Rₙ (saying . . . MOSS) does not mean that the encoder will skip the intervening
vocalization. Such a ‘short circuiting process’ may, of course, underlie the empirical
law associated with Zipf to the effect that frequently used forms tend to become reduced in
length.


So far nothing has been said about the number of alternative states among 
which a system must choose, the variable most often dealt with in information 
theory studies. The usual observation is that performance, as indexed by latency,
errors, or some other measure, decreases as number of alternatives is increased.
In such experiments, however, conditional dependencies have been low. In intelligi- 
bility studies, for example (cf., Miller's Language and Communication), it is 
necessary to work with a signal/noise ratio near discrimination threshold for 
number of alternatives to have its maximum effect. Obviously, if the signal 
were clear, it would make little difference how many alternatives were allowed. 
In ordinary communication, the number of alternative semantic states or mean- 
ings is extremely large, but we have no trouble in decoding as long as the pe- 
ripheral signals are clear. 


In effect, such studies have used the number of alternatives as a means of manipulating 
the conditional and transitional dependencies in their decoding and encoding. Such manipu- 
lation is feasible if conditional dependencies are low so that all of the possible states of a
system follow a given state of the antecedent system with nearly equal probabilities. In
such a situation, where habit strengths for all responses are nearly equal, additional alternatives
have the effect of increasing response randomness or entropy. However, if conditional
dependencies are generally high so that given states of the antecedent system reliably lead 
to particular states of the subsequent system, then one habit strength is so much larger 
than the others that additional alternatives can have little effect on response entropy. 
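
The contrast drawn in this paragraph can be checked numerically with Shannon's entropy measure. The sketch below compares response entropy when all habit strengths are equal with response entropy when one habit strength dominates. The dominant probability of 0.97 and the function names are arbitrary choices made for illustration; habit strengths are treated here simply as response probabilities, which is itself an assumption.

```python
import math


def entropy_bits(probs):
    """Shannon entropy, in bits, of a discrete response distribution."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def equal_strengths(k):
    """Low conditional dependency: k responses of equal probability."""
    return [1.0 / k] * k


def dominant_strength(k, p_dominant=0.97):
    """High conditional dependency: one response takes p_dominant (an arbitrary
    illustrative value); the remaining k - 1 responses share the rest equally."""
    if k == 1:
        return [1.0]
    rest = (1.0 - p_dominant) / (k - 1)
    return [p_dominant] + [rest] * (k - 1)


if __name__ == "__main__":
    for k in (2, 4, 8, 16):
        print(k,
              round(entropy_bits(equal_strengths(k)), 2),
              round(entropy_bits(dominant_strength(k)), 2))
```

With equal strengths the entropy rises from 1 bit at two alternatives to 4 bits at sixteen; with one dominant response it stays well under half a bit throughout, so that adding alternatives changes it very little.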


Similarly, number of alternatives should become a less important determiner
of channel capacity as transitional dependencies within the system
increase: if alternative b is highly dependent upon occurrence of alternative
a and alternative d is highly dependent upon occurrence of alternative c, we 
have effectively reduced the alternatives from four to two. Assuming these 
arguments to be valid, (3) the greater the number of alternative states required of a 
system, the lower will be its functional capacity; the effectiveness of this factor varies 
inversely with both the conditional and transitional dependencies involved, having 
no effect when either is maximal. In other words, channel capacity becomes 
independent of number of alternatives when either conditional dependency 
(predictability of states of the subsequent system from those of the antecedent 
system) or transitional dependency (prediction of subsequent states of the test 
system from antecedent states of the same system) becomes maximal. 

A final general variable to be considered is the nature of the alternatives representing
various dimensions. It should be easier, for example, for a subject to
choose among four objects differing only in color than to choose among four 
objects differing simultaneously in shape and color, e.g., among red circle, green 
circle, yellow circle and blue circle as compared with among red circle, green 
circle, red square, and green square. Generalizing, (4) if total number of alternatives 
is held constant, the slope of channel capacity as a function of number of alternatives 
should be steeper as the dimensionality of the alternatives is increased. All of the 
hypotheses described above are susceptible to experimental test, as well as a 
number of secondary hypotheses to be described in course. 


5.5.2. Experimental Requirements


In most psychological experiments dealing with intact human subjects we
manipulate input (stimuli) and observe output (responses). Therefore we are
necessarily dealing with the complete decoding-encoding sequence and all of 
the systems intervening between S and R. In the information theory sense, 
we are necessarily treating the individual as a channel connecting input and 
output systems. If we are interested in the capacity of any particular system, 
it is necessary that the contribution of other systems be minimal and roughly 
constant. If we are studying decoding time, we want to be able to segregate 
encoding time. There seems to be no direct way to index decoding time as a 
separable portion of total time (e.g., time between presentation of stimulus and 
occurrence of reaction). On the other hand, this does seem to be possible in the 
case of encoding time. Therefore we would start with decoding time as the variable
and encoding time as the constant.

The general nature of the research proposal is as follows: (1) We provide 
the subject with an extremely simple and overly practiced encoding response 
(e.g., reaching out and touching an object). (2) Using optimally coded input, 
we give him practice at the encoding alternatives until conditional dependency 
under this condition is maximal (e.g., the ‘locations’ of the objects to be touched 
perfectly established). Optimal coding in this case might be flashing pictures 
of the objects to be touched on the screen one at a time (e.g., shown a picture of 
the round, red, tall object, he must touch it as soon as possible). Encoding time 
under these conditions should quickly reach a stable minimum. (3) The subject 
is now presented with serial verbal information, either spoken or written, such as 
ROUND...RED... TALL, and must react by touching the correct object 
as soon as possible after hearing the last signal. (4) The rate of presentation of 
this serial information is gradually increased. Measurement is made of both 
total time (from onset of first signal to termination of encoding reaction) and 
encoding time (from end of last signal to termination of encoding reaction). 

The general nature of the results to be expected is shown diagrammatically 
in Figure 13. Up to a certain critical rate of input (range a), total time will be a 
decreasing function of increasing rate of presentation and encoding time will 
be a constant. Encoding time is constant through this range because it depends 
solely upon decoding of the last signal plus the constant encoding time—prior 
signals are completely decoded before the next appears. Total time decreases 
through this range because the rate of presentation is becoming faster. For a 
certain range beyond this critical point (range b), total time will remain at some 
constant value while encoding time gradually increases. Encoding time in- 
creases because it now includes increasing time spent in decoding prior signals
(e.g., the subject is still decoding ROUND when RED appears and is still de- 
coding RED when TALL appears, starting the measurement of encoding time). 
Total time remains constant through this range because the increase in encoding 
time is compensated for by the more rapid rate of input. At some further point, 
total time should become variable, and this should be accompanied by appearance 
of frequent errors (range c). 

At the first critical point, that at which total time becomes constant and
encoding time begins to increase, the difference between total time and encoding
time should provide a measure of decoding time under these particular conditions,
e.g., that amount of time required for decoding n−1 input signals. The projection
of this critical point on the base line (shown by dashed arrow in Figure 13)
should indicate the decoding channel capacity under these conditions, e.g., the
rate of input events in units per second which can be handled by the system.


[Figure 13. Total time and encoding time as functions of rate of input (units per second), with ranges a, b, and c marked along the base line.]
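
The expected results for ranges a and b can be reproduced with a very simple serial model, sketched below in Python. The model is an illustration only and rests on assumptions not stated in the text: signals are treated as instantaneous and evenly spaced, each signal takes a fixed decoding time, signals are decoded one at a time in order of arrival, and the overt response takes a fixed time once the last signal is decoded. The parameter values and names are hypothetical, and the errors expected in range c are not modeled.

```python
def predicted_times(rate, n_signals=3, decode_time=0.4, encode_time=0.3):
    """Return (total_time, measured_encoding_time) in seconds for one trial.

    Hypothetical serial model: signal i arrives at i / rate seconds, decoding of a
    signal starts when it has arrived and the previous signal is finished, and the
    response follows the last decoding by `encode_time`.
    """
    interval = 1.0 / rate
    finish = 0.0
    for i in range(n_signals):
        arrival = i * interval
        start = max(arrival, finish)   # wait for the signal or for the decoder
        finish = start + decode_time
    response = finish + encode_time
    last_signal = (n_signals - 1) * interval
    # "Encoding time" as measured in the proposal runs from the last signal to the response.
    return response, response - last_signal


if __name__ == "__main__":
    for rate in (1.0, 2.0, 2.5, 5.0, 10.0):
        total, encoding = predicted_times(rate)
        print(f"rate={rate:4.1f}  total={total:.2f}  encoding={encoding:.2f}")
```

Under these assumptions total time falls and measured encoding time stays constant until the rate reaches 1/decode_time (2.5 units per second with the values above); beyond that rate total time is constant and measured encoding time rises. At the critical rate the difference between the two measures equals the time needed to decode n−1 signals, as stated in the text.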


Design 1. Materials might consist of a set of objects variable in four ways through two 
dimensions (SHAPE: circular, square, triangular, oval; COLOR: red, yellow, green, blue) 
and in two ways through two other dimensions (CROSS-SECTIONAL SIZE: wide, narrow; 
HEIGHT: tall, short). At any one time, a maximum of 16 alternatives may be set in the 
panel, either each of 4 shapes in each of 4 colors (16 alternatives, but only two dimensions) 
or each combination of 2 shapes, in 2 colors, of 2 sizes, and having 2 heights (16 alterna- 
tives involving 4 binary choices). In this way one may investigate the effects of varying 
the number of alternatives when dimensionality is either held constant or varied with
number of alternatives. These objects would probably be displayed in a panel against 
electrical contact switches, so arranged that a mere touch against any one will stop the 
timers for total and encoding time. 
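
The two sixteen-object panels of Design 1 can be enumerated to confirm that they carry the same amount of information while differing in dimensionality. The attribute values below are those listed in the design; which two shapes and two colors enter the four-dimensional panel is not specified in the text, so the first two of each are taken here as an arbitrary assumption, and the variable names are hypothetical.

```python
import math
from itertools import product

SHAPES = ["circular", "square", "triangular", "oval"]
COLORS = ["red", "yellow", "green", "blue"]
SIZES = ["wide", "narrow"]
HEIGHTS = ["tall", "short"]

# Panel A: 4 shapes x 4 colors (two dimensions, four values each).
two_dimensional = list(product(SHAPES, COLORS))

# Panel B: 2 shapes x 2 colors x 2 sizes x 2 heights (four binary dimensions).
# Taking the first two shapes and colors is an arbitrary choice for illustration.
four_dimensional = list(product(SHAPES[:2], COLORS[:2], SIZES, HEIGHTS))

if __name__ == "__main__":
    for name, panel in (("2 dimensions", two_dimensional),
                        ("4 dimensions", four_dimensional)):
        print(name, len(panel), "alternatives,", math.log2(len(panel)), "bits")
```

Both panels contain 16 alternatives, or 4 bits per selection, so any difference in decoding between them would be attributable to dimensionality rather than to number of alternatives.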

Design 2. A closer approach to typical linguistic materials could be obtained with a panel 
of ‘nominal’ objects arranged in 1 to 4 rows or columns, these objects being BALL, WHEEL, 
HAND, FACE, for example, and being set on levers. Each row could be in a different 
color or some other ‘adjectival’ variable. Each object could be capable of movement in 4 
‘verbal’ ways, e.g., PUSH, RAISE, TURN, SHAKE, and in 4 ‘adverbial’ modes, e.g., 
QUICKLY, SLOWLY, SMOOTHLY, ROUGHLY. Again, the input information could be 
recorded on tape and could be varied in both rate and complexity with respect to the re- 
sponse board. At the simplest level would be commands involving only two variables, e.g., 
PUSH A WHEEL, SHAKE A BALL; this could be extended to RAISE THE RED FACE 
and TURN THE BLUE WHEEL, etc.; and extended further to four alternative linguistic 
dimensions, e.g., SHAKE THE YELLOW HAND SLOWLY or TURN THE GREEN 
BALL SMOOTHLY.

Design 3. This general type of method could probably be extended to decoding of pictorial
materials. We might first give the subject a statement, e.g., THE CIRCLE IS RED, then 
flash on the screen a simple picture with the subject to respond either true or false as quickly 
as possible. The picture shown could vary from the simplest case of being a red (or not red) 
circle, to a binary situation showing a circle and a square, one red and the other not red; 
similarly, the statement to be tested against the picture could be varied from THE CIRCLE 


124 PSYCHOLINGUISTICS: A SURVEY OF THEORY AND RESEARCH PROBLEMS 


IS RED to THE LARGE CIRCLE IS RED (with appropriate samples of objects shown), 
and so forth. Even more complex linguistic combinations could be used with appropriate 
pictures, such as THE LARGE BALL UNDER THE ROUND TABLE IS GREEN. Extrinsic
variables would be such things as frequency of usage of the labels used (e.g., decoding
habit strengths), amount of relevant and irrelevant information, transitional
predictability of the sequences used (e.g., THE MAN IS SMOKING A PIPE should be
decoded correctly more quickly than THE WOMAN IS SMOKING A PIPE). 


5.5.3. Test of Predictions 


(1) The rate of presentation at which encoding time begins to increase will always 
be equal to that at which total time becomes constant. This applies to all situations 
and is important because it makes possible the specification of empirical units 
of decoding channel capacity. If this prediction does not hold, it means that our 
theoretical analysis of this general situation has been wrong. If it does hold, 
then we are in a position to explore the effects of many other variables upon 
decoding time, using this critical rate as an index. 

(2) Channel capacity will be an increasing function of the strength of the decoding 
habits involved—e.g., of conditional dependencies. In the sample material given 
above, conditional dependencies were maximal—the decoding significances of 
GREEN, ROUND and so forth are maximal—and this condition therefore 
serves as a control. If nonsense syllables were substituted for these words with 
other groups of subjects, and different groups were given varying amounts of 
pre-training in decoding (seeing particular nonsense items with appropriate 
objects), it would be possible to test this prediction. The greater the amount of 
pre-training (and hence, theoretically, the greater the conditional dependency), 
the greater should become the decoding channel capacity. The function derived 
would presumably be typical of other learning phenomena, e.g., a negatively 
accelerated growth curve. Another way of testing this prediction would be to use 
meaningful materials varying in familiarity or frequency of usage. If VERMIL- 
LION, MAUVE, TURQUOISE, and so on were substituted for familiar color 
labels, one would expect decoding channel capacity to decrease. 
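
What a negatively accelerated growth curve would look like for this prediction can be sketched as follows. The exponential form, the asymptote of 3.0 units per second, and the rate constant are all assumptions chosen for illustration; the text predicts only the general shape of the function, not its equation or its parameters.

```python
import math


def decoding_capacity(pretraining_trials, max_capacity=3.0, rate=0.15):
    """Hypothetical decoding channel capacity (units per second) after a given
    number of pre-training trials, rising toward `max_capacity` with
    diminishing gains per trial."""
    return max_capacity * (1.0 - math.exp(-rate * pretraining_trials))


if __name__ == "__main__":
    for trials in (0, 5, 10, 20, 40):
        print(trials, round(decoding_capacity(trials), 2))
```

Each additional block of pre-training adds less capacity than the block before it, and the curve levels off at the capacity assumed for fully familiar labels.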

4(8) Channel capacity will be an increasing function of the strengths of associations 
between sequential semantic states, e.g., transitional dependency. Here one could 
manipulate external redundancy (for example, man vs. woman smoking pipe 
as discussed above or “turn wheel” vs. “turn face" in another design) or pre- 
experimental training (for example, by giving training in which certain se- 
quences of nonsense syllables were highly probable and others unlikely). The 
manipulation of pre-experimental training would permit the greatest control and 
hence presumably yield the most stable functions. 

(4) Channel capacity will decrease with number of alternatives. (5) Channel 
capacity will be a steeper function of number of alternatives when number of dimen- 
sions of variation also increases than when dimensionality is constant. The materials 
described under design 1 provide a means of testing these predictions. The total 
number of alternatives can be increased from 4 to 16 with dimensionality either 
constant (from 2 shapes in 2 colors to 4 shapes in 4 colors) or increasing (from 
2 shapes in 2 colors to 2 shapes in 2 colors of 2 sizes and of 2 heights). If this 
source of variation is combined with degree of pre-training on nonsense substi- 
tutes for meaningful words, then the additional prediction—that number of 
alternatives as a variable has decreasing effect as conditional dependencies 
increase—can be tested. 
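
The two ways of moving from 4 to 16 alternatives can be enumerated directly. The sketch below is only an illustration: the particular shape, color, size, and height values are invented labels, and the figure in bits assumes equally likely alternatives, which the text does not state.

```python
from itertools import product
from math import log2


def stimulus_set(**dimensions):
    """Return every combination of values across the named dimensions."""
    names = list(dimensions)
    return [dict(zip(names, values)) for values in product(*dimensions.values())]


# 2 shapes in 2 colors: the 4-alternative starting point.
baseline = stimulus_set(shape=["round", "square"], color=["green", "red"])

# 16 alternatives with dimensionality held constant (4 shapes in 4 colors).
constant_dimensionality = stimulus_set(
    shape=["round", "square", "triangular", "oval"],
    color=["green", "red", "blue", "yellow"],
)

# 16 alternatives with dimensionality increased (2 x 2 x 2 x 2).
increasing_dimensionality = stimulus_set(
    shape=["round", "square"],
    color=["green", "red"],
    size=["large", "small"],
    height=["tall", "short"],
)

designs = {
    "baseline": baseline,
    "constant dimensionality": constant_dimensionality,
    "increasing dimensionality": increasing_dimensionality,
}
for label, design in designs.items():
    print(label, len(design), "alternatives,", len(design[0]), "dimensions,",
          log2(len(design)), "bits if equiprobable")
```

Both enlarged designs contain the same number of alternatives, so any difference in channel capacity between them would be attributable to the number of dimensions rather than to the size of the set.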

One additional possibility in this line of study may be mentioned, and that 
is its relation to the problem of psycholinguistic units. If stability of the decoding- 
time index for a given condition and abrupt changes in its value for varying 
conditions can be demonstrated, it should then be possible to determine what 
sorts of linguistic variations involve the addition or subtraction of semantic 
decoding units. If changing numbers of phonemes or syllables or even grammat- 
ical morphemes do not change the decoding channel capacity, it would be ap- 
parent that these are not relevant psycholinguistic units as far as semantic 
decoding is concerned. On the other hand, if adding or subtracting lexical mor- 
phemes did regularly produce correlated shifts in decoding time, this would be 
evidence for the lexical morpheme as a semantic decoding unit. It is realized, 
of course, that what has been suggested in these few pages on channel capacity 
represents close to a lifetime of research for the person who undertakes to inves- 
tigate this problem fully. On the other hand, it should not take long to determine 
whether or not the basic experimental notion—that total time can be separated 
experimentally into measurable decoding and encoding times in the manner 
indicated—is itself valid. 


6. DIACHRONIC PSYCHOLINGUISTICS 


In this section we discuss a variety of topics, all of which have in common the 
fact that they involve comparison between two or more stages in language 
development. Attention is first directed toward development of language be- 
havior in the individual member of a speech community, first language learning; 
a general theoretical model of the process is described, a possible experimental 
analogue is suggested, and various research problems are discussed. A second 
topic is second language learning and bilingualism. Although these are important 
problems, they came under only tangential discussion in the seminar and hence 
are only treated briefly here. They are already areas of concentration for many 
specialists. The third general topic in this section is language change. Although 
this term refers to the speech community rather than the individual, it will 
become apparent that the processes at work have their loci in the nervous 
systems of many similarly constituted individuals. The treatment of each of 
these problems is ‘psycholinguistic’ in that relationships between the changing 
structures of messages and changing behavioral organizations of message users 
are stressed, and the underlying commonness of these problems will be apparent 
from the reappearance of identical principles, chiefly learning principles. 




6.1. First Language Learning 


This section attempts to apply learning theory to the development of language 
behavior. The major concern will be with modifications produced by the actions 
of persons in a given language-speaking culture in setting up models of verbal 
behavior, administering reinforcements, etc. This account does not give atten- 
tion to the maturational or genetic features which presumably operate across 
all cultures and may influence rates of development, sequence, individual differ- 
ences, and the like. This, of course, in no way implies that these are not important, 
but reflects, rather, limitations in seminar time and report space. Findings in 
the area of motor skill development mark this as an important area of research, 
and it is being carried on by such men as Jakobson, Leopold, Grégoire, Ohnesorg, 
and Cohen. In other words, we have arbitrarily limited ourselves to the learning 
of language decoding and encoding behavior. 


6.1.1. A Psycholinguistic Analysis of Decoding and Encoding⁵³ 


6.1.1.1. Language decoding. In human communication decoding refers to the 
process whereby certain patterns of stimulation (usually auditory or visual) 
elicit certain representational mechanisms (ideas or meanings) via the opera- 


53 Charles E. Osgood and James J. Jenkins. While members of the seminar were of some- 
what different theoretical persuasions, it was agreed that theoretical differences are not 
critical at this point. This analysis follows Osgood’s mediational theory in the main. For 
alternative (but not necessarily contradictory) views of language learning cf. B. F. Skinner, 
Verbal behavior, William James Lectures, Harvard University, 1947, or Roger W. Brown 
and Don E. Dulaney, A stimulus-response analysis of language and meaning (privately 
distributed). 




tions of a complicated central nervous system. The basic question here is, how 
do certain stimulus patterns (signs) come to represent other stimulus patterns 
(objects), i.e., how are meanings acquired? 

The first steps in the development of meaning, and hence in learning to decode 
the environment, are inseparable from the first steps in the development of 
perception. We infer that intimate ‘knowledge’ about common objects in the 
environment is first obtained from their proximal cues—the sensations of warm 
milk in the mouth and stomach, the feeling of a wooden block in the hand, the 
experience of being cuddled. We further infer that, since distal cues (visual, 
auditory) of objects can antedate their palpable presence, these cues will tend 
to become signs of these objects. The unique visual cues from the infant’s bottle 
will become signs of milk-object, the sounds of mother’s voice will become signs 
of her palpable presence, and so on. The general mechanism here can be seen 
by reference to Figure 14. Total stimulation from the object (S) elicits a complex 
set of reactions (RT); in the case of the baby’s bottle, these reactions would 
include sucking, salivating, swallowing, and so forth. The distal stimuli ([S]) 
which regularly antedate or accompany total stimulation from the object will 
tend to evoke some reduced portion of this total behavior as a representational 
mediation process (rm); in the present instance, sight of the bottle may produce 
anticipatory salivating and lip-pursing movements. The self-stimulation (sm) 
arising from the mediating reaction is the conscious awareness of meaning and 
may become associated with various instrumental sequences (RX), such as 
reaching forward with the hands, vocalizing, and so forth (e.g., encoding mecha- 
nisms). 

Distal cues (perceptual signs) bear a necessary and inevitable physical rela- 
tion to the objects they represent—not the arbitrary, assigned significance 
characteristic of most linguistic signs. Since the distal cues of common objects 
appear in a variety of contexts—at various angles of regard, under various 
illuminations, at varying distances, and so on—but antedating the same be- 
havioral object, these modes of appearance become a class of signs having the 
same significance. This is the phenomenon of perceptual constancy, and it is 
only one instance of the intimate relation between perceptual and meaningful 
processes (cf., section 3.1.1). 

[Figure 14]

In learning the significance of linguistically coded stimuli, the representational 
processes already established in the child’s pre-verbal, perceptual experience 
with common objects and situations are merely transferred (conditioned) to 
those auditory stimulus patterns which adults arbitrarily assign to these objects. 
The typical procedure is for the adult to deliberately or unconsciously direct 
the child's ‘attention’ (orientation of exteroceptive receptors) to some object 
while repeating the vocal sequence which for him labels the object. It is char- 
acteristic of language that the same noise is usually applied to a class of objects 
and situations. The large, green, light-weight beach ball, the small, red, dense, 
rubber ball, the small, white, hard, golf ball and so on are all stimulus situations 
labelled /bohl/ at haphazard intervals. It can be shown that a hierarchy of 
representational mediation processes will emerge from such experiences. The 
strongest and most available decoding habits in the hierarchy of the auditory 
stimulus /bohl/ will be those most frequently elicited by the distal cues of the 
particular objects encountered. Since most ball-objects are round, graspable, 
resilient, and throwable, representations of these common characteristics will 
gradually become the stable significance. It should be pointed out that we are 
dealing here with concept formation. 

6.1.1.2. Language encoding. In human communication encoding is the process 
whereby a speaker's intentions become coded into those vocal reactions which 
produce intelligible sounds in a given language. This is commonly called the 
‘expression of ideas.’ It involves both the formation of complex motor skills 
and their association with representational mechanisms of the sort discussed 
above. 

(1) Development of vocal skills. The development of basic vocal skill compo- 
nents in young infants can probably be viewed as a gradually changing series 
of stages. The first stage we might typify as ‘random’ behavior. We know for 
other easily observed and recorded motor systems that the earliest activity is 
simply a kind of mass activity. The system does everything it is capable of doing, 
as if the motor neurons were firing off indiscriminately. This appears to be true 
of verbal behavior as well. Profiles of sounds produced by new-born infants show 
no differences over racial, cultural, or language groups. The determiners of 
frequency of emission of given sounds appear to be physiological rather than 
situational. When by happenstance the articulators are in a given position, a 
given sound emerges. As the organism develops, a progressive differentiation 
seems to take place (again, as with other motor systems). More and more aspects 
of the verbal production become ‘differentiated out’ and controllable as indicated 
by repetitions, predictable variations and the like. It seems probable that the 
gross features of the behavior are controlled first. Thus, we should expect volume 
and pitch control much earlier than precise articulations. 

As further development occurs, control and the possibility of repetition and 
persistence extends to the fine musculature of the articulators, and at about 
this point we begin to talk about the ‘babbling stage.’ Analysis of sound profiles 
here indicates that differences are evident between infants in different language 
groups. How do these arise? One of the phenomena of learning discussed earlier 
was that of secondary reinforcement. Stimuli which have been present during 
or preceding reinforcement acquire reinforcing power themselves. We may 
assume that parents’ activities and, indeed, the parents’ presence, is reinforcing 
to an infant, and that this reinforcement has been accompanied by verbalization 
on the part of the parents in the sounds of their own language. Thus, the language 
sounds themselves can acquire secondary reinforcing power, e.g., become signs 
of primary satisfactions. We may predict, then, that when a child utters a sound 
like one in the language, this act through its auditory feedback is in itself rein- 
forcing. This response is increased in strength over the utterance of sounds 
which do not appear in the language and hence have less reinforcing power. 
When we say that the strength of an utterance is increased, we mean that its 
probability of appearance is raised, and we would predict that a given sound 
(at this stage) will be made over and over again until it is temporarily extin- 
guished—at which point another rewarding sound will take its place. 

It has long been postulated that a ‘circular reflex’ of some sort accounts for 
babbling behavior (cf. E. B. Holt or F. Allport), but it is questionable whether 
this is needed to explain babbling, and it is obviously not sufficient to explain 
the change in sound patterns toward those of the culture. The writers incline to 
the notion that secondary reinforcement is a necessary and sufficient condition 
to explain this phenomenon. 

Accepting the fact that gratifying speech sounds tend to be selected from a 
larger potential pool of skills, how are these particular spatiotemporal integra- 
tions established and strengthened? Motor synchronizations and sequences 
that occur on reflexive, echolalic, or imitative bases with sufficient frequency 
are a sufficient condition for establishment of proprioceptive feedback systems. 
This makes possible more rapid and stable occurrence, which is the necessary 
condition for organization of these skill components on a central motor level 
(cf., section 3 of this report). In any case, it seems doubtful if phonemes are ever 
produced as isolated units, except by a process of abstraction—babbling (the 
earliest stage of ‘deliberate,’ repeatable encoding of speech sounds) is typically 
syllabic rather than phonemic. 

The language community exercises continuous control over the variation 
which is permitted for any speaker. A person’s language is susceptible to external 
pressures as well as to the internal pressures toward modification. If a difference 
makes a difference (i.e., if it is phonemic), there is a concerted (though perhaps 
indirect) effort on the part of the community to enforce a discrimination— 
people either do not understand or misunderstand. The language community is 
thus a source of direct differential reinforcement. The learner himself has a 
source of control in that he gets continuous feedback from his own productions 
and can compare those productions with those of others in the community. 
Diverse general social pressures may motivate him to conform, or his past 
rewards for conformance to models may hold him within certain distortion 
limits. 

It seems likely that variation will be countenanced by the community where 
the difference does not make a difference in the code. Allophonic variation is 
permitted since the environmental probabilities make clear what is being said. 
In a similar manner the community may tolerate considerable degrees of dis- 
tortion from non-native speakers (foreign accents) when the variation is con- 
sistent, predictable and, in effect, translatable. It is interesting to speculate on 
the effect of variations which might be phonemic in another language but not 
in the one being spoken at the time (say, the intrusion of a click into English). 
It seems likely that such variations would simply have the effect of static or 
‘noise in the system’ and, depending on the degree of disruption of communica- 
tion, might be tolerated. 

(2) Development of semantic encoding. While it is true that certain automatisms 
(reading aloud, reciting number series, and the like) may short-circuit the inter- 
pretive process, most communicative acts, including voluntary speech, are 
largely determined by what we may call the ‘intentions’ of the speaker, for lack 
of a better term. We identify ‘intentions’ with the self-stimulations produced 
by representational mediators. The problem here will be to describe the ways 
in which representational mediators (meanings, significances, ways of perceiving) 
become associated with vocalic skill sequences. 

[Figure 15. (a) Circular "Reflex"; (b) Imitation; (c) Labeling and Semantic Encoding]

We have already noted that the syllabic babbling responses of the infant 
produce auditory feedback or self-stimulation. If the same response is immedi- 
ately repeated, as is the case in babbling, this auditory input will tend to become 
associated with elicitation of the response. This stage in the development of 
semantic encoding is shown as (a) in Figure 15. S, refers to the indeterminate 
conditions which initiate the first occurrence of the vocal skill component (Ry), 
producing auditory self-stimulation (S,). The importance of this stage is that 
it readies the child for imitation of ‘other person’ speech. Once vocalic self- 
stimulation is associated with production of the vocal reaction, auditory stimuli 
received from other people (Ss), particularly if they are similar in stimulus 
quality, will also elicit the reaction, as shown in Figure 15(b). This is a case of 
primary stimulus generalization. If these imitative reactions are rewarded (as 
they definitely are in the average family situation), the child develops a broad 
tendency to imitate other people's speech (cf., Miller and Dollard, Social Learn- 
ing and Imitation, 1941, for details here). Given this tendency to imitate vocalic 
responses, the remainder of the process follows quite simply. As shown sym- 
bolically in Figure 15(c), the adult vocalizes the correct label for a common 
object or situation (S) while directing the child's exteroceptors (usually visual) 
to the appropriate stimuli. The child imitates this sound (R,). If, as is likely 
at this stage in development, the distal stimuli [S] of the object have already 
acquired meaning (cf., earlier section on decoding), the self-stimulation from 
this mediation process (rm → sm) must also be becoming associated with the 
correct vocalic reaction (step 4 in the diagram). This is a unit in semantic en- 
coding, a learned association between an ideational representing process and a 
particular vocalic skill sequence. 

An example is probably in order. Through his babbling practice, the child 
will imitatively produce /boh/ upon hearing his mother say /bohl/—note that 
perfect imitation is not expected, since accuracy depends on the babbling practice 
the child has had. On repeated occasions mothers and other linguistically 
sophisticated individuals hold up, point to, hand over, and handle the object 
BALL while vocalizing the label. Since the distal cues of this object already 
elicit representational mediators (e.g., have significance) in the child—he has 
already played with such objects a great deal—these intervening processes tend 
to become associated with his own imitative vocalization of /boh/. Note how 
the development of such semantic encoding frees the individual from dependence 
upon immediate external cues—any antecedent condition, ideational and motiva- 
tional as well as external, which gives rise to the appropriate mediation process 
becomes capable of eliciting this bit of vocal expression (e.g. desire for the 
object). Being associated with a common mediation process, this vocal skill 
will also transfer to all those signs which elicit the mediator, i.e., the label 
‘spreads’ to all members of the class, ‘ball.’ 

6.1.1.3. Grammatical aspects of decoding and encoding. So far we have dealt 
with pure semantic decoding (the association of representational mediators 
as responses with more or less isolated signs in various modes of presentation) 
and pure semantic encoding (the association of representational mediators as 
stimuli with more or less isolated vocal skill sequences). Most messages received 
and sent by adult communicators, however, involve complex sequences of [...] 
many of which have a largely operative function—the connective matrix within 
which semantic units are studded. This matrix of non-lexical material follows 
complicated but largely unconscious rules; neither speaker nor hearer is 
aware of deliberately selecting or noting word [...] and 
the like, yet these phenomena proceed in [...] predictable [...]. This is 
probably part of what Sapir had in mind when [...] 
patterning of language.’ These rules are the grammar of the language. 




That the decoder is reacting to grammatical information, however, is indi- 
cated by the sharp awareness to error when wrong signals are received. The 
absence of an s-ending on the verbs in the boy who live next door eat the apple 
delivers an error signal to the sophisticated listener. Some process—set in motion 
by the singular noun form, persisting through reception of the verb form, and 
predictive of the nature of this verb form—must be postulated to account for 
this sensitivity to grammatical error. Similar postulation must be made to 
account for the encoder's unconscious precision in ordering and affixing tags 
through long conversational sequences. 

Grammatical facility is here assumed to be a special case of the formation of 
anticipational (decoding) and dispositional (encoding) mechanisms in the human 
nervous system, both dependent upon the frequent repetition of redundant 
events in sequential inputs or outputs. Following Hebb's general notion about 
neural integration,⁵⁴ according to which near-synchronous activities in neigh- 
boring loci tend to become more and more strongly associated by the develop- 
ment of synaptic connections, if two or more input or output events, a and b, 
are both redundant (e.g., occur together or in close sequence) and frequently 
experienced, the central neural representation of one will tend to become a 
condition for the occurrence of the other. Under conditions of very high fre- 
quency and redundancy, a may become a sufficient and hence ‘evocative’ condi- 
tion for the occurrence of b (and certain perceptual errors and illusions may 
result, for example); under lesser degrees of frequency and redundancy, a 
merely becomes ‘predictive’ of b, by lowering the threshold for the occurrence 
of b in competition with other possible events (and increased stability of decoding 
or encoding sequences is thereby provided). 

(1) Ordering grammatical mechanisms. At an intermediary stage in language 
development the child encodes largely in holophrastic units (e.g., ‘pure semantic 
encoding’). Each expression is a content unit. The ‘little words’ (connectives, 
prepositions, articles) are lacking as is grammar in general. Similarly, we find 
that certain aphasic individuals encode in what has been called a ‘telegraphic 
style,’ again lacking connective words and grammatical correctness. The normal 
adult both orders his semantic units into certain arbitrary constructions and 
surrounds them with grammatical ‘cement.’ In the argot of electronic computers, 
we might say that global impulses from the semantic unit feed into a ‘sequence 
timer’ which unreels the message in a certain order and adds certain elements 
according to the rules of its construction. 

Let us take a series of utterances all fitting the same standard syntactical 
construction: (the fat man) (is walking) (on the sidewalk); (a black dog) (was 
running) (after a car); (awful storms) (come) (in the fall). Every time the child 
uses such a construction, perhaps in deliberate imitation of adult speech, definite 
sequences of events take place—adjectival forms are followed by nominal forms, 
these in turn being followed by verb forms, which are themselves followed by 
prepositional phrases. We have here the general condition for formation of an 
ordering type of disposition, whereby activity type a in the motor associational 


54 D. O. Hebb, The organization of behavior (New York, 1949). 




area ‘tunes up’ activity type b, and so on. Having said the big, fat . . . , we experi- 
ence a disposition toward encoding some nominal form; having encoded an actor- 
action phrase, e.g., boys eat . . . , we experience a hierarchy of multiple readinesses 
for several subsequent types of phrases—prepositional (in the woods, with their 
hands, etc.), object (food, meat, etc.), adverbial (rapidly, quietly, etc.), and so 
on. The comparative strengths of dispositions in such hierarchies could be 
estimated empirically by presenting incomplete utterances of the type given 
above. Similar mechanisms operate in sequential decoding. 
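
The empirical estimate suggested here can be sketched as a simple tally of completions. In the example below the response list is invented for illustration (it is not data from the seminar), and the function name `disposition_hierarchy` is hypothetical; the only idea taken from the text is that relative frequency of completion types for a frame such as boys eat . . . indexes the comparative strength of each disposition.

```python
from collections import Counter


def disposition_hierarchy(completion_types):
    """Rank completion types by relative frequency, strongest first."""
    counts = Counter(completion_types)
    total = sum(counts.values())
    return [(kind, n / total) for kind, n in counts.most_common()]


# Invented classifications of how six subjects completed "boys eat ...".
responses = [
    "object", "object", "prepositional",
    "adverbial", "object", "prepositional",
]

hierarchy = disposition_hierarchy(responses)
for kind, strength in hierarchy:
    print(f"{kind:14s} {strength:.2f}")
```

With real completion data the same tally would give one hierarchy per incomplete utterance, and hierarchies could then be compared across frames or across groups of subjects.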

(2) Set mechanisms. In lengthy sequences of encoding the speaker may have 
to maintain and reiterate, for the hearer’s benefit, certain types of information. 
An entire discourse may be cast in some past time, may be concerned with plu- 
rality of number, deal with feminine gender, and require a subjunctive mood. 
It would obviously be convenient for both speaker and hearer to delegate these 
constancies to some lower-level, automatic mechanism. In computer language, 
the mechanism here would be analogous to a kind of ‘locking device’ which sets 
the computer for some repetitive operation, say, to add two zeros to any num- 
ber ending in 5. Similarly, when the ‘past’ oriented human communicator en- 
codes any verb, a dispositional set operates to add some one of the allomorphs 
of -d. 
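
The 'locking device' analogy can be made concrete with the regular English past tense, where the choice among allomorphs depends only on the final sound of the stem. The sketch below is a simplified illustration: the ASCII phoneme labels, the `VOICELESS` set, and the function name are conventions invented here, and irregular verbs are ignored.

```python
# Voiceless stem-final consonants other than /t/ (ASCII labels are ad hoc).
VOICELESS = {"p", "k", "f", "s", "sh", "ch", "th"}


def past_allomorph(final_phoneme):
    """Pick the regular past-tense allomorph from the stem's final sound."""
    if final_phoneme in {"t", "d"}:
        return "id"   # want -> wanted, need -> needed
    if final_phoneme in VOICELESS:
        return "t"    # walk -> walked
    return "d"        # play -> played, hug -> hugged


# Stem-final sounds for a few regular verbs.
stems = {"walk": "k", "play": "ey", "want": "t", "need": "d", "hug": "g"}
for verb, final in stems.items():
    print(verb, "->", past_allomorph(final))
```

Once the 'past' set is operating, the semantic content supplies only the stem; the rule above fills in the ending without further decision by the speaker.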

Set dispositions also participate in hierarchical habit systems. Whenever a 
variety of semantically determined contents (stems or roots of words) channel 
upon the same dispositionally determined suffix, we have a convergent encoding 
hierarchy. Examples would be plurals (lamp/s, root/s, cat/s, stem/s, leave/s, 
boy/s), verb tags (go/ing, sing/ing, play/ing), adverbial tags (smart/ly, 
casual/ly, soft/ly), and so forth. Such convergent hierarchies are the psycho- 
logical condition for generalization (transfer, spread of habit); we can immedi- 
ately see a feasible mechanism here for what linguists refer to as extension by 
analogy. Having established boy: boys, cat: cats and so forth for the common 
signs in early life, this paradigm generalizes promptly to argot: argots and system: 
systems as these new nominal contents are encountered. The same generalization 
mechanism applies to syntactical constructions; a construction formed with 
such simple contents as the doggie eats his dinner quickly generalizes to such 
complicated contents as the linguist studies his corpus assiduously. 

Wherever the same semantically determined content terminates in varied 
dispositionally determined suffixes, we have a divergent encoding hierarchy. All 
of the suffixes which can be combined with a given root morpheme constitute 
such a hierarchy, e.g., play/-, play/s, play/er, play/ing, play/ed, play/fulness, 
play/fully. Since divergent hierarchies are known psychologically to produce 
interference, the question arises as to why interference is largely lacking here. 
Encoders rarely substitute erroneous grammatical endings, saying the child 
player in the field for the child plays in the field, for example. This highlights the 
discriminatory function of dispositional sets in encoding. Interference occurs 
within hierarchies only to the extent that highly similar stimulus situations are 
operative. Note that in each case where a different suffix is applied above, a 
different dispositional set is operative. In other words, in each case the same 
semantic stimulus input is compounded with a distinctive dispositional input, 
making it possible to encode different endings discriminatively. 

It is clear what should happen in cases where both the semantically determined 
content and the dispositionally determined set are the same (or highly similar), 
but a divergent set of suffixes must be encoded. This is the condition for inter- 
ference and hence errors. The prime examples of this occur with the irregular 
forms of a language. With a constant dispositional set (plurality) and semantic 
determinants similar to other regular forms (leg: legs; toe: toes; hand: hands), 
the youngster typically encodes foots as the plural of foot—and similarly for goose: 
gooses; mouse: mouses, and the like. With a constant dispositional set (past tense) 
and semantic determinants similar to regular forms (walk: walked; play: played), 
the youngster typically encodes breaked as the past of break and catched as the 
past of catch. An interesting prediction arises here: since the shift in such di- 
vergent hierarchies is typically from a weaker to a stronger habit, one can pre- 
dict that errors in encoding irregular verbs will be inversely related to their fre- 
quency of usage in the language. Since go is a very high frequency verb, there 
should be less tendency to encode goed as the past form than to encode slayed 
(rather than slew) as the past of slay, a relatively infrequent verb. In other words, 
only relatively high frequency words should be capable of persisting in their ir- 
regularity against the combined onslaughts of regular dispositional tendencies. 
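
The frequency prediction can be illustrated with a toy competition between two habits. Everything numerical below is an assumption: the usage counts are invented, the irregular habit's strength is simply set equal to usage frequency, and the regular disposition is given a fixed strength. The sketch shows only the direction of the predicted relation, not its magnitude.

```python
def regularization_probability(irregular_frequency, regular_strength=50.0):
    """Chance that the regular ending wins over the learned irregular form.

    Assumes the irregular habit's strength equals its usage frequency and
    that the two habits compete in simple proportion to their strengths.
    """
    return regular_strength / (regular_strength + irregular_frequency)


# Invented usage counts per million words, for illustration only.
usage_per_million = {"go": 2000.0, "break": 150.0, "slay": 2.0}

error_rates = {
    verb: regularization_probability(freq)
    for verb, freq in usage_per_million.items()
}
for verb, p in error_rates.items():
    print(f"{verb:6s} P(regularized past) = {p:.3f}")
```

Under these assumptions the rare verb is regularized far more often than the common one, which is the inverse relation between error and frequency of usage stated in the text.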

(3) Congruence mechanisms. There are many characteristics of grammatical 
encoding and decoding that simply reflect regularities in the message itself. These 
are the various correspondences or agreements that are maintained between parts 
of messages—a kind of useful redundancy. Most familiar to English-speaking 
communicators, perhaps, are the congruences set up between nominal and verbal 
forms. In the present tense, a singular noun takes a verb form ending in -s while 
a plural noun takes a verb form with a zero ending (the boy eats; the boys eat). 
We must postulate that such congruence mechanisms have a reverberatory 
‘holding’ characteristic; set in motion by the occurrence of a prior grammatical 
tag, they persist ‘silently’ until a second, corresponding tag is encountered, where- 
upon they release a particular encoding unit and are themselves eliminated. In 
computer language, this operates much like a condenser, one input signal setting 
up a cyclical action which is only released into another channel when another 
input signal of a specified type is received. 

One of the characteristics of reverberatory mechanisms in the nervous system 
is that they tend to extinguish in fairly short order unless reinforced repeatedly 
from some external source. This leads to the prediction that, other things equal, 
the longer the delay between congruent elements in a message the greater the 
probability of error. Whereas the probability of encoding the boys runs fast is 
relatively low, the probability of the boys whose father used to be a track star runs 
fast is greater. Confusion among the prior tags for a congruence relation also 
produces errors, e.g., singular and plural forms occurring close together and prior 
to a verb as also illustrated in the sentence above. 
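
The holding mechanism and its predicted decay can be pictured with a small sketch. Everything in it is an assumption made for illustration: the class and function names are hypothetical, and the exponential fading of the trace (the `retention` value per intervening word) is only one simple way of making error probability grow with delay, which is all the prediction above requires.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class CongruenceHolder:
    """Toy 'holding' mechanism: keeps a number tag until a verb releases it."""
    retention: float = 0.9        # assumed per-word survival of the trace (hypothetical value)
    tag: Optional[str] = None
    strength: float = 0.0

    def see_noun(self, number: str) -> None:
        # A prior grammatical tag sets the mechanism in motion.
        self.tag, self.strength = number, 1.0

    def see_filler(self) -> None:
        # Without outside reinforcement the trace fades with each intervening word.
        self.strength *= self.retention

    def release_verb(self, stem: str) -> tuple[str, float]:
        # The corresponding tag releases an encoding unit; the trace is then cleared.
        form = stem + "s" if self.tag == "singular" else stem
        p_error = 1.0 - self.strength   # assumed chance that the trace has been lost
        self.tag, self.strength = None, 0.0
        return form, p_error


def agreement_error(number: str, stem: str, n_intervening: int) -> tuple[str, float]:
    """Return the congruent verb form and the assumed error probability after a delay."""
    holder = CongruenceHolder()
    holder.see_noun(number)
    for _ in range(n_intervening):
        holder.see_filler()
    return holder.release_verb(stem)
```

Calling `agreement_error("plural", "run", 0)` corresponds to the boys run, and a call with nine intervening words corresponds to the long relative-clause sentence; under these assumptions the second call returns the larger error probability.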

6.1.1.4. Summary. A model for both decoding and encoding operations in 
human communicating has been developed on the basis of a mediational type
of learning theory. On the decoding side, linguistic messages are viewed as se-
quences of auditory (or visual) stimuli which include cues for both semantic and 
grammatical decoding operations; on the encoding side, similarly, the vocalic 
(or orthographic) skill sequences are jointly elicited by stimulus compounds aris- 
ing in both semantic and grammatical levels of organization. The semantic or 
ideational level of organization has been identified with the development of 
representational mediation processes; the grammatical level of organization has 
been identified with the development of anticipational (decoding) and disposi- 
tional (encoding) processes; the motor skill level has been identified with integra- 
tions of sequential activities in the motor projection area itself; and the message 
level comprises either the patterns of auditory and visual stimuli received by the 
communicator (decoding) or the patterns of vocalic and orthographic skill se- 
quences produced by the communicator (encoding). 


6.1.2. An Experimental Analogue for Studying Language Learning²⁵


As a means of studying problems of language acquisition, the seminar tried to 
devise a simplified model of language behavior, such that experimental manipula- 
tion and control would be feasible. The experimental model should include the 
essential characteristics of natural languages, and the following are at least
minimal requirements: (1) The model must have both a complete dictionary and 
a complete grammar. (2) It must be constituted of a hierarchy of units, such that 
every ‘utterance’ is completely organized in terms of any of the levels of the 
hierarchy. (3) There must be units which are sequential and units which are 
synchronous in respect to each other. (4) Operation of the model must involve 
both perceptual, receptive processes and activational, responding processes. 
Corresponding to the aural-oral cycle of speech in natural languages, our model 
will be based on a visual-manual cycle.

If we want to use our model to study acquisition of first languages, we must 
eliminate, or at least carefully control, the possibility of mediation by transla- 
tion. Three suggestions were made as to how this might be done: (1) Use of 
chimpanzees, or lower primates as subjects. This might be too expensive, too slow 
for complex problems, and possibly not parallel to human language learning 
(although it might be of interest in its own right). (2) Human infants might be 
used as subjects, but their general nonavailability for prolonged and rigidly con- 
trolled experimentation makes this possibility infeasible. (3) The solution 
actually adopted is to put adult humans into a non-translation situation, that is, 
where mediation by their own language would be not necessarily absent, but 
constant and trivial. Of course, with adult subjects the problem situation cannot
be identical with the situation of infants learning their first language, at least 
from the point of view of the subjects, but we can control what we believe in 
theory to be the essential factors of the situation, and the model can easily be
changed to follow any necessary modifications in the theory.

The subject (who may suggestively be called ‘infant’) sits before a panel on
which is a set of control knobs or levers. As he sits there, lights of different
colors and in various sequences flash on the panel in front of him. The flashing
of lights is controlled by an experimenter (‘parent’) sitting at a corresponding
panel in another room, unseen by the infant. The parent’s ‘messages’ are
‘meaningful,’ i.e., are in accord with a system of intent as coded in the model
language, but it is probably not necessary that the infant be preinformed that
the light flashes he is watching are so meaningful. He may respond to the flashes
in a variety of ways, but eventually he will probably try moving some of the
controls and see the visual result of his own responses (e.g., ‘feedback’). This
would correspond roughly to the random behavior stage in vocalizing. Some
mechanism of reinforcement (say flashing the word ‘right,’ which we assume has
reinforcing properties for the subject) strengthens such responses. While such
reinforcement may be used initially just to get the subject to ‘say’ something,
later it serves in the discrimination between proper and improper responses for
the code. This parallels the increasingly discriminated reinforcement applied in
language learning.

25 James J. Jenkins and Leonard D. Newmark.

The infant watches the patterns of light on the panel, and with reinforcement 
presumably will attempt to duplicate these, or make any other reinforcible set 
of responses, by manipulating the controls at his disposal. Patterns of increasing 
complexity will be discriminated by the infant, and hierarchy of patterns ac- 
cording to complexity will be established both for his recognition and for his 
motor manipulation. Eventually, we would suppose that he will learn the entire 
code, the model language.

The apparatus consists of two light-boards with controls such that operating 
controls on one board turn on lights on the other board, and if we want visual 
feedback, lights on the same board as well. The effects of getting feedback which 
differs from the parent’s signals can be studied by building in a different set of 
relays of the controls to the two boards. The controls are of two types, corre- 
sponding approximately to natural language segmental and supra-segmental 
features: the left hand operates controls which are held constant over more or 
less larger sequences; the right hand may operate one of two other types—either 
controls (perhaps press bars) which are operated in sequences, corresponding to 
segmental phonemes, or a single complex mechanism with independently but 
synchronously acting elements, corresponding to distinctive phonemic features, 
e.g., a knob which may be pushed or pulled, raised or lowered, moved right or 
left, or twisted left or right, with neutral positions possible for any of these 
operations. Operating these controls produces patterns of light on the boards. 
Visual qualitative differences between infant’s and parent’s patterns may be 
introduced by such modifications as intensity differences or wave-length shifts 
which contrast the lights produced by parent and those produced by infant. 
These differences parallel the non-linguistic features which enable us to
distinguish one speaker's voice from another or others' voices from our own.

Since the model is to be simpler than natural language in nonessential points,
the code may be constructed with a simpler ‘phonology’ (three or four distinctive
features, or perhaps five phonemes, plus two suprasegmental features),
morphology (perhaps 30 morphemes in three classes as determined by phonemic
constituency or position of occurrence in the sequence), and syntax (two
possible orders of morphemes). Of course, the code can be complicated to any
desired degree, and the corresponding effect on learning can be measured.
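
A code of roughly this size can be laid out in a few lines, as the following sketch suggests. The particular choices are assumptions for illustration only: light colors stand in for the five phonemes, every morpheme is three flashes long, class membership is fixed by the initial phoneme (one reading of ‘phonemic constituency’), and the two permitted orders are arbitrary. All names are hypothetical.

```python
import itertools
import random

# Assumed inventory: five 'phonemes' realized as light colors, two suprasegmentals.
PHONEMES = ["red", "green", "blue", "amber", "white"]
SUPRASEGMENTALS = ["steady", "flicker"]
CLASSES = ["A", "B", "C"]
# Assumed syntax: only these two orders of morpheme classes are 'grammatical'.
ORDERS = [("A", "B", "C"), ("C", "B", "A")]


def build_lexicon() -> dict[str, list[tuple[str, ...]]]:
    """Build 30 three-flash morphemes, 10 per class; the class is set by the first flash."""
    lexicon = {}
    for cls, initial in zip(CLASSES, PHONEMES):
        tails = itertools.product(PHONEMES, repeat=2)
        lexicon[cls] = [(initial, *tail) for tail in itertools.islice(tails, 10)]
    return lexicon


def utterance(lexicon: dict[str, list[tuple[str, ...]]], rng: random.Random) -> dict:
    """Compose one 'utterance': a permitted class order under one suprasegmental setting."""
    order = rng.choice(ORDERS)
    contour = rng.choice(SUPRASEGMENTALS)
    morphemes = [rng.choice(lexicon[cls]) for cls in order]
    flashes = [color for morpheme in morphemes for color in morpheme]
    return {"order": order, "contour": contour, "flashes": flashes}
```

Complicating the code, as the text proposes, would amount to enlarging these inventories or adding further permitted orders and then measuring the effect on learning.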

The problem of how subjects can be motivated in the rather laborious set-up 
proposed here was raised. It was generally agreed that we cannot tell until the 
experiments have been tried whether the satisfaction of ‘solving the problem’ 
would be adequate in maintaining the cooperation of the subjects. It may be true 
that children in the real situation are subject to the same boredom as the ‘infants’ 
in the experimental situation, so this factor might well be studied by experi- 
mentally varying the type of reward given the subject for participating in the 
experiments. In general, however, it appears likely that some sort of relevant re- 
inforcement must be amply provided. 

It will be noted that no ‘meaning’ is attached to the code symbols or patterns 
of symbols, as they appear in the experiments outlined above, other than that 
they belong to a code, are recurrent, and are somehow differentially reinforced. 
If ‘referential’ meaning is desired in the system, it can be introduced in a number 
of different ways. One way suggested was to introduce a separate field on the 
light-board—a field containing a moving light. Certain patterns of light in the 
vari-colored message-field could be associated with movements of the light in the
‘semantic’ field, ‘requests’ for action on the moving light, etc. Obviously, any simple
modifiable visual or auditory stimulus could be used as the referent. Since in 
natural language, many complicated conversations are carried on by response 
to linguistic and not referential material completely (note: Hello. How are you? 
Fine, thanks, etc.), it may be that the mechanism of language can be studied for 
our purposes without the many complications introduced by reference to extra- 
linguistic things. Furthermore, as we have previously noted, we may complicate 
and interfere with the learning process by the mediation of translation.

Although designed primarily for study of problems of first language learning, 
the apparatus can be used for study of different sorts of problems. For instance, 
having thoroughly mastered one code, the infant can be given proper instruc- 
tions and run through the experiments again with a different code. In some ways 
this situation can be made to resemble second-language learning situations, but 
it is doubtful that this experimental design is the best suited for study of such 
problems. It has also been suggested that interesting data might result if two 
subjects equally ignorant of any prepared code were placed at the two panels, 
rather than a naive ‘infant’ and a trained ‘parent.’ The nature of the communi- 
cative code, if any, finally adopted by these subjects might shed some light on 
their conception of their own natural languages.

Some problems in language learning that might be studied in this situation are
discussed below. One problem previously referred to is the role of secondary and
generalized reinforcement in language learning, particularly its role in the
approximating of cultural sound patterns which seem to occur in the babbling
stage. In the experimental situation outlined above we might ask what the
likelihood is of a subject repeating a ‘model’ pattern of lights under these
conditions. (1) Repetition when the patterns appear along with [illegible] the
stimulus field. This would presumably give a summary measure of the subject’s
past reinforcement history with respect to imitation in this kind of situation.
(2) Repetition when the patterns appear at the same time that reward is given.
This would give the patterns the status of secondary reinforcers. (3) Repetition
when the patterns appear initially without other stimuli but with reproductions
by the subject being reinforced. We would here have a measure of the rate at
which the subject learns to imitate initially neutral models. At this stage we
would perhaps be most interested in (2), the reproduction of secondary
reinforcers. This case seems to be closely parallel to the postulation earlier
concerning babbling. While, of course, work with adults (even if successful)
would not deny the existence of the circular reflex in infants, it would clarify
the possibility of the alternative explanation and might lead to further, more
conclusive research.

A second group of problems which might be attacked here would be those 
related to certain transitional phenomena. For example, it can be hypothesized 
that set dispositions, ordering dispositions and congruence dispositions (tense, 
order and agreement) are all dependent on frequencies with which sequences 
appear in the code and manifest themselves in the same way in association tests, 
recall situations, etc. Our experimental situation would permit us to study the
growth of such dispositions and to control the actual frequencies in any manner
we choose. Experimental analogues of association tests and recall tests can be
easily constructed.

Further consideration along these lines would lead us to another set of experiments
on paradigmatic development (see section 5.4 for a discussion of the
influences of paradigmatic and syntagmatic relationships upon association). 
One very basic question which might be asked is whether we could build up 
‘use by analogy’ in our experimental model. Suppose in one sequence of learning
trials we presented units singly at first and then in later stages began using the 
units in context. Suppose unit one were rewarded in context one and unit two 
also rewarded in context one. A new context might be introduced and again 
both units rewarded when they appear in the context, and so forth. Then, if unit 
one were presented in context X, would the subject tend to encode unit two in
that wholly new context? This learning-by-analogy is often assumed in descrip- 
tions of language processes but has rarely been attacked experimentally. If this 
can be demonstrated, then another very interesting question arises—will unit 
one evoke unit two in an association test? Here we could have a control over the 
paradigmatic class which is not at all possible in ‘natural language.’ On a broader 
scale, studies of analogy such as these might be extended to verb tenses, sentence 
forms, etc.
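
The analogy design just described reduces to a training schedule followed by a single probe, which might be laid out as in the sketch below. The function name, the labels for units and contexts, and the number of training contexts are all hypothetical; the text specifies only that both units are rewarded in each shared context and that the probe uses a wholly new one.

```python
from itertools import product


def analogy_schedule(units=("unit_1", "unit_2"),
                     contexts=("context_1", "context_2", "context_3"),
                     novel_context="context_X"):
    """Return (training trials, probe trial) for the learning-by-analogy design."""
    if novel_context in contexts:
        raise ValueError("the probe context must not occur during training")
    # Training: every unit is rewarded in every shared context.
    training = [
        {"context": context, "unit": unit, "rewarded": True}
        for context, unit in product(contexts, units)
    ]
    # Probe: unit one is shown in the new context; the question is whether
    # the subject then encodes unit two there without ever being rewarded for it.
    probe = {
        "context": novel_context,
        "presented": units[0],
        "critical_response": units[1],
    }
    return training, probe
```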

These are only a few examples of the kinds of learning studies which need to 
be performed. Whether this situation will approximate language in such a manner 
as to lead to fruitful results is, of course, a matter for empirical confirmation. 
One seminar member objected that this was ‘nothing more than complex in- 
strumental learning’ but that is probably exactly the way an arch-behaviorist 
might describe language itself. In general, we might say that this is a highly 
flexible coding and learning situation which provides for ‘listener,’ ‘speaker,’ and
‘listener-speaker’ situations. It has advantages over the experimental use of an
artificial language in ease of production of the stimuli (it doesn’t need a trained
speaker or phonetician), ease of control of the code, and ease of precise recording
(each response is clearly what it is and can be set to make its own record).


6.2. Second Language Learning and Bilingualism²⁶


When, after becoming a practical expert in his own, first language, a person
starts learning a second language, new sets of decoding and encoding habits are 
being formed in competition with the old. When the bilingual shifts from language 
to language, similarly, two systems of decoding and encoding habits come into 
conflict to a greater or lesser degree. The fact that the same general principles 
found to be important elsewhere in this section also are significant here justifies
discussing second language learning and bilingualism in the present context. 
Since the seminar did not devote much time to these topics, however, only a 
brief sketch of the thinking of some of us on these problems is offered. The reader 
is referred to recent books by Uriel Weinreich and Einar Haugen²⁷ for excellent
treatments, undertaken from the linguistic point of view, but with very
considerable psychological and sociological sophistication.


6.2.1. Compound and Coordinate Language Systems 


Both second language learning and bilingualism involve the acquisition and 
utilization of two linguistic codes. The messages produced in the two or more 
languages employ differently constructed and organized units, different gram- 
matical rules, and different and equally arbitrary lexical systems, excepting oc- 
casional cognates. To the extent that phonemic systems are different, two sets 
of differentiations and constancies on the decoding side and two sets of vocalic
skill components on the encoding side have to be maintained. Since the entire
systems of transitional redundancies in two languages are different, alternative 
anticipational and dispositional integrations have to be established. And since 
the lexical aspects of messages in two languages are different, alternative sets of 
semantic decoding and encoding habits have to be maintained—in other words, 
alternative sets of associations between message events and events in the
representational system, or meanings.

Perhaps because of dependence on the model provided by second language
learning in school situations, many writers seem to have assumed that meanings
are constant in second language learning and in bilingualism. The meaning
of the object HORSE remains the same as perceptually experienced. Hence the
meaning of its alternative linguistic signs, horse/Pferd, must be the same. All
that is involved is two systems of coding the same meaning. This is the case
under certain circumstances, which establish what we shall call a compound
language system. In such a system, as shown in Figure 16, two sets of linguistic
signs, one appropriate to language A ([S]_A), and the other appropriate to
language B ([S]_B), come to be associated with the same set of representational
mediation processes or meanings (r_m → s_m). On the encoding side, likewise, the
same set of representational processes comes to be alternatively associated with
two sets of linguistic responses, one in language A ([R]_A) and the other in
language B ([R]_B). This development is typical of learning a foreign language in
the school situation. It is obviously fostered by learning vocabulary lists,
which associate a sign from language B with a sign and its meaning in language A.
A compound system can, however, also be characteristic of bilingualism acquired
by a child who grows up in a home where two languages are spoken more or less
interchangeably by the same people and in the same situations. In this instance
some compromise representational processes taken from both languages may be
established, with neither having pronounced dominance.

[Figure 16. Two panels: COMPOUND (left) and COORDINATE (right).]

26 Susan M. Ervin and Charles E. Osgood.
27 Einar Haugen, The Norwegian language in America (1953), and Uriel Weinreich,
Languages in contact (Linguistic Circle of New York, 1953).

A very different kind of relation between two languages in the same nervous 
system is what we shall call a coordinate language system. In this case, as shown 
on the right-hand side of the diagram, the set of linguistic signs and responses 
appropriate to one language come to be associated with one set of
representational mediating processes (r_m1 → s_m1), but the set of linguistic
signs and responses appropriate to the other language become associated with a
somewhat different set of representational processes (r_m2 → s_m2). This kind of
development
is typical of the ‘true’ bilingual, who has learned to speak one language with his 
parents, for example, and the other language in school and at work. The total 
situations, both external and emotional, and the total behaviors occurring when 
one language is being used will differ from those occurring with the other. The 
kinds of representational processes developed must then also be different and 
hence the meanings of the signs. This development can also characterize the 
second language learner, who, relying as little as possible on translation and im- 
mersing himself in the living culture of another language community, comes to 
speak a second tongue well.

Even within a coordinate system there may be interference between the two 
sets of processes. Given the likenesses throughout human cultures in the situ- 
ations and objects dealt with by language, it is certain that the representational 
processes elicited by translation-equivalent signs in two languages will often 
be similar. In decoding, this produces a constant pressure on the bilingual to 
confuse meanings, to interpret a sign in language A as its translation-equivalent 
in language B would be interpreted. The more similar the signs—cognates, for
instance—and the more similar the mediators, the greater this pressure will be.
Interference is most likely to occur when the languages are closely related and 
the cultures or the experiences associated with the languages are alike. 

On the encoding side, the more similar the meanings or representational
processes, the more errors there will be. These may consist in delays or blocking of
response, if the alternative responses in the two languages are quite different. 
There may be intrusions of responses from the wrong language, if the items in 
the two languages are similar. These phenomena are often obvious in the com- 
pound system, where identical mediators must elicit alternative responses. They 
may take subtle forms in the coordinate bilingual, resulting merely in minute 
delays or shifts in response frequencies in comparison with monolinguals. Com- 
promise formations usually result, depending upon relative habit strengths in 
the two languages in vocalic skills, lexical associations, and transitional pat- 
terning. 

In spite of the pressures for interference, there are many instances of remark- 
ably pure bilingualism, in which the speaker, once launched in a given language, 
in an appropriate situation, and speaking of events associated with that language,
will experience no difficulties and perform like a monolingual. There are at least 
three general predictive factors to be considered for the coordinate bilingual. 
In the first place, the feedback stimuli from previous utterances in a given
language are more associated with mediators appropriate to that language than
another, unless considerable language mixture has occurred in past usage. 
Secondly, the current interpersonal situation will affect interference in speech 
as much as it does features of style or dialect within one language. Hearer bi- 
lingualism, the relative prestige of the languages, momentary feelings toward the 
hearer will alter the general availability of responses in each language or even 
lead to deliberate use of interference by a speaker. Finally, stimuli arising from 
the scenes, objects, and people present during the formation of a language will 
also be associated more strongly with mediators appropriate to that language. 
Hence bilinguals report that when they are with, or even think about, their 
parents or their home, the parental language becomes more available. A bilingual 
under emotional stress may revert to the language spoken when comparable 
emotions have been experienced in the past. 

For any semantic area we would expect speakers of more than one language
to distribute themselves along a continuum from a pure compound system to
a pure coordinate system. How would one measure or index the location of
particular individuals along this continuum? At one extreme, the meanings of
translation-equivalent signs are identical, and at the other the meanings of
translation-equivalent signs are different. Furthermore, the semantic differences
involved tend to be connotative rather than denotative. The semantic differential
(cf. section 7.2.2) seems particularly appropriate as a tool here. It could be
used to measure coordinateness within a semantic area or to make a general
estimate if appropriate samples could be devised. If a sample of pairs of
translation-equivalent signs were given to a varied group of two-language
speakers for differentiation against an appropriate form of the semantic
instrument, with D between profiles of the pairs computed for each speaker, the
average D (difference in meaning) should vary directly with the degree of
‘coordinateness’ of the language systems within each speaker. The validity of
this measure could be estimated against such criteria as frequency of
interference in ordinary conversation, fluency measures, and translation
facility, to which we now turn our attention.
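
The proposed index can be sketched as a short computation. The passage itself does not define D; the sketch assumes the usual distance measure for profiles on the semantic differential, the square root of the summed squared differences between the two profiles over the scales. The function names and the rating values in the usage note are hypothetical.

```python
import math
from statistics import mean


def profile_distance(profile_a, profile_b):
    """D between two rating profiles for a pair of translation-equivalent signs."""
    if len(profile_a) != len(profile_b):
        raise ValueError("both signs must be rated on the same scales")
    # Assumed definition: Euclidean distance over the rating scales.
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(profile_a, profile_b)))


def coordinateness_index(rated_pairs):
    """Average D over a speaker's sample of translation-equivalent pairs."""
    return mean(profile_distance(a, b) for a, b in rated_pairs)
```

Under these assumptions a speaker who rates each pair identically, e.g. `coordinateness_index([([1, 4, 6], [1, 4, 6])])`, obtains an average D of zero (the pure compound extreme), and larger averages indicate greater coordinateness.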


6.2.2. Translation under Compound and Coordinate Systems 


In the process of translating from one language to another, linguistic signs in 
one language ([S]_A) must be decoded and equivalent or related linguistic
responses must be made in the other language ([R]_B). The behavioral situations are
quite different, depending on (a) whether the translator maintains compound 
or coordinate languages in his nervous system, and, if the former, (b) whether 
he is translating to or from his dominant language. The left-hand diagram in
Figure 17 represents the translation process for a compound language system. 
Solid lines show translation from the dominant language, and dashed lines show 
translation to the dominant language. The right-hand diagram represents the 
translation process for a coordinate language system; the encircled numbers 
represent alternative translating circuits at different stages in the development 
of translating fluency in the coordinate system. 

Compound system translating. When the product of an ordinary foreign language
course in high school, let us say, translates from his native English into the other
language, we have the situation represented by the solid lines in the left-hand
diagram. Encoding the foreign forms involves direct response competition, since
both are associated with a single set of American culture meanings. The ‘same’
mediated stimulus must elicit a response (foreign language output) quite different
from the dominant response (English language output) in the habit hierarchy.
The task would be impossible were it not for differences in the total stimulus
pattern for the translator. Such differences may be brought about by the ‘set’
to translate, the feedback of foreign cues from preceding output, distinctive
dispositional tendencies (once the foreign language grammar and syntax has
become sufficiently learned), and unique associations with the use of the second
language. These cues must be sufficient to counteract the stronger English
response tendencies in the presence of s_m. An analogous situation exists for the
bilingual who has learned to speak both languages in a homogeneous environment.
Encodings in either language in response to inputs in either language may
be equally reinforced. One would expect more or less continuous interference and
intrusion in encoding, in either language. This interference is not due to the
pressure of a dominant language, as in the earlier example, but to lack of
distinctive cues in previous use of the two languages.

[Figure 17. Two diagrams: COMPOUND (left) and COORDINATE (right).]

Theoretically, decoding from a foreign language should be facilitative for a 
compound system, since different inputs are associated with the same representa- 
tional reactions or meanings. To the extent that signs in the two languages are 
similar in form, decoding will be made easier. In a sense, the process of decoding 
the non-dominant language always involves translation, since the representa- 
tional processes of the compound system are appropriate only to the dominant 
language. It is interesting to note that the compound system, whether a result 
of second-language learning or childhood bilingualism, can never translate in 
a true cross-cultural sense, since there is no possibility of comparing meanings 
in two culture contexts when only one system of representational processes is 
present. 

On the other hand, the coordinate system provides true cross-cultural transla- 
tion, but with certain theoretical complications, for the system itself is changed 
by the translation process. Let us take the ideal case of the bilingual in whom
both languages are of about equal availability. As shown in the right-hand side 
of the diagram above, signs in language A must elicit meanings appropriate to 
language community A, due to the processes of learning to decode in a coordinate 
system. However, these language A mediation processes are associated with en- 
coding in the same language. How then does the coordinate bilingual translate? 

It will be recalled that to the extent that human cultures, situations, and
objects are similar, the coordinate system is likely to involve some
representational process in language B that is closely similar to that being
elicited by the input in language A. This process may not be exactly the same as
that elicited by the sign in language A. Translation-equivalent signs in the two
languages may elicit slightly different representational processes due to the
subtle differences in context or connotation which ordinary translations must
lose, e.g. the difference between ‘Weltanschauung’ and ‘philosophy of life.’ Or
there may be several processes in language B which are similar in different
respects to a representational process in language A. Such is the case when a
translator must choose between several partially adequate translations, as in
the various English translations of ‘gemütlich.’ In both of these instances,
simple generalization between s_m1 and s_m2 by pathway (1) may make translation
possible. Since generalization is most probable between maximally similar
representational processes, the result is the encoding of the most adequate
cross-cultural translation.

Another possibility is that pathway (2) between s_m1 and r_m2 is established by
learning, which would be necessary if s_m1 were so closely associated with a
cultural context peculiar to language community A that appropriate
generalization to language B processes was inhibited. Such learning might occur,
for example, if a bilingual is in a situation which elicits s_m1. This may be due
to an appropriate external context, thoughts about events that occurred in the
context of language A, or a conversation with a language A monolingual. For
learning to occur, r_m2 must also be elicited, perhaps because a language B
monolingual describes or refers to situations or objects represented by s_m1 but
using language B signs. Generally, it would appear that a learned association
between s_m1 and r_m2 probably only develops in situations in which some
generalization between s_m1 and s_m2 also occurs. Learning then facilitates the
appropriate translation. Whether through generalization, learning, or both, the
translation is such that the meanings elicited in monolinguals in languages A
and B are as similar as possible.

The ideal translator or interpreter accomplishes a transformation of signs 
through a three-person channel (monolingual A—coordinate translator—monolingual
B) such that the representational processes of all three, or the meanings
of the signs, remain unchanged. Obviously, the more the cultures, or the situa- 
tions and objects discussed differ, the less rapidly the interpreter can encode. He 
is delayed by lack of quick generalization to similar meanings in the other lan- 
guage, or by conflict between several partially appropriate meanings. Inter- 
ference in decoding may be produced by semi-cognates, similar or identical forms 
with varying meanings.

The above description treats the coordinate translator as though he were 
translating for the first time, but the translation process itself brings about new 
learning. Practice may reduce the capacity for cross-cultural translation, as we 
will demonstrate. Three different kinds of short-cuts may develop in the pro- 
ficient translator or interpreter. In encoding, a linguistic response in language 
B must repeatedly occur in close sequence with the representational process ap- 
propriate to language A, as shown by pathway (3). This produces the same kind 
of interference discussed earlier, in the compound system, since sm, may elicit 
different responses depending on whether the speaker is supposed to speak 
language A or to translate it. Note that the representational process of language 
B has not been elicited at all. The second short-cut may consist in direct decod- 
ing into language B by pathway (2). The representational process appropriate 
to language B occurs soon after presentation of the linguistic sign in language 
A when there is frequent translation. This sign in language A must therefore 
become associated with a mediation process appropriate to language B, and it 
will compete with the process appropriate to language A, and, to the extent that 
they are similar, will lead to elimination of the meaningful discrimination in de- 
coding. Finally, the mediation process may be only secondary, the response in 
language B being directly associated with hearing of a sign in language A, with- 
out intervention of a meaning process. This is most likely to happen for simul-
taneous interpreters who always use the same translation for the same word or
phrase. 

If a coordinate bilingual is hired to interpret in one direction only, from language 
A to language B, we must predict that (a) he will gradually develop an appro- 
priate set of translation meanings for the signs in language A, and (b) he will 
lose his ability to speak language A. In other words, he will become a perfect
A-to-B translating machine, while retaining his proficiency in language B—thus,
a sort of dual-input speaker of language B. If a coordinate bilingual translates
frequently in both directions, he must gradually (a) lose the distinctiveness among 
the mediation processes appropriate to each language and (b) suffer increasing 
confusion in encoding. In other words, the very process of two-way translating 
tends to transform a coordinate system into a compound system. It would be interest- 
ing to test these predictions on a sample of professional translators over time, 
using the type of experimental design suggested earlier with the semantic dif- 
ferential. This transformation can be minimized by refreshing the monolingual 
associations in both languages, of course. 


6.2.3. Grammatical and skill levels


We have emphasized the semantic aspects of second language learning and 
bilingualism because these aspects have been relatively neglected. There is a 
great deal of carefully analysed information on purely linguistic aspects of bi-
lingualism, phonemic, morphemic, and grammatical interactions and the like,
and the interested reader is referred to the two sources given at the beginning 
of this portion of the report. For the most part previous work has reported the 
occurrence of certain phenomena without attempting to relate frequencies of 
occurrence to the learning experiences of the speaker. Grammatical aspects should 
be particularly profitable to study from this standpoint. Being organized on 
largely unconscious levels on the basis of transitional redundancies, these aspects 
of encoding should be especially difficult to learn and, once learned, should be 
equally difficult to suppress when trying to master a second language. For the 
coordinate bilingual, on the other hand, two alternative grammatical systems 
once established should provide for greater stability and independence between 
the two language systems. 

We have considered here only the aspects of bilingual speech which reflect
the influence of two linguistic codes on each other, omitting consideration of such
non-linguistic features as pronunciation and style (cf. section 4.1). Yet further
study of these features would probably be rewarding, especially for psychol- 
ogists who would be interested in their sensitivity to differences in the attitudes 
of bilinguals and second-language learners toward the respective speech com- 
munities. 


6.2.4. Research proposals 


(1) Indexing degree of coordinateness of language systems. The possible use of
the semantic differential as a means of determining the degree of separateness
of meanings for translation-equivalent signs has already been discussed. (A com-
parison with responses of monolinguals is suggested in section 7434. This would
determine not only semantic areas in which a compound system exists, but the
language community for which the meanings are appropriate.)

(2) Influence of bilingualism on perception and meaning. One of the writers⁵ is
now engaged in research of this nature. Subjects varying in degree of bilingualism
tell stories in response to the Thematic Apperception Test (a series of rather
ambiguous situational pictures), (a) in language A after preliminary instructions
in that language, and (b) after an interval of several weeks, in language B after
similar preparation. The expectation is that ways of perceiving these pictorial
signs, their meaning to the subject, will vary with the language being used, and
with the degree of coordinateness of the systems in a given subject.

(3) Measuring the transitional proficiencies of second language learners and
bilinguals. The ‘Cloze’ procedure developed by Wilson Taylor and described briefly
in section 5 seems adaptable to problems in this area. Passages in languages A
and B, as translated by maximally facile coordinate translators, could be mutilated
(every fifth word deleted, for example) and given to subjects with varying degrees
of bilingualism or varying amounts of second language training. In the former
case, the more nearly equal the correct ‘fill-in’ scores for languages A and B, the
more ‘truly’ bilingual in the coordinate sense the subject; in the latter case, the
more nearly equal the scores in A and B, the greater the learning of the second
language. This technique has two advantages: first, by its nature it samples all
of the subtle contextual factors of both semantic associational and grammatical-
dispositional levels; second, by using each subject’s own performance in his most
proficient language as a criterion, it eliminates individual differences in
intelligence, language abilities, and the like.

(4) Measuring interference between languages under varying conditions. Three
factors were cited above in accounting for the degree of interference in
encoding—feedback, the interpersonal situation, and differential past experience.
Encoding with these conditions varied can be studied to see how conditions
influence the amount and the kind of borrowing.

5 Susan M. Ervin.
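
The mutilation and scoring steps of proposal (3) can be made concrete with a short sketch. It is offered only as an illustration under stated assumptions, not as Taylor's own procedure: exact-word matching is assumed as the scoring rule, and the names `make_cloze`, `cloze_score`, and `balance_index` are hypothetical. The balance index (ratio of the lower score to the higher) is one possible way of expressing "nearly equal" scores in languages A and B.

```python
def make_cloze(text, step=5, blank="____"):
    """Delete every `step`-th word; return the mutilated passage and the answer key."""
    mutilated, answers = [], []
    for position, word in enumerate(text.split(), start=1):
        if position % step == 0:
            mutilated.append(blank)
            answers.append(word)
        else:
            mutilated.append(word)
    return " ".join(mutilated), answers


def cloze_score(answers, responses):
    """Proportion of blanks filled with exactly the deleted word (assumed scoring rule)."""
    if not answers:
        return 0.0
    hits = sum(a.lower() == r.lower() for a, r in zip(answers, responses))
    return hits / len(answers)


def balance_index(score_a, score_b):
    """Hypothetical index: lower score divided by higher; 1.0 means equal scores."""
    high = max(score_a, score_b)
    return 1.0 if high == 0 else min(score_a, score_b) / high


passage, key = make_cloze("the quick brown fox jumps over the lazy dog today")
print(passage)                      # the mutilated passage shown to the subject
print(cloze_score(key, ["jumps", "now"]))
```

Because each subject's score in the stronger language serves as the denominator, the index compares a subject only with himself, which is the second advantage claimed for the technique.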


6.3. Language Change 


Languages change in response to both internal dynamics and external pres- 
sures. Psycholinguists are interested in both processes, but the present analysis 
concerns the former. The existence of similar forces operative under similar con- 
ditions everywhere in language is indicated by the existence of a limited number 
of types of change which reoccur in different, historically unconnected languages 
and at different chronologic periods. Linguists have accumulated an enormous
amount of authenticated information relating to such changes and have been 
able to formulate a number of principles regarding the ‘hows’ of specific changes. 
The psychologist, in line with his general orientation, is typically interested in 
the ‘why,’ that is, in the isolation of general principles of behavior underlying 
shifts in linguistic habits. No doubt the interplay of factors in any particular 
instance is too complex to allow of complete explanation in the foreseeable future, 
but this does not mean that there are no general laws whose combinations limit 
the possibilities to the point where at least statistically probable outcomes may 
be hypothesized. 

A striking fact about language change is that although both changes of form
and those of meaning are always proceeding simultaneously, each can be ex-
tracted by analysis as a set of separate and practically independent processes.
Nevertheless, they do impinge on each other at a few points. Each of these two
major areas of change, the formal, in which will be included both phonological
and grammatical changes, and the semantic, will here be given separate treat-
ment. In each of these two major areas, tentative generalizations concerning
the facts of change based on linguistic data will be followed by a discussion of the
psychological mechanisms which may be suggested as operative in bringing them
about.

6 Joseph H. Greenberg, Charles E. Osgood, and Sol Saporta.


6.3.1. Formal change 


6.3.1.1. Linguistic facts. Changes in the phonemes, the basic units of the sound 
systems of language, may involve (1) replacement of one phoneme by another,
(2) loss of a phoneme, (3) transposition or metathesis, (4) insertion of a phoneme. 
These types have been stated in decreasing order of frequency of occurrence, 
replacement being by far the most common. Changes in phonemes may further 
be classified as regular or sporadic. A change is regular if it affects all instances
of a given phoneme under specified conditions. Otherwise it is sporadic. Examples 
of regular changes are the following: In early Aramaic þ in all instances was re-
placed by t; in (probably) 18th century Hausa, a West African language, s was
replaced by š in all instances where e, e', i, or i' followed. An example of a sporadic
change: In the development of Spanish from Latin, r was replaced by l in the
word arbol ‘tree,’ earlier arbor. In cases of sporadic change, reference to specific 
instances is unavoidable. 

Regular changes are, in turn, divided into unconditioned and conditioned. 
In unconditioned changes, all instances of a phoneme are affected without limit-
ing conditions. A conditioned change involves only some of the occurrences of a
phoneme under stated limitations in terms of other phonemes or positions in the 
utterance. In the above paragraph, the first change mentioned is unconditioned, 
the second conditioned. Unconditioned changes tend to occur in sets. What is 
involved is the replacement of some one distinctive feature common to a number 
of phonemes by another distinctive feature in all its occurrences. Sometimes such 
changes are, as it were, reversed in midstream. After affecting one or more
phonemes, the older form reasserts itself in some instances and the result is some- 
what checkered. An example of such a mass shift is Grimm’s First Law, a state- 
ment of consonant changes from Proto-Indo-European to Proto-Germanic. For 
instance, /p > f, t > þ, k > x/ among other changes, that is, in the three instances
cited, a stop feature was replaced by a fricative feature.
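
The notion of a mass shift as the replacement of one distinctive feature by another can be sketched as follows. The feature table is a deliberately reduced, hypothetical one (place and manner only, voicing ignored), and `shift_manner` is an illustrative name; the point is simply that one rule stated over a feature yields all three changes cited.

```python
# Hypothetical two-feature description: phoneme -> (place, manner).
FEATURES = {
    "p": ("labial", "stop"),
    "t": ("dental", "stop"),
    "k": ("velar", "stop"),
    "f": ("labial", "fricative"),
    "þ": ("dental", "fricative"),
    "x": ("velar", "fricative"),
}
# Reverse lookup: (place, manner) -> phoneme.
BY_FEATURES = {features: phoneme for phoneme, features in FEATURES.items()}


def shift_manner(phoneme, old="stop", new="fricative"):
    """Replace the manner feature `old` by `new`, leaving place untouched."""
    place, manner = FEATURES[phoneme]
    if manner != old:
        return phoneme          # phonemes lacking the feature are unaffected
    return BY_FEATURES[(place, new)]


print([shift_manner(p) for p in ("p", "t", "k")])
```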

Two types of results may be distinguished here: either a phoneme or group of
phonemes through a change in feature may give rise to sounds which did not
previously occur in the language, or change may be to sounds which already
exist. In the former eventuality, no change in phoneme inventory (i.e., number of
phonemes) or distribution (i.e., occurrence in particular forms) results. In fact,
some would call such a change phonetic, rather than phonemic. In the latter
case, we have the phenomenon of merger which involves a reduction in phoneme
inventory. The following empirical generalizations are offered with regard to
merger. Each is understood as being preceded by ‘other things being equal’:



(1) The more uncommon a phoneme is in human speech in general, the more 
likely it is to be merged with another phoneme. (2) The lower the frequency of 
a phoneme in a given language the more likely it is to merge with another pho- 
neme, providing this second phoneme is not itself of excessively high frequency. 
(3) The closer the points of articulation shared by two phonemes the more likely 
they are to merge. (4) The more distinctive features shared by two phonemes the 
more likely they are to merge. (5) The fewer the pairs of different linguistic forms 
which are distinguished by the phonemes, the more likely they are to merge. 
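
Generalization (5) presupposes a count of the pairs of forms which a given contrast keeps apart. A minimal sketch of such a count is given below. It assumes, purely for illustration, that each character of a form stands for one phoneme and that the small word list is the whole lexicon; `minimal_pairs` is a hypothetical helper, not a procedure described in the text.

```python
def minimal_pairs(lexicon, p, q):
    """List pairs of forms that differ only in having p where the other has q."""
    forms = set(lexicon)
    pairs = []
    for form in sorted(forms):
        for i, segment in enumerate(form):
            if segment == p:
                counterpart = form[:i] + q + form[i + 1:]
                if counterpart in forms:
                    pairs.append((form, counterpart))
    return pairs


# Toy lexicon; spelling is used as a stand-in for phonemic transcription.
lexicon = ["pin", "bin", "pit", "bit", "tin", "sin", "pan"]
print(minimal_pairs(lexicon, "p", "b"))   # contrast carrying two pairs
print(minimal_pairs(lexicon, "t", "s"))   # contrast carrying one pair
```

On this toy list the p/b contrast distinguishes more pairs than the t/s contrast, so that by generalization (5), other things being equal, the latter would be the likelier candidate for merger.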

Conditioned changes result typically from conditioned allophonic variations 
in which by loss or change of the original conditioning factor a formerly non- 
significant contrast becomes phonemic. For example, in the transition from Proto- 
Indo-European to Sanskrit we reconstruct the following stages: 


1. ka ke ki ko ku 

2. ka če či ko ku (phonemically as in stage one. We now have an allophone [č] which
occurs before e and i. There was no [č] in the language previously.)

3. ka ča či ka ku (a merger of o and e with a has produced a phonemic contrast ka vs. 
ča which did not exist before). 
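
The three stages can be replayed mechanically, which makes plain why [č] is merely allophonic at stage two but phonemic at stage three. The sketch below assumes consonant-plus-vowel syllables only; the function names and the predictability test are illustrative devices, not part of the reconstruction itself.

```python
def palatalize(syllable):
    """Stage 1 -> 2: k becomes č before the front vowels e and i."""
    if syllable[0] == "k" and syllable[1] in "ei":
        return "č" + syllable[1:]
    return syllable


def merge_vowels(syllable):
    """Stage 2 -> 3: e and o merge with a."""
    return syllable[0] + syllable[1:].replace("e", "a").replace("o", "a")


def predictable_from_vowel(syllables):
    """True if the consonant is fully determined by the vowel that follows it."""
    seen = {}
    for s in syllables:
        if seen.setdefault(s[1:], s[0]) != s[0]:
            return False
    return True


stage1 = ["ka", "ke", "ki", "ko", "ku"]
stage2 = [palatalize(s) for s in stage1]
stage3 = [merge_vowels(s) for s in stage2]

print(stage2, predictable_from_vowel(stage2))
print(stage3, predictable_from_vowel(stage3))
```

At stage two the choice between k and č can still be read off the following vowel; at stage three it cannot, since both occur before a, and the contrast has therefore become phonemic.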


Another type of result ensues if a phoneme undergoes a conditioned change but
the resultant of the change already exists. In this case, there is partial merger. 
The old instances of the phoneme which were not affected continue but the new
variant merges with another previously existing phoneme. It is clear that this 
type of change affects the distribution but not the inventory. In recent Russian 
o changed to a under conditions of lack of stress. Since stressed o continued and 
a already existed, no new phoneme was added to the language. In general, con- 
ditioned change is the diachronic aspect of the synchronic problem of condi-
tioned allophonic variation.

The following general facts about regular conditioned changes must be con- 
sidered: (1) The conditioning factor is more often a phoneme which follows 
rather than one which precedes. (2) The conditioning factor is almost always im-
mediately following or preceding. Sometimes a vowel is affected by that of a
following or preceding syllable. More remote conditioning factors are very rare. 
(3) The change usually results in an articulation which is more like the condi- 
tioning phoneme. That is, it is assimilative rather than dissimilative. In general, 
then, changes result in a sequence of articulation which abbreviates or eliminates 
movements. For example, the fronting of a k to č before i eliminates the forward
movement from back to mid position. (4) Final positions of syllables, words and 
utterances are often conditioning factors for change, initial positions only rarely.
Changes with final position as their conditioning factor are typically those which 
result in loss or merger. This, of course, results in fewer phonemes in these posi- 
tions. We know of few languages which have more distinct phonemes in syllable 
final than in syllable initial position. 

Sporadic changes have the following characteristics: (1) Certain sounds are most
frequently affected—liquids (r, l), nasals (n, m) and sibilants (s, š). (2) Dis-
similation is probably as frequent as assimilation, contrasting with its rarity in
regular changes. (3) The conditioning factor often operates at a distance. Ex-
ample: Latin peregrinus > Italian pellegrino ‘pilgrim.’ Here the succession of
two r's has resulted in the dissimilation of the first to l even at a distance. The 
connection of such changes with speech-lapses, championed by Sturtevant, is 
highly plausible. 

As has been stated, regular sound changes proceed in general without regard 
to the meanings of the forms in which they occur. However, it seems likely that 
regular changes are encouraged or inhibited depending on their consequences 
for grammatical structure. Thus, although both Germanic and Romance lan-
guages have stress, the unstressed vowels have tended to merge in Germanic
languages but to remain distinct in Romance. This may well be connected with 
the fact that unstressed syllables in Germanic are typically limited to non-root 
morphemes, while in Romance languages the root may sometimes be unstressed, 
e.g., Italian ‘amo ‘I love’ but a'mo ‘he loved.’ In turn, the merger and loss of 
final vowels in Germanic languages has had repercussions on the grammatical
system, in that distinctions based on difference of inflectional morphemes which
became homonymous had now to be expressed by other, syntactic, means.

The chief process of morphological change is analogy. (1) By alternations be- 
tween morphs of the same morpheme one pattern is replaced by another, usually 
more common; or (2) the alternation is effaced completely by extension of one 
of the forms to the alternant. Such changes are often stated in the form of an 
analogical proportion. Examples of both types of change are (1) the replacement 
of ‘brought’ by ‘brang’ (sing: sang = bring: brang) and (2) ‘calfs’ for ‘calves’ 
(cliff: cliffs = calf: calfs). The extension of a formative, typically derivational 
affix, to a combination in which it had not previously existed, or the formation 
of a new compound out of existing elements may likewise be viewed as a kind of 
analogical process. Thus the new form draftee can be expressed as the consequence 
of an analogical extension (employ: employee = draft: draftee). The less fre- 
quent types of morphological change such as folk-etymology and blending are not 
considered here specifically. They may be looked upon as partial analogical 
processes. 
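
The analogical proportion a : b = c : x lends itself to a simple mechanical statement. The sketch below is one naive way of solving it over spellings, assuming that the difference between a and b lies in a single stretch and that c shares the ending of a; `solve_proportion` is a hypothetical name, and no claim is made that speakers proceed in this manner.

```python
def solve_proportion(a, b, c):
    """Solve a : b = c : x by carrying the a -> b difference over to c.

    Returns None when c does not end in the stretch that a -> b alters.
    """
    # Longest common prefix of a and b.
    i = 0
    while i < min(len(a), len(b)) and a[i] == b[i]:
        i += 1
    # Longest common suffix that does not overlap the prefix.
    j = 0
    while j < min(len(a), len(b)) - i and a[len(a) - 1 - j] == b[len(b) - 1 - j]:
        j += 1
    old = a[i:len(a) - j]      # stretch of a that is replaced
    new = b[i:len(b) - j]      # stretch of b that replaces it
    tail = a[len(a) - j:]      # shared ending after the altered stretch
    ending = old + tail
    if not c.endswith(ending):
        return None
    stem = c[:len(c) - len(ending)]
    return stem + new + tail


print(solve_proportion("sing", "sang", "bring"))
print(solve_proportion("cliff", "cliffs", "calf"))
print(solve_proportion("employ", "employee", "draft"))
```

The three calls reproduce the proportions cited in the paragraph above; a form such as "walk" yields no solution to sing : sang, since it lacks the ending on which that alternation operates.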

6.3.1.2. Certain general hypotheses relating to change in form. For convenience
in discussion, principles relating to formal language change may be taken up in 
two categories: (1) those relating to the locus (in utterances) of change; (2) those 
relating to the process of change.

(1) Principles Relating to the Locus of Change. The evidence for generality
described above makes it clear that changes in language structure are not hap-
hazard as to locus—to the contrary, there are definite ‘stress points’ within
messages as sequentially unreeled. Are there any general principles, applicable
cross-linguistically, which would make it possible to specify ‘stress points’ and
hence predict the most probable locus of changes?

I. Short-circuiting. Features of subsequent phones will tend to be anticipated
wherever possible. The limits suggested by ‘wherever possible’ are at least the fol-
lowing: (a) To the extent that an articulatory feature of a subsequent phone is
incompatible in the motor sense with that of an antecedent phone in its im-
mediate environment, the tendency toward short-circuiting will be reduced;
(b) To the extent that an articulatory feature of a subsequent phone would 
change the phonemic (i.e. code) character of an antecedent phone in its im- 
mediate environment, the tendency toward short-circuiting will be reduced. In 
this case, speaker ‘lapses’ (see below) will tend to be corrected by hearers. The
basis for short-circuiting within the rapidly executed and tightly bound phone 
sequences that constitute syllabic units probably lies in the nature of skill forma- 
tion—the central programming of neural events in the motor cortex is much 
more rapid than the sequential execution of movements, resulting in overlapping 
excitation patterns and a tendency to anticipate. 

II. Perseveration. Features of antecedent phones will tend to persist wherever
possible. Again, ‘wherever possible’ is limited at least by (a) articulatory in- 
compatibility, and (b) production of significant (e.g., to hearers) phonemic 
changes. In the latter connection, it should be noted that shifts from one dis- 
tinctive feature to another are not necessarily significant—there may not exist 
in the language any meaningful unit (e.g., word) corresponding to the modified 
signal even though it is phonemic. The basis for perseveration of this kind would 
seem to be something akin to motor inertia or ‘least effort’; it is simply easier 
for the rapidly operating encoder to persist or remain in whatever articulatory 
feature happens to be held than to change it. Due to such perseveration, there 
should be a general tendency, other things equal, for phonemes displaying a 
certain feature to be followed by phonemes displaying the same feature in a 
given language. Obviously, this tendency cannot be carried to its logical conclu- 
sion, or the code becomes uniform and hence meaningless to the hearer. Else- 
where in this report (section 5.3), Saporta describes some computations on se- 
quential phonemes in English which bear on this matter. 


The combination of short-circuiting and perseverative principles in skill execution 
leads to the following expectations: (a) That the positions of instability in a language 
with reference to these factors should be those in which the phones both antecedent and 
subsequent to a given phone include one or more features in common. On this basis, for 
example, one would predict that voiceless consonants would be more common in initial 
and terminal positions in words than in medial positions where they would often be sur-
rounded by voiced vowels. (b) That where a given phone is bounded by antecedent and
subsequent phones displaying different features within the same dimension (e.g., voicing, 
tongue position, lip position, etc.), this phone should be characterized by transient or 
intermediary features between those in its environment. The general fact that subsequent 
phones have more effect upon sound changes than antecedent phones implies that the 
short-circuiting mechanism is stronger quantitatively, but this does not derive directly 
from any theoretical notion. The specification of conditions operative in determining 
which factor will be dominant remains extremely obscure. There is no doubt a tendency 
for intervocalic unvoiced consonants to become voiced—we can at least predict that this
is more likely to happen than, for example, that they will become glottalized. Still, many 
languages go through very long periods during which intervocalic voiceless stops remain 
entirely stable and we are as yet unable to specify what factors, linguistic, cultural or 
otherwise, determine when the change will take place and when not. 


A number of principles derive from analysis of transfer and interference in
sequential materials. A great deal of experimental data⁷ justify the following
summary statements.

III. Convergent hierarchy. When a variety of antecedent states (e.g., stimuli) 
converge upon a common subsequent state (e.g., response), transfer is positive and
retroactive effects are facilitative, the degree of facilitation varying directly with the 
similarity among the antecedent states. This sets the general condition for exten- 
sion by analogy at the grammatical level, e.g., having learned play-ed, walk-ed, 
and fix-ed it becomes easier to transfer to crawl-ed, digest-ed, and master-ed, as
well as for what might be called error by analogy at any level, e.g., to the ex- 
tent that a given subsequent acquires high frequency with certain antecedents 
it should tend to generalize or transfer to other antecedents. The psychological
fact that the degree of facilitation varies with similarity among the antecedent 
states should also have its evidence in language behavior. Certainly it is easier 
to extend by analogy, from boyhood, manhood, and priesthood to coinages like 
babyhood, warriorhood, and scouthood (semantically similar antecedents) than 
it would be to stonehood, lighthood, or hillhood (semantically dissimilar ante- 
cedents). On the phonemic level, until appropriate data have been analysed, 
one can only hypothesize that high frequency subsequents should tend to gener- 
alize among similar antecedents, e.g., there should be a tendency for the initial 
phonemes in sets with common terminus to be separated by fewer distinctive 
features than chance would dictate (time, dime and lime, rhyme, for example, 
are the only sets in English having this particular medial and terminal, and t/d 
and l/r are separated by few distinctive features). In this connection, it would be 
useful to have a frequency list of syllables in English as well as other languages. 

IV. Divergent hierarchy. When a common antecedent state diverges upon a variety
of subsequent states, transfer is negative and retroactive effects are interfering. This
is the general psychological condition for competition among responses and errors.
Having learned to make one reaction to a stimulus, it becomes more difficult to 
substitute another reaction to the same stimulus. Observe the following: 


Initial         Medial                        Terminal
(Convergent)    (Divergent to Convergent)     (Divergent)
pin             pin                           pin
sin             pan                           pit
tin             pen                           pick


Even for the simple task of rapid repetition, initial (convergent), medial (di-
vergent-to-convergent), and terminal (divergent) sets become increasingly
difficult in that order. On the same ground, one would expect ‘stress points’ to 
be located in positions of divergence rather than convergence, both at phonemic 
(e.g., /t/ tending toward /d/ in terminal and medial positions in SANE 
American) and at grammatical (e.g., irregular nouns or verbs) levels. In all such
cases, shifts should be in the direction of the stronger (i.e., more frequently used)
habits in these divergent hierarchies; /t/ should be under greatest stress to shift
toward /d/ in those positions where the total probability of /d/ is higher, and
‘weak’ irregular verbs should be more susceptible to error than ‘strong’ irregular
verbs. The latter prediction has been verified for irregular verbs in English, a
significant negative correlation being obtained between frequency of errors in
writing past tense forms and frequency of usage in Thorndike-Lorge lists.

7 See C. E. Osgood, The similarity paradox in human learning: A resolution,
Psychological Review 56. 132-143 (1949).


In the present paradigm it should be noted that the greater the similarity among the
divergent responses, the less the interference. This requires some clarification: With in-
creased response similarity there is, to be sure, greater intrusion (substitution) of one
response for the other; on the other hand, there is less interference in the sense of block- 
ing, or failure to respond, and in terms of latency of response. This effect is maximal with 
reciprocally antagonistic reactions, where the learning of one is accompanied by inhibi- 
tion of the other. This also has interesting implications for language change: In the first 
place, similar phonemes in terminal positions should tend to merge, e.g., intrude upon 
each other with such frequency and unpredictability that the difference would lose its 
distinctiveness. This would help explain the general fact that there is greater diversity
in initial position than in terminal position. A similar phenomenon should apply to gram- 
matical affixation—there should be greater diversity among prefixes (and hence differ- 
ential semantic or lexical significance) than among suffixes (where automatic grammatical 
significance should be the rule). In the second place, antagonistic reactions in competing 
divergent positions should tend to interfere with each other in the sense of blocking and 
increased latency. The combination of these two factors—merger among closely similar 
reactions and reciprocal blocking among antagonistic—leads to the expectation that the 
sets of phonemes that appear in given positions following a constant should tend toward 
an average separation in distinctive features, neither too similar nor too disparate, when 
frequency is studied. 

These psychological considerations offer to shed some light on the typological linguistic 
problems relating to the contrast between prefixing and suffixing languages. Since inflec- 
tive and derivational elements are few in number compared to the morpheme membership 
in root position, the general considerations relating to transitions from divergent hier- 
archy (in this case root classes) to convergent hierarchies (derivational and inflectional
morpheme classes) apply here. In accordance with the principles just discussed, there is
greatest facilitation when the divergent hierarchy is followed by the convergent. On this 
basis, one would expect suffixing languages to be more frequent than prefixing ones, a fact 
noted by Sapir. One would also expect the prefixing languages to be more fusional (i.e., 
irregular) and suffixing languages to be more agglutinational (i.e., mechanical) in their 
morphophonemics. The consideration advanced is that in prefixing languages the difficult 
transition from a class with few members to a class of wide choices will be ameliorated to 
the extent that there are special variant forms (alternants) each restricted to a small 
number of subsequent roots. Indeed, in limiting cases this becomes in a sense a single 
choice since the enunciation of a prior element which can only be followed by some single 
subsequent, commits one to this subsequent in advance. The longer this process goes on 
the more the prior element becomes fused with the subsequent until it ceases to be an in-
dependent morpheme.

We might therefore advance the developmental typological thesis that prefixing lan- 
guages tend towards the isolating type. The evidence both for this and the hypothesis of 
greater irregularity for prefixing languages needs careful examination before any defini- 
tive conclusion can be drawn. Our impression regarding this latter thesis of the more 
fusional nature of prefixing languages is that it holds in general. Very striking is the total 
absence to our knowledge of nominal inflectional elements in prefixing position, where 
the disparity of hierarchies is greatest. In support of the thesis of the development of
isolating from prefixing languages, there is striking positive evidence in five cases and no
contrary instances of which we are aware. Annamite, Chinese, Thai, Zapotec (Mexico),
and Ewe (West Africa) are all classical isolating, monosyllabic languages and all are re- 
lated to languages of a prefixing type of the Austroasiatic, Sino-Tibetan, Thai-Malayo- 
Polynesian, Oto-Manguey, and Niger-Congo families respectively. 


The empirical fact, noted earlier—that the smaller the number of features
separating two phones, the greater the probability of change from one to the 
other—seems to be incorporated in the hierarchy analysis above. 

Another principle relating to the locus of formal language change derives from
information theory:

V. Sequential redundancy. The more redundant, i.e., predictable, the occurrence 
of one message element from knowledge of the occurrence of another, the greater the 
probability of modification of one of them. Information theory in itself does not 
indicate the precise locus of change, but since messages are unreeled in only one 
direction, it seems likely that susceptibility to modification should be greater 
in the terminal members of such redundant sets. This, incidentally, should be a 
special condition for loss of a phoneme in a language as compared to change. It 
will probably be necessary to distinguish between inherent redundancy, e.g.,
where the physiology of the articulatory processes requires it, and incidental
redundancy.

(2) Principles Relating to the Process of Change. The process of language change 
is probably best conceived in terms of the total communication act, involving
continuous functional interaction between speakers and hearers. In this process 
the speaker is the petitioner for changes (cf., Sturtevant’s notion of ‘lapses’) 
and the hearer is the judge who, via social feedback or differential reinforcement, 
either allows or refuses to allow each particular modification. Given the existence 
of many ‘stress points’ in a language, speakers are under more or less continuous, 
if unconscious, pressures toward modification; to the extent that the process 
of effective communication in a group is or is not hindered by such modifications, 
the language will change.

I. Production of changes. Individual speakers of a language will tend to produce
changes in those positions and of those types indicated above in proportion to the de- 
gree of stress under which they are communicating. By ‘stress’ here is meant any 
condition of the speaker that reduces his attention to his own self-feedback. Pre-
sumably young children in the process of learning language are particularly prone
to predictable errors; there seems to be evidence on the rate of change in lan-
guage suggesting modification in terms of generations of speakers. Rapid speech,
speech under fatigue or any other debilitating condition, speech
as it occurs in popular songs, and so forth will be special conditions facilitating
change.

II. Social feedback. To the extent that a speaker modification (a) makes a dif-
ference in the code, (b) is not redundant, and (c) occurs in a position of high informa-
tion value with respect to appropriate behavior, the hearer will tend to re-
inforce the existing ‘correct’ form. The social relation between the participants
in the communication act is also involved here; parents and elders are more
likely to correct children and youths than vice versa, for example. What we are
dealing with here are the conditions under which a hearer is likely to notice an
erroneous or missing signal and evince this by either a checking verbal response
(“What did you say?”) or by unexpected behavior from the speaker’s point of
view. If I ask my son for the nail and am handed the pail, to use a crude illustra- 
tion, I am likely to say, “No, I said nail,” with appropriate emphasis and clear 
articulation of the initial phoneme. Obviously the factors indicated above are 
interactive: a phoneme shift which makes a difference in the code may never-
theless be passed by the censor if it occurs in a position of low information value
in the semantic sense and/or if it is redundant with respect to its phonetic en-
vironment (e.g., carried by the allophonic variation in surrounding phonemes).
Similarly, a shift may occur in a position of high information value if it does not 
change the code (or, in doing so, does not produce a different word) and/or if 
it is in redundant relation to its environment. As young individuals in a language 
community learn the language as well as the culture, they develop self-correcting 
tendencies based on self-feedback. However, this self-correction is clearly de-
pendent upon the pattern of differential reinforcement received from other
members of the community and hence should follow the same principles. 

III. Strengthening. Uncorrected modifications, being reinforced as parts of total 
communicative acts, become stronger habits and compete more and more effectively 
with ‘correct’ habits. The mere fact of occurrence of changes in predictable loci, 
e.g., non-random modifications, indicates the existence of underlying readinesses 
at these points. If we accept the general notion that effective communication is 
typically rewarding (needed objects are brought to the speaker, his social goals 
are accomplished, and so forth), then it follows that all stimulus-response se- 
quences contributing to the total communicative act will tend to be reinforced. 

IV. Generalization. As the habits producing modifications at ‘stress points’ be-
come stronger, these new response tendencies will generalize or spread to other posi-
tions, initially to similar antecedent environments (e.g., similar stimuli) and thence
gradually to all environments. This analysis implies that all changes begin their 
careers as ‘sporadic,’ tending to become ‘conditional’ changes under appropriate 
conditions, and eventuating as ‘unconditional’ changes. There is considerable 
doubt among linguists as to the validity of this notion—most apparently assum- 
ing ‘sporadic,’ ‘conditional,’ and ‘unconditional’ changes to be different in kind— 
but the empirical evidence needs to be re-examined from this point of view.²¹
The ‘appropriate conditions’ under which sporadic modifications become condi- 
tional and conditional changes unconditional are frequency conditions. The 
present analysis would assume that where changes of a sporadic nature are re- 
corded, the initial occurrence was in a position of maximal ‘stress’ or predictability 
from both environment and competing habit structure, but the tendency toward 
generalization was blocked by stronger regular or ‘correct’ habits in other similar 
environments. Similarly, the restriction of a conditional change to its specific 


²¹ There is also a matter of definition involved here. We are assuming that most ‘sporadic’
changes occur in environmental stress points of the sort discussed, while many linguists
use this term for changes of unlawful, haphazard sorts with respect to environmental
factors.




environments suggests that continuing generalization tendencies were blocked by 
stronger regular habits in other environments in which the same speech sounds 
occur. These strong regular or ‘correct’ habits represent ‘frequency mountains’ 
over which the generalization tendencies cannot pass. Presumably various 
‘strong irregulars’ (such as certain nominative plurals like child-children and 
certain verb forms like go-went) are relics of the language’s past and survive the 
onslaughts of generalizing changes because of their high frequencies of usage. 
It also follows from all that has gone before that positions of emphasis—loudness, 
initial positions of utterances, positions of high information value, and so forth— 
will be the most resistant to change since these are positions where modifications 
produce checking reactions from hearers in the language community.

V. Social change. Language change in a community will be gradual and cumula- 
tive, representing a continuous changing proportion of individuals who do or do not 
hear and produce a particular feature or set of features. The process of change in 
the community would most probably be represented by an S-curve. The rate 
of change would probably be slow at first, appearing in the speech of innovators, 
or more likely young children; become relatively rapid as these young people be-
come the agents of differential reinforcement; and taper off as fewer and fewer
older and more marginal individuals remain to continue the old forms. On an
empirical level, it should be possible to make a comparative study of forms used 
as a function of age and other sociological variables. It was suggested that the 
rate of change may be a function of the size of the language community; it is also 
undoubtedly a function of the status of the communication system. The nature of
language change within the individual is a difficult question—some linguists feel 
that this is an all-or-nothing matter akin to mutation, whereas most psycholo- 
gists feel that there should be a period, at least, of oscillation between competing 
forms. Perhaps, in a manner akin to imprinting in birds, individuals never change 
in the features they hear and produce after early childhood experiences, language 
changes being purely a matter of sociological shift in the composition of the group. 
Again, empirical data would have to be collected with this question in mind. 
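
The S-curve mentioned above can be sketched numerically. The following is only an illustration, assuming a logistic form for the curve; the function name, the midpoint, and the rate are hypothetical values chosen for the example and are not given in the text.

```python
import math

def adoption_proportion(t, midpoint=50.0, rate=0.1):
    """Proportion of the community using the new form at time t.

    Assumes a logistic curve; midpoint and rate are illustrative
    values only, in arbitrary time units.
    """
    return 1.0 / (1.0 + math.exp(-rate * (t - midpoint)))

# Slow start, rapid middle phase, tapering end.
for t in (0, 25, 50, 75, 100):
    print(t, round(adoption_proportion(t), 3))
```

Under these assumed parameters the change per unit time is largest near the midpoint, which corresponds to the stage at which the young innovators have become the agents of differential reinforcement.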

Information theory also generates a very general prediction concerning the 
direction that language change should take at any given stage. Information 
theory techniques are readily applicable to phonological changes since we have
a unit, the phoneme, which is sharply limited in number for each language and
hence susceptible to counting on the basis of texts or lexicon. The same considera-
tion applies to the distinctive features into which phonemes may be analysed.
In employing information theory concepts here, two alternatives present themselves.
We may compare two or more stages of the same language for which we have
records or we may compare related languages on the basis of
an ancestral phonological system reconstructed by the well-established tech-
niques of historical linguistics. The comparisons would be in terms of entropy
estimates.

Our general hypothesis would be that there are two factors which in-
fluence change in a phonological system: a tendency toward efficiency and a
competing tendency toward redundancy (cf., Zipf's notions of
economies). A phonemic system may be considered as efficient to the degree that 
all combinations of distinctive features are utilized in the phonemes, that all 
phonemes are equi-probable, and that their occurrences are independent of 
neighboring phonemes. In fact, however, there can never be maximum efficiency 
—not only would articulatory difficulty of certain combinations of features and 
of certain sequences always be limiting factors, but channel noise would seriously 
diminish the comprehensibility of such a system. Perhaps, also, the speed of 
transmission of semantic units would exceed the channel capacity of the decoder.
For these reasons, we may think of a language as maintaining an unstable balance 
between efficiency and redundancy factors. 

VI. Entropy balance. The more the entropy of a given language system deviates
from that representing balance between efficiency and redundancy factors the greater
will be the tendency to change in the direction of balance. This ‘homeostatic’ prin-
ciple obviously requires some statement about the balance point. If we give
speaker and hearer equal weight in the communication situation, then the balance 
point would be expected to be 50 per cent efficiency. Synchronically, the calcula-
tion of entropy measures of phonological units for an adequate sample of con-
temporary languages should show something like a normal distribution about this
50 per cent balance point. Diachronically, similarly, a sample of the stages of a 
given language through time should show a normal distribution about 50 per 
cent efficiency. Finally, if at a given stage a language is either well above or well 
below the mean 50 per cent entropy level, we should expect to find either a de- 
crease or an increase in entropy to characterize the subsequent process of change 
in that language. 

A phonemic system may be considered efficient to the degree that the number 
of distinctive features needed to describe the number of phonemes approaches 
the minimum of log2 n, where n symbolizes the number of phonemes in the
system. For example, a system of 32 phonemes, using 10 distinctive features,
would have an efficiency of exactly 50 per cent, since it could be done with only
five distinctive features (log2 32 = 5) under conditions of maximum efficiency.
We may define the redundancy as 1 — E (efficiency). A system of 32 phonemes 
and 8 distinctive features would be 62.5 per cent efficient and 37.5 per cent re- 
dundant. The general hypothesis is that both the average efficiency of different 
languages synchronically and of the same language diachronically for different 
stages in its history should approximate the same mean value when this measure 
is applied. 
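
The efficiency measure just defined is easy to compute. The sketch below simply restates the definition in code; the function names are our own and are not part of the original analysis.

```python
import math

def efficiency(n_phonemes, n_features):
    """Minimum number of binary features (log2 n) divided by the number actually used."""
    return math.log2(n_phonemes) / n_features

def redundancy(n_phonemes, n_features):
    """Redundancy defined as 1 - E, following the text."""
    return 1.0 - efficiency(n_phonemes, n_features)

# The two worked cases from the paragraph above.
print(efficiency(32, 10))   # 0.5
print(efficiency(32, 8))    # 0.625
print(redundancy(32, 8))    # 0.375
```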

Unfortunately, very few languages have been analysed in terms of their distinctive 
features. Nevertheless, the following data display a surprisingly close approximation to 
predictions: The phonemes of English have been analysed as being 28 in number and re-
quiring 9 binary oppositions.²² The efficiency of this system then is 4.80/9 or 53 per cent.
Russian phonemes, on the other hand, are 42 in number and employ a total of 11 distinc-
tive features.²³ The efficiency of the Russian system is therefore 5.38/11 or 48.9 per cent.


²² Jakobson, Fant, and Halle, Preliminaries to speech analysis (Cambridge, 1952).
²³ Cherry, Halle, and Jakobson, Toward the logical description of languages in their
phonemic aspect, Language 29.34-46 (1953).




The efficiency of modern Spanish, as shown in the table below, is 50.9%. The tendency of
three contemporary languages to cluster about 50 per cent efficiency is apparent. The only
available presentation of analyses of successive stages of the same language is that of E.
Alarcos Llorach²⁴ for four periods of Spanish. The inventory of phonemes, distinctive
features, and efficiency estimates is given in the following table:


Stage    Phonemes    Distinctive features    Efficiency
I        21          9                       4.39/9 = 48.8%
II       24          8                       4.58/8 = 57.3%
III      27          10                      4.75/10 = 47.5%
IV       24          9                       4.58/9 = 50.9%


The average of these stages is 51.1 per cent and there is a cyclic trend about this value,
both of which are consistent with the hypothesis as applied to diachronic data. Needless 
to say, no secure conclusions can be drawn from such scanty evidence, but the smallness 
of the deviations from predictions is striking. 
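
The tabled efficiencies can be checked from the phoneme and feature counts alone. The following is a minimal sketch using only the counts given in the table; the variable and function names are our own, and the mean it yields agrees with the reported figure to within rounding.

```python
import math

# (stage, phonemes, distinctive features) as given in the table above
stages = [("I", 21, 9), ("II", 24, 8), ("III", 27, 10), ("IV", 24, 9)]

def efficiency_pct(n_phonemes, n_features):
    """Efficiency in per cent: log2 of the phoneme count over the feature count."""
    return 100.0 * math.log2(n_phonemes) / n_features

values = {name: efficiency_pct(n, f) for name, n, f in stages}
mean_pct = sum(values.values()) / len(values)

for name, pct in values.items():
    print(name, round(pct, 1))
print("mean", round(mean_pct, 2))
```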

The same prediction, interestingly enough, can be reached on a priori grounds within 
information theory itself. If we have n distinctive features of binary form, we would ex-
pect 2^{n} possible phonemes. In addition we would expect the selection of a sub-set
of phonemes which are maximally discriminable. If the distinctive features were dimen-
sions of a similarity space, we would expect the phoneme regions to be maximally distant
(i.e., maximally discriminable). Such a set would be in diagonally opposite regions. If
there are n distinctive features, then there would be 2^{n-1} maximally distinguishable re-
gions. Then the ratio of existing phonemes to possible phonemes would be

    2^{n-1} / 2^{n} = 1/2 = 50 per cent.


The general notion just described says that languages should tend to change in
such ways as to return toward a balance between efficiency and redundancy
factors, but it does not specify what types of changes would accomplish these
ends. In general, it seems likely that phonemic merger should tend to increase the
entropy and hence efficiency of a language system (by reducing the total number
of phonemes relative to the same number of distinctive features) while phonemic
split should tend to decrease the entropy of a language system. This, of course,
assumes that the merging phonemes vary in frequency and the result of pooling
them will be a more even distribution and hence an increase in absolute entropy.
In other words, the preceding analysis assumed approximately equal use of the
phonemes in a language. If we had data at our disposal on phoneme sequences,
similar predictions could be made with regard to transitional measures, namely,
an increase in entropy as a result of merger and a decrease in entropy as a result
of splitting.
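
One way to make the effect of merger concrete is with a toy frequency distribution. The sketch below is only an illustration: the frequencies are hypothetical, not data from any language, and the measure used (Shannon entropy divided by its maximum for the given inventory size, here called relative entropy) is our assumption about the kind of measure intended.

```python
import math

def entropy(probs):
    """Shannon entropy in bits of a probability distribution."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

def relative_entropy(probs):
    """Entropy divided by its maximum, log2 of the number of categories."""
    return entropy(probs) / math.log2(len(probs))

# Hypothetical relative frequencies of five phonemes (not real data).
before = [0.40, 0.30, 0.20, 0.05, 0.05]
# Merger of the two rare phonemes into a single phoneme.
after = before[:3] + [before[3] + before[4]]

print(round(relative_entropy(before), 3))
print(round(relative_entropy(after), 3))
```

In this toy case pooling the two rare phonemes gives a more even distribution over a smaller inventory, and the relative measure rises. The sketch says nothing about unnormalized entropy, which behaves differently under pooling.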


6.3.1.3. Proposed experiments. A number of research suggestions are embodied
in the theoretical analyses above. Additional proposals may be noted: (1)
Prediction of merger and splitting from entropy measures. The information theory
analysis just given suggests certain rather obvious tests. For example, measures
of relative entropy for Javanese, which has maintained most Proto-Austronesian
consonants intact, should be compared with those for Hawaiian or Samoan, in


²⁴ Fonología española (Madrid, 1950).


which there has been widespread merger. The process of splitting could be tested 
by studies of consonant frequencies in the transition from classical Latin to 
Italian, during which the number of consonant phonemes increased. (2) Re- 
analysis of historical data. There is an immense amount of evidence available on 
language change. This should be restudied and sampled in such a way as to test 
the various hypotheses that have been suggested above. Do we, for example, find 
that changes tend to occur initially in those positions where anticipatory and
perseverative environmental factors combine to modify the intervening phone?
Do we find that changes are more probable in terminal and medial positions than 
in initial? Would a fine enough time series reveal the predicted generalization of 
changes? (3) Contingency analysis of languages in general. Numerous typologies of 
world languages have been made, but usually without any clear purpose. The 
various principles discussed above generate certain predictions as to what lan- 
guage characteristics should (a) appear together and (b) shift from one into the 
other historically. A matrix in which the columns were defined by characteristics 
(agglutinative, tonal, etc.) and the rows by a random (or perhaps exhaustive)
sample of world languages could be analysed to determine empirically what 
characteristics tend to appear together and what languages tend to be related to 
more than chance degrees. This analysis presupposes further development and 
validation of the specific quantitative indices for each characteristic which have 
recently been advanced. (4) Experimentally produced lapses. There are a number 
of ways in which contemporary speakers can be placed under stress and the locus 
and nature of their lapses recorded and checked against theory. Some of the pos- 
sibilities are: (a) enforced rapidity of speaking, with tape recording of the results 
that can then be stretched for analysis; (b) detailed analysis of the spontaneous 
speech of children of various ages; (c) speech under the conditions of delayed feed- 
back—will the loci of disturbances be predictable from principles such as those 
above?; (d) analysis of the changes that occur in popular singing; (e) sampling 
of deliberate ‘humorous’ modifications, e.g., ‘speakers’ to ‘speagers,’ elicited
from native speakers on request.
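
The contingency analysis proposed under (3) can be sketched in a few lines. What follows is a minimal illustration under stated assumptions: the language labels and the matrix entries are hypothetical, and the phi coefficient is our own choice of association measure, not one specified in the text.

```python
from itertools import combinations

# Hypothetical language-by-characteristic matrix (1 = present); not real data.
languages = {
    "A": {"prefixing": 1, "isolating": 0, "tonal": 1},
    "B": {"prefixing": 1, "isolating": 0, "tonal": 1},
    "C": {"prefixing": 0, "isolating": 1, "tonal": 1},
    "D": {"prefixing": 0, "isolating": 1, "tonal": 0},
    "E": {"prefixing": 0, "isolating": 0, "tonal": 0},
    "F": {"prefixing": 1, "isolating": 0, "tonal": 0},
}

def phi(x, y):
    """Phi coefficient for two equal-length 0/1 lists (a 2x2 contingency table)."""
    a = sum(1 for i, j in zip(x, y) if i and j)
    b = sum(1 for i, j in zip(x, y) if i and not j)
    c = sum(1 for i, j in zip(x, y) if not i and j)
    d = sum(1 for i, j in zip(x, y) if not i and not j)
    denom = ((a + b) * (c + d) * (a + c) * (b + d)) ** 0.5
    return (a * d - b * c) / denom if denom else 0.0

traits = ["prefixing", "isolating", "tonal"]
columns = {t: [row[t] for row in languages.values()] for t in traits}

# Association between every pair of characteristics.
for t1, t2 in combinations(traits, 2):
    print(t1, t2, round(phi(columns[t1], columns[t2]), 2))
```

With real data the same table could be read by rows instead of columns to ask which languages resemble one another to more than chance degrees.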


6.3.2. Semantic Change 


6.3.2.1. Linguistic facts. Change in meanings is as constant a feature of lin-
guistic history as change of forms. Changes in the semantic area of language
are in many instances motivated by the introduction of new cultural items or 
changes in old ones. The basic responses to such situations are usually one of the 
following: (1) Borrowing from another language. (2) Extension of an old term 
used to designate something similar either formally or functionally. (3) Coinage 
of a new term, often by compounding or derivation from previous morphemes. 
A kind of borrowing in which a new formation is based on the traditional re- 
sources of a language but modelled after a formation in a foreign language is 
sometimes called a calque. The following examples will illustrate these processes. 
English borrowed street from Latin (via) strata to designate the paved roads 
which were new to the Anglo-Saxons. It extended the meaning of the already 
existing verb writan ‘to scratch,’ to include runes on the bark of trees. English
formed a new word railroad out of two existing morphemes. An example of 
calque is the German Fall in the meaning of grammatical case. It is a translation 
of the Latin casus which has this same meaning in addition to the literal meaning
of ‘falling.’ It in turn is a loan translation of the Greek ptosis, literally, ‘a falling.’
Very many changes in meaning can be shown to occur without any precipi- 
tating cause in non-linguistic cultural change. These changes like those listed 
above can be classified as (1) coinages, new forms with new or old meanings
whether completely new (noggin) or novel combinations of preexisting elements
(big shot). (2) Meaning shifts in preexisting forms. (3) Obsolescence (e.g., loss of 
the term whither). Meaning shifts can practically all be covered by the term 
metaphor. The various figures of speech of traditional rhetoric, synecdoche, 
metonymy, etc. name a process by which a term is extended through various asso-
ciations, e.g., part for whole, whole for part, specialization, generalization, weak-
ening, elevation, degradation, etc. The nonce metaphor of the poet or of the or-
dinary speaker becomes a meaning shift if it spreads to the rest of the speech-
community. If the older meaning becomes obsolescent then a complete shift
has taken place, usually with an intermediate period in which both senses
exist. More often the various meanings all continue in use, some being viewed
as primary, others as metaphorical. The prevailing polysemy of words and other
linguistic forms is the synchronic result of this diachronic process.
Once a shift has taken place, the result may be a chain reaction. The new
meaning of the particular form which has undergone a shift may have been 
covered by some already existing term. This second term may become obsoles- 
cent, or may specialize in some narrower meaning, or in turn shift with further 
results. For this reason a change in meaning should never be considered in isola-
tion but rather in its effect on a set of forms with related meanings. This is the
concept of the semantic field, which has been dealt with especially in Europe.²⁵
Specific parallel changes of meaning tend to occur with high frequency in widely
separated areas. Such meaning changes can usually occur in either direction.
Thus ‘sun’ has become ‘day’ or ‘day’ has shifted to ‘sun’ independently in many 
languages. Often the same term is used for both, and if historical reconstruction
is not possible we cannot tell in which direction the shift occurred. Such instances
should be assembled and correlated with word association data (see below).

As in other aspects of historical study we can utilize either historical evidence
proper or that derived from comparison of related languages. A form which is
the continuation of an older form in the same language, or shares a common
origin with one in a related language as shown by resemblance in form and
meaning, is called a cognate. Examples of cognates are (1) Old English stan
‘stone’ and modern English stone in the same meaning. (2) Modern English
bone and German Bein ‘leg.’ In this case English has generalized while German
keeps the earlier restricted meaning.

²⁵ See particularly Jost Trier, Der deutsche Wortschatz im Sinnbezirk des Verstandes; die
Geschichte eines sprachlichen Feldes (Heidelberg, 1931).

[Figure 18: encoding, message, decoding]

6.3.2.2. Theoretical analysis of meaning change. We can make it clear at the outset
that ‘semantic change’ in the present context refers to change in the meaning
of a linguistic sign (or in the semantic state associated with a linguistic sign)
and not to change in the significance of referends (objects) themselves as distally 
perceived. In other words, when the object MOUTH comes to be called mouth 
rather than cheek, we do not assume that the meaning of this object to human 
communicators has changed, but rather that the arbitrary linguistic sign by 
which communicators refer to this object has shifted. 

This situation is described in learning theory symbols below and in Figure 18.
The distal perceptual sign (Ṡ1), deriving from an object (S1), elicits its appropriate
mediation process (r_m1 --> s_m1) in the encoder, but rather than mediating
the encoding of the original linguistic reaction ([R]1), it now mediates a different
linguistic reaction ([R]2). There is no change in the significance of the perceptual
sign (e.g., the meaning of MOUTH), but there is a new encoding unit associated
with this object via the mediation process. Similarly, on the decoding side of the
communication equation, the original linguistic sign ([S]1) as a message event
no longer elicits a mediation process (r_m1 --> s_m1) capable of mediating behavior
(R_X1) appropriate to the same object (S1), but another linguistic sign ([S]2)
does elicit this process. Processes previously associated with the new message
event, [R]2 ~ [S]2, if there were such, are indicated by brackets, as are the dis-
placed message events.

Even a casual survey of materials of semantic change as noted above indicates 
that these shifts are not haphazard. Rather, the referends of the members of a 
set of cognates tend to be closely related semantically. What general mechanisms 
of semantic change can be derived from theory which might account for this 
lawfulness? 

(1) Association transfer. The regularities of physics, biology, and culture com- 
bine to enforce certain transitional dependencies in sequences of signs. Thus 
sun to warmth, eye to see, man to work, and vice versa and so forth. As shown 
in Figure 19, such redundant sequences of signs provide conditions for establish-
ing central associational connections (s_m1 --> r_m2). Such intraverbal or associative
connections are a major determinant of reactions in free association tests.
However, as shown by the arrow (1) connecting [S]1 with r_m2, this transitional
redundancy also provides a condition for a shift in meaning, such that the prior
sign comes to signify what was initially an associate. Similarly, on the en-
coding side, under conditions of high redundancy the representational process
associated with the prior sign will tend to become an eliciting condition for en-
coding the subsequent linguistic unit, as shown by the other arrow (2) con-
necting s_m1 with [R]2. This represents a response competition situation in both
decoding (competing mediators) and encoding (competing expressions), e.g.,
divergent hierarchies. To the extent that the mediating reactions r_m1 and r_m2
are similar, the subsequent will tend to intrude in place of the antecedent (e.g.,
if ugly-duckling were a high frequency combination, the heard sign ugly might
also tend to elicit the representational process originally associated with the
sign duckling, and ugly would be acquiring a different meaning). To the extent
that the vocalic expressions [R]1 and [R]2 are similar, the subsequent will tend to
intrude in place of the antecedent (e.g., if bright-light were a high frequency
combination, the representational process characteristic of brightness should
tend to elicit the encoding unit light; the encoder will have a tendency to sub-
stitute the linguistic sign light, as in “It’s a very light day,” in situations ap-
propriate to the original meaning of bright). Both of these processes, mediator
substitution and vocalic substitution, have the same end result, a shift in the
sign associated with a particular mediation process. Of course, numerous addi-
tional factors would be operative in determining the probabilities of such shifts,
for example, whether the redundant items are in same or different grammatical
classes, relative frequencies of usage, and so forth.

[Figure 19]

(2) Situational context redundancy. Again by virtue of physical, biological, and 
cultural regularities, the distal signs associated with certain objects will co- 
occur with high redundancy. When looking at CHEEK one will nearly always 
see MOUTH also; when reacting to the COLOR of an object one will also be 
perceiving its SHAPE. If the male nobility are nearly always seen on horse-
back, horseman will tend to acquire the meaning of nobility or gentleman (cf.,
Span. caballero); if the material of which something is made is redundantly and
rather exclusively experienced in context with its function, the name for the
material may substitute for the function (cf., nickel). As shown below, the greater
the frequency of co-occurrence in common situational context of two distal
signs, and hence the greater the probability of co-occurrence of their appropriate
mediational processes, the greater will be the tendency for substitution of one
encoding unit for the other (as shown by the dashed arrows). Note that the same
shift in encoding units could be accomplished by a change in meaning of the
distal signs (e.g., confusion and substitution between r_m1 and r_m2); but this need
not be the case: the word originally used for JAW can be substituted for that
for MOUTH without implying any change in how people perceive these parts
of the physiognomy. The likelihood of mediator confusion will depend upon
their similarities, and this in turn upon the cultural differentiation between
these objects (e.g., to the extent that specific operations, such as chin-beard
styling or chin guards for battle, apply to JAW as separate from MOUTH,
mediator confusion should be reduced). This distinction between mediator and
vocalic skill unit shift verges on the Weltanschauung problem (cf., section 7).
It can also be pointed out that situational context redundancy of the sort de-
scribed here imposes difficulties in the transmission of language from one gen-
eration to another, and thus further increases the likelihood of language change:
when an adult refers to, and even ‘points at’ his LIPS, the child is quite likely
to be reacting in terms of his MOUTH.

[Figure 20]

(3) Physical stimulus generalization. This is a very straightforward mechanism 
and simply refers to the tendency for a reaction, in this case mediational, to 
spread from one stimulus pattern to others, as a function of their physical 
similarity. Thus the word for SUN should tend to generalize to the object MOON 
as distally perceived and vice versa; the word for the human EYE should tend 
to generalize to similar appearing knots in wood, keyholes in doors, and slots 
in needles; the word for FLOOR should tend to generalize to other level ex- 
panses, such as PLAIN and ROAD. The greater the habit strength of a particular 
mediational and encoding sequence (e.g., the greater its frequency of occurrence), 
the greater should be its capacity for generalization. Thus, if the name of one 
carnivorous animal (cat in English) acquires a very high frequency of usage as a 
result of environmental factors, it may generalize so widely as to substitute for 
carnivorous animals in general. 

(4) Mediated generalization. The mechanism just described is one means 
whereby a class of stimulus patterns, in that case physically similar, may become
associated with the same label. Mediated generalization is another mechanism
but does not require physical similarity. If one object, as distally perceived
(e.g., LEG of a person), acquires a certain significance (e.g., ‘stand on’) and
another object (e.g., that which holds up a table) acquires through independent
conditioning a very similar signification, then the instrumental reactions as-
sociated with one sign (in this case, the vocalic encoding unit leg) will tend to
transfer via mediation to the other sign. Other illustrations would be the transfer
of the term ship to space traveling objects (spaceship) in science fiction, the
use of terms like chicken and filly to refer to young girls, and reference to head
as the top or most important part of something. Some such process as this
presumably is basic to metaphor in general. 

6.3.2.3. Research proposals on semantic change. (1) A first step by way of
research would be to summarize and categorize (perhaps in terms of the mechanisms
described) the various types of semantic changes that have occurred in language
families. This should be done across language families as well. Of course, much
of this work has already been done and merely needs to be analysed. (2) Given
the above collection of information on historical semantic changes, it would be
interesting to compare these data with those obtainable from association tech- 
niques, using, of course, contemporary subjects. The first mechanism described 
above, association transfer, involves reactions to linguistic signs, and the usual 
free association technique should yield data most directly comparable to these 
types of language change. The other three mechanisms above, situational context 
redundancy, primary and mediated generalization, involve reaction to the distal 
or perceptual signs of objects, e.g., labeling operations, and a technique used by 
Karwoski, Gramlich and Arnott, among others, where people associated to
objects themselves as perceived distally, should yield relevant data. The
experimental question is this: to what extent can one demonstrate significant
correlation between the associations made to words and objects and evidence on
semantic changes that have occurred? Will the types of changes that have been
known to occur in languages with considerable frequencies be matched by high
frequencies of appearance in association tests? Demonstration of this sort would 
lend general support to the hypotheses generated here. (3) Study of the verbal 
paraphasias of aphasics—the encoding units substituted in the search for the 
‘correct’ units—would be interesting in its own right as well as instructive in
the present context. Do verbal paraphasias, when collected in sufficient numbers, 
also parallel free association and object association frequencies? (4) The
conditions for each of the mechanisms described can be manipulated experimentally
under laboratory conditions. For example, one can deliberately vary (a) the 
sequential redundancy of signs, (b) the contextual (situational) redundancy, 
(c) the physical similarity of objects, and (d) the development of common 
significances for dissimilar objects, all in a situation using nonsense materials, 
and, after training in labeling, measure the errors produced and their natures, 
the difficulties in training itself, the changes that occur in retention of labels, 
and so forth.


76 Karwoski, Gramlich, and Arnott, Journal of Social Psychology 20. 233-47 (1944).




7. SYNCHRONIC PSYCHOLINGUISTICS II: MACROSTRUCTURE 


Within the general organization of this report into synchronic, sequential,
and diachronic problems, the reader has probably noticed a general trend from
the molecular toward the molar. Although all three types of problems span this
continuum to some extent, the trend was particularly evident in synchronic
psycholinguistics, and for that reason we have made an arbitrary segregation of
synchronic problems into ‘microstructure’ (section 4) and ‘macrostructure’
(section 7). In this section, then, we are concerned with relations between
larger segments of messages and grosser psychological correlates. The effect of
motivational states upon encoding and decoding is one such relation (7.1). The
persistent and important issue of meaning is another (7.2). Referring as it does
to relations between events in messages (signs) and events in behaving organisms
(representational processes), it clearly falls within the area of synchronic
psycholinguistics, but we limit ourselves here to an analysis of various ways of
defining meaning, and the measurement of certain aspects of meaning. A third
problem which seems to fall best in this category is that of information
transmission (7.3), both between individuals using the same code and between
individuals using different codes (e.g., translation). Finally, we have a little
to say about the very complicated and ill-defined area of language, cognition,
and culture, which includes the
so-called ‘Weltanschauung’ problem. Here we limit ourselves to an attempt to 
clarify somewhat the nature of the problem and to presentation of concrete re- 
search studies, some completed and some proposed. It was the consensus of the 
seminar that arm-chair theorizing on this problem has about reached the satura- 
tion point and what is needed is experimental data collecting and hypothesis 
testing. 


7.1. Effects of Motivational States upon Decoding and Encoding⁷⁷


In the ordinary, everyday uses of language much of the information trans- 
mitted concerns emotional states, attitudes, and motives of both speakers and 
hearers. Malinowski used to refer to this as ‘phatic communion.’ In this section 
we wish to analyse both some of the ways in which the motivational states of 
hearers can influence their decoding of messages and some of the ways in which 
such states of speakers can influence their encoding of messages. As will be seen, 
motivational states can influence discrete, linguistic aspects of messages as well 
as continuously variable, non-linguistic aspects, although the latter are pre- 
sumably more susceptible to such effects. These are distinctly psycholinguistic 
problems, concerning relations between states of communicators and states of 
messages. 


7.1.1. Learning Theory Propositions 


It will be useful to segregate the effects of motives into two classes: their
general energizing effects and their more specific cue effects, a distinction made
in most psychological discussions (cf. Miller and Dollard, for example). To the
extent that drive states are intense, perhaps involving changes in body chemistry
(e.g., hormonal changes and the like), they will serve to energize the organism,
increasing the strengths of action tendencies; to the extent that drive states are
distinctive, i.e., comprise discriminable stimulus patterns which can be
selectively associated with reactions, they will serve as cues in the elicitation
of those responses to which they have been conditioned. We shall refer to the
energizing properties of drive by the symbol D and to the selective, specific
stimulus properties of drive by the symbol S_D. The former effects may be either
innate or learned (as in acquired drive states like anxiety), whereas the cue
effects of drives must be learned like other stimulus-response connections.

77 Charles E. Osgood.

7.1.1.1. Energizing effects. One of the basic principles of Hull-type learning
theory states that drive combines multiplicatively with habit strength to yield
reaction potential. In symbolic terms: sEr = sHr × D. Since it is the excitatory
potential associating stimulus with response (sEr) that is directly coupled with
behavioral indices in this theory, this means that the amount of drive associated
with a response tendency (e.g., on either decoding or encoding sides) will directly
influence the probability, latency, and so forth of the behavior concerned. If the
learned habit strengths associating stimulus with response (sHr) of alternative
decoding or encoding reactions are near their maxima, and hence of roughly
equal strength, it can be seen that motivational variables will become very
important in determining what reactions actually occur. This condition is
probably typical of most ordinary language behavior, and hence momentary
fluctuations in motivation (attitudes of speakers and hearers, ‘sets’ of one kind
or another, interests in one or another aspect of the message, and so forth) will
be significant in directing both decoding and encoding activities of communicators.

The postulated multiplicative relation between habit strength and drive has
one very significant corollary: if two or more alternative responses to the same
stimulus (e.g., a divergent hierarchy) have different habit strengths, the effects
of increasing generalized drive (D) will be to further increase the difference
between the two reaction potentials. In other words, the effects of increased
generalized drive upon habit-family hierarchies will be to further augment the
probability of the dominant response and relatively damp the probabilities of
weaker reaction tendencies. This has important implications for language behavior
under stress conditions.
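
The corollary can be checked with a little arithmetic. The following is a
minimal Python sketch and not part of the original report: the function names
and the habit-strength and drive values are hypothetical, chosen only to
illustrate the rule sEr = sHr × D stated above.

```python
def reaction_potential(habit_strength, drive):
    """Hull-type rule from the text: reaction potential = habit strength x drive."""
    return habit_strength * drive


def potential_gap(h_dominant, h_weak, drive):
    """Difference in reaction potential between a dominant and a weaker response."""
    return reaction_potential(h_dominant, drive) - reaction_potential(h_weak, drive)


# Hypothetical habit strengths for two competing responses to the same stimulus.
H_DOMINANT, H_WEAK = 0.8, 0.5

if __name__ == "__main__":
    for drive in (1.0, 2.0, 4.0):
        # The gap equals (0.8 - 0.5) * drive, so it widens as drive rises.
        print(drive, potential_gap(H_DOMINANT, H_WEAK, drive))
```

Since the gap is the difference in habit strengths multiplied by D, it grows in
proportion to D. How that gap translates into response probability depends on
further assumptions (thresholds, momentary oscillation) that this sketch leaves
out.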

7.1.1.2. Cue effects. The cue effects of specific drive states operate precisely
like any other contextual condition: by virtue of having been associated with
the elicitation of particular responses, these cues acquire evocative properties
and contribute to the total eliciting pattern, e.g., they may serve to ‘weight’
the stimulus situation in favor of one response rather than another. If the motive
state of anger has been associated with swearing responses, subsequent
occurrences of this motive state will increase the probability of such responses;
if strong fear states were associated with the first, childhood language of a
bilingual speaker, but not with the second, adult language, subsequent fear states
may ‘throw’ the speaker back into his childhood tongue. Less traumatic interplay
between motive states (interests, attitudes, and the like) is presumably operative
continuously in the flow of communicating. What we have referred to elsewhere
as the ‘intentions’ of speakers reflect the integration of such motive states
with semantic states.

To this point we have been speaking only of the cue effects of specific drive 
(e.g., drive states previously and differentially associated with particular re- 
actions). What should be the cue effects of generalized drive (e.g., drive states 
not previously and differentially associated with particular reactions)? Since 
the effect of any flood of novel stimulation is to change the pattern eliciting 
specific reactions, e.g., it shifts the momentary pattern along generalization 
continua away from the point of maximal habit strength, the effect of non- 
specific drive should be to weaken existing habits. Since weaker members of 
response hierarchies, which are nearer the reaction threshold anyway, will tend
to be eliminated in this fashion, the cue effects of non-specific drive (just like 
the energizing effects above) will be to relatively augment the probability of 
occurrence of dominant responses. 

7.1.1.3. Language habit-family hierarchies susceptible to motivational effects. 
Referring back to the general learning theory model of decoding and encoding
processes developed in section 6.1, there are at least four places at which hier- 
archies susceptible to motivational effects are found. 

(1) Semantic decoding. Since all signs are to some extent multi-significant, 
homonyms being only the extreme case, varying the intensity of generalized 
drive should tend to channel perception and significance into more stereotyped 
modes and varying specific motivational states should operate selectively to 
change the probabilities of particular alternative significances or ways of
perceiving.

(2) Anticipational decoding. Given varying histories of contiguity and re- 
dundancy in ‘what follows what’ in both linguistic and perceptual decoding, 
the central sensory representations of antecedent events will be differentially 
associated with hierarchies of subsequent central sensory events, usually as 
predictive rather than evocative tendencies (e.g., ‘tuning up’). Increasing the 
level of generalized drive (e.g., decoding under anxiety) should operate to stereo- 
type these input sequencing mechanisms, thereby producing errors. Increases 
in specific drive associated with specific decoding sequences should simply 
stabilize such sequences, offering resistance to external disturbances (for ex- 
ample, the greater the interest in a particular musical instrument, the better 
one will follow its line through a complex selection). Whether specific motives 
operate upon anticipational (and dispositional) mechanisms directly or mediately
via feedback from the semantic system would be difficult to ascertain. 

(3) Semantic encoding. Semantic states, as ‘intentions,’ are associated with 
hierarchies of alternative encoding units or ‘expressions.’ Again, the effect of 
increased generalized drive should be to further stereotype semantic encoding,
and the effect of increased specific drive should be to increase the probability 
and amplitude (stress, pitch) of those message units associated with such drive 
states or ‘interest,’ while decreasing their latencies (e.g., tending to move them
forward in the message sequence, either through choice of alternative
constructions or, if drive is high enough, breaking through constructive barriers).

(4) Dispositional encoding. By virtue of varying degrees of contiguity and
redundancy between units in sequential encoding, antecedent skill sequences,
as centrally represented, will be variably associated with hierarchies of
subsequent skill sequences, as centrally represented. These output redundancies in
language behavior are particularly concerned with grammatical ordering, affixing,
and so on. The effects of generalized drive will be to stereotype the structure
of utterances (making the most probable sequences still more probable) and the
effects of specific drive (probably indirectly, through selection of semantic
items) will be to select among alternative constructions those most closely
associated with the motive state operative.


7.1.2. Research Proposals and Predictions


It is always easy to note errors in perceptual decoding, misinterpretations of the
significance of signs, and conclude that such and such a motivational state ‘must’
have been operative. A young minister was reading a newspaper report on a
speech given by the Bishop in his area; he sat down and wrote a letter to the
editor complaining that, while it was all right for the paper to disagree with the
doddering old Bishop, it was not fair to keep inserting, in italics, the word
applesauce. The editor wrote back pointing out that the word in italics, which the
young minister had repeatedly misread, was applause. To conclude that the young
minister ‘must’ have had no respect for his superior may be valid, but the
argument is circular scientifically if the act of misperception is the only basis
for assuming the motive state. Similarly on the encoding side, it is easy to note
shifts in construction from passive to active in what appear to be appropriate
motive conditions, to note a ‘primitivisation’ of word selection and construction
selection under emotional states, and infer the presence of the independent
variable, but the argument is equally circular. What is needed for research in
this area is controlled situations in which the independent variable, motive
state, can be manipulated independently of the dependent variable, language
behavior, and the effects upon the latter observed. In other words, we need to
devise experimental situations in which either generalized drive level or specific
motive state can be made to be present in one group or at one time and absent in
another (control) group or another (control) time. Whereas this is relatively easy
to accomplish as far as generalized drive level is concerned, it is very difficult
to accomplish as far as specific motive state (attitude, interest, etc.) is
concerned.


7.1.2.1. Decoding operations and motivation. (A) Generalized D. Earlier in this
report (section 5.4) association techniques were described for building
materials having either relatively high or relatively low transitional
probabilities for a particular community of people. If one group of subjects from
this community were caused to be under general high drive level and another under
relatively low drive level, and both were given the task of [illegible]
to both high and low transitional sequencing materials, we would predict a
significant interaction between these two variables: the high drive level subjects
should be relatively much poorer on comprehension, retention, etc. on the low
transitional probability material and relatively better on the high transitional
probability material, e.g., the spread of the high drive level group on the two
materials should be significantly greater. There are many possible ways to
experimentally induce differential drive levels: unpredictable shock might be
used, threat of punishment for failure, calling the task ‘an important intelligence
test,’ and even making them extremely hungry, thirsty, or angry. It would also
be possible to select two extreme groups on the Taylor Manifest Anxiety Scale,
assuming the high scoring subjects to be under generally higher anxiety drive
level (this technique has worked well in a number of other situations).
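
The predicted interaction amounts to a difference between two spreads, which
can be sketched as follows. This is an illustrative Python fragment only: the
group labels, the function name, and all scores are hypothetical, and a real
analysis would of course test the interaction for significance rather than
merely compute it.

```python
from statistics import mean


def interaction_contrast(scores):
    """Return (high-D spread) minus (low-D spread).

    `scores` maps (drive_group, material) to a list of comprehension scores.
    A group's spread is its mean on high transitional probability material
    minus its mean on low transitional probability material.
    """
    spread = {
        drive: mean(scores[(drive, "high_tp")]) - mean(scores[(drive, "low_tp")])
        for drive in ("high_D", "low_D")
    }
    return spread["high_D"] - spread["low_D"]


# Made-up scores, used only to show the shape of the predicted result.
EXAMPLE_SCORES = {
    ("high_D", "high_tp"): [8, 9, 10],
    ("high_D", "low_tp"): [3, 4, 5],
    ("low_D", "high_tp"): [7, 8, 9],
    ("low_D", "low_tp"): [5, 6, 7],
}
```

A positive contrast is the pattern the hypothesis predicts: the high drive
group does relatively better on the high probability material and relatively
worse on the low probability material than the low drive group does.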


Another study, on perceptual decoding, would be to subject our high D and low D groups
to Miles’ Kinephantoscope (ambiguous figure capable of multiple interpretations), pre- 
dicting fewer alternatives and longer ‘holding’ of the dominant interpretation in the high 
D group. A variant of the latter design would put verbal homonyms in ambiguous sentences, 
predicting along similar lines. A project requiring considerable ingenuity would be to pre- 
sent puns to people under high and low D, puns involving usually a shift from the more 
probable significance of a word to a less probable significance, the prediction being that 
under high D people will be less likely to ‘catch on’ and perhaps more annoyed when they
do. Is it true that highly anxious people (or angry people, or hungry people, or tired people) 
are more oblivious to plays on words—as decoders? 


(B) Specific S_D. Many of the experiments undertaken by Bruner and Postman
and their associates on the effects of motivational factors upon perception are
interpretable as effects of specific motive states upon decoding. The effect of
value systems (religious, theoretical, economic, etc.) upon facilitating perception
of co-valuant words presented in the tachistoscope is a case in point (assuming
that motivational factors can be disentangled from frequency-of-usage factors;
cf. work of Howes and Solomon).⁷⁸ So-called ‘free’ association is another standard
experimental situation in which specific motivational factors play a part;
witness the use of this technique in ‘lie-detection’ and in psychotherapy to get
at unconscious complexes. An experiment by Foley and Matthews⁷⁹ has demonstrated
that the associations made to words like administer and binding vary
appropriately with the amount of professional training in either law or medicine.
In all of these cases, however, manipulation of motive or interest state is
indirect in terms of subject selection, and frequency-of-usage variables therefore
probably are confounded. One can speculate on the possibility, however, of
creating interest of children in animals vs. routes, for example, and then
presenting messages like ‘Bear right’; the general design here would be to study
the associations or other indices of decoding significance resulting from the
presentation of homonyms, under differential motivation. Capitalizing on the
difficulties of our young minister above, it should also be possible to study the
effects of differential motivation upon misperception of signs, e.g., pairs like
applesauce and applause, under tachistoscopic or other experimental conditions.


78 Psychological Review 57. 229-34 (1950).
79 Journal of Experimental Psychology 33. 299-310 (1943).




7.1.2.2. Encoding operations and motivation. (A) Generalized D. A basic design
much like that proposed for decoding can be suggested here. Using the same
predetermination of transitional probabilities (nouns to verbs, adjectives to
nouns, and so forth; see section 5.4), subjects can first become familiar with
lists of these words and then use them as the vocabularies for spontaneously
encoding short stories, while under either high or low generalized D. Again it
would be predicted that the average transitional probabilities of sequences
spontaneously encoded under high D would be higher than when encoding
under relatively low D. Another experimental approach to this problem would
be to solicit spontaneous messages from high and low D subjects and then give
systematically mutilated portions of them (cf. ‘Cloze’ method, section 5.3.4)
to other groups of subjects to be filled in, predicting that the materials of high
D subjects would yield higher average fill-in scores than the materials produced
by relatively low D subjects.
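
The dependent measure in the first design, the average transitional
probability of an encoded sequence, could be computed along the following
lines. This Python sketch rests on a loudly stated assumption: it estimates
transitional probabilities from adjacent-word counts in a small set of
sentences, whereas the report obtains them from association techniques
(section 5.4). The function names and the toy norms are hypothetical.

```python
from collections import Counter


def bigram_model(corpus):
    """Estimate P(next word | current word) from a list of tokenized sentences."""
    pair_counts = Counter()
    first_counts = Counter()
    for sentence in corpus:
        for current, following in zip(sentence, sentence[1:]):
            pair_counts[(current, following)] += 1
            first_counts[current] += 1
    return {pair: n / first_counts[pair[0]] for pair, n in pair_counts.items()}


def mean_transitional_probability(tokens, model):
    """Average P(next | current) over adjacent pairs; unseen pairs count as 0."""
    pairs = list(zip(tokens, tokens[1:]))
    if not pairs:
        return 0.0
    return sum(model.get(pair, 0.0) for pair in pairs) / len(pairs)


# Toy norms (hypothetical data standing in for association-based norms).
NORMS = [["the", "dog", "barks"], ["the", "dog", "runs"], ["the", "cat", "runs"]]
MODEL = bigram_model(NORMS)
```

Under the hypothesis, stories encoded under high D would score higher on
`mean_transitional_probability` than stories encoded under low D, given the
same vocabulary and norms.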


Another approach would utilize tape recorded interviews obtained from patients
undergoing psychotherapy: using judgments of the therapist as a criterion of periods of
relatively high and low D (anxiety), excerpts would be mutilated and given to other people
for reconstruction. The prediction is that the sequences produced under high D should be
more susceptible to correct reconstruction. Recordings made of people speaking under high D
(anger, fear, and so on) may be compared with recordings by the same people later
discussing the same events, using the same mutilation and fill-in technique, and one would
make the same prediction as to the transitional predictability of the sequencing. Linguists
could analyse any of these experimental and control productions in terms of the diversity
of constructions and so forth displayed. Type-token ratios, another measure of language
diversity or flexibility, could also be taken.
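
For readers unfamiliar with the measure, a type-token ratio is simply the
number of different words divided by the total number of words in a sample.
The Python sketch below is illustrative only; the function names and the very
rough tokenizer are assumptions, not a procedure given in the report.

```python
def tokenize(text):
    """Rough tokenizer: lowercase, split on whitespace, strip edge punctuation."""
    words = (w.strip(".,;:!?\"'()") for w in text.lower().split())
    return [w for w in words if w]


def type_token_ratio(tokens):
    """Distinct words (types) divided by total running words (tokens)."""
    if not tokens:
        raise ValueError("need at least one token")
    return len(set(tokens)) / len(tokens)
```

Because the ratio tends to fall as a sample gets longer, high D and low D
excerpts should be cut to the same number of words before they are compared.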




(B) Specific S_D. As pointed out earlier, it is extremely difficult to think up
adequate ways of experimentally manipulating specific motive states, such as
are presumed to contribute to the selection among alternative semantic and
grammatical forms in encoding. Therefore, we may start by indicating in general
terms the effects to be expected. In the utterances produced by a speaker in a
specific motive state, the stress on those units related to (e.g., ‘expressing’) the
motive state should be increased, the pitch of those units should be higher, and
(as a secondary effect) the vowel lengths should be greater. Here we are referring
to nonphonemic stress, pitch and vowel length, and in languages where these
message characteristics are phonemic these effects should be continuous beyond
the purely phonemic elements.⁸⁰ All of these effects are attributable to the
relation of response amplitude to amount of reaction potential as determined by
drive.

Following the relation between reaction potential and latency, it would be
expected that those semantic units associated with specific drive should tend to
be encoded earlier in an utterance, e.g., motive states should affect the ordering
of units in messages. Where alternative constructions are available in the language
this will show up in selection among these alternatives. Interest in the actor
enhances initial encoding of this unit, hence an active construction; interest in
an object, on the other hand, enhances initial selection of this unit, hence a
passive construction (example: ‘A very young boy won the diving championship’
vs. ‘The diving championship was won by a very young boy’).⁸¹ Interest in the
action itself increases the probability of encoding the verb first, and hence
selection of a command or exhortatory construction (e.g., ‘Eat your dinner’ or ‘Run
for your life!’). The position of subordinate clauses in utterances should be
particularly susceptible to the effects of motivation: interested in the conditional
status of the matter, I am likely to say, “If you wish, we can go to the movies”
rather than “We can go to the movies, if you wish.”

80 The question was raised as to how generally, across languages, do the
units associated with greatest interest have the greatest stress,
pitch, and lengthening of vowels; most of the linguists felt that while not universal,
it would be a general trend in this direction.


In general, one would expect these differential motive states of speakers to operate
within the existing rules of the language, by selecting among alternate constructions and
using non-phonemic possibilities for emphasis. However, sufficiently strong motivation
combined with insufficient flexibility in the language structure may produce errors: the
speaker breaks through the grammatical constraints of the system. Thus, at a picnic one
of our linguists heard a man say, ‘Garlic I taste!’ when startled by a strong flavor. When
one looks for them, he finds many such lapses in everyday speech under motivated
conditions. Errors of this type are particularly hard to eradicate in the encoding of
children; the “Me and Johnny went . . .” is in part a reflection of the youngster’s
ego-involvement in himself. Also, other things equal, one would expect the effects of
specific motive states to show up most clearly in those aspects of the message which are
in ‘free’ variation.⁸² It should also be noted that since the effects of specific S_D
persist through utterances one would expect motivation to produce redundancies wherever
possible, for example, the negation oriented individual who tells the policeman, “I ain't
never done nothing to nobody nohow.” Finally, one would expect certain types of ‘slips of
the tongue’ to be due to the moving forward in the sequence of items of greater interest,
where phonetic similarities facilitate this intrusion; large samples of ‘slips’ would have
to be collected and analysed to test this notion.


But how is one to manipulate the specific motive states of speakers with respect
to particular items in potential utterances in such ways as to test these
predictions? As suggested earlier, the sequence of events in psychotherapy offers
some possibility of independent estimation of speaker states. Carroll, also
mentioned above, has tried manipulating the interest of students in a classroom
situation in either the actor or the objects acted upon, with predicted results.
In another experiment conducted in Carroll’s laboratory, two students were
separated by a panel, each given a set of objects, pegs and a board, and given a
specific schedule of questions to ask: ‘Is your left hand peg blue?,’ ‘Give me
your orange block,’ and so forth. The syntactical characteristics of answers to
specific questions were studied. This suggests another kind of control over the
interest state of the decoder-encoder, namely the nature of the information
requested by another communicator. One could ring changes on this model as a means
of investigating the present problem. Within the same general situation, it should
also be possible to increase the general drive level, e.g., making it a
competitive situation, a ‘test of intelligence,’ giving certain objects reward or
threat value, and so forth.


81 Carroll reports a classroom experiment on this problem with positive results.

82 Greenberg mentioned an African language, Hausa, which ideally illustrates this;
flexibility in order permits the item of maximal interest to come first in
utterances.


7.2. Meaning


7.2.1. Meanings of "Meaning" 


There is a vast literature concerning the subject of ‘meaning,’ with
contributions from the fields of philosophy, philology, literary criticism, linguistics, and
psychology. For a key concept, however, the variability in meaning of this 
chameleon-like word is notable. It is not our purpose in this section to review all 
or even a small part of the applications of the term ‘meaning’ in this literature. 
Rather, we intend only to outline the dimensions of a field of denotation within 
which, we believe, lie most of the important meanings of ‘meaning’ found in 
scientific discourse. 

Ogden and Richards pointed out the necessity of an adequate theory of signs
as a prerequisite for understanding the nature and kinds of meaning. Charles W.
Morris took up this task and has laid the foundations for such a theory. He
has delineated the three principal aspects in the study of sign function: semantics
(the relation of signs to things signified), pragmatics (the relation of signs to
elicited behavior), and syntactics (the relation of signs to signs within a system of
signs); and he has dealt with the classificatory problems resulting from the
phenomena of perceptual constancy and of equivalences, both in signs and
denotata. Modern linguistics, largely independent of Morris’s work, has developed
a highly effective classificatory scheme for dealing with alternatives and
equivalences in language signs, but, at least in its American development, it has
generally avoided dealing with problems outside of those having to do with the
forms of language signs, or ‘syntactics’ in Morris’s terms. The success of this
method in this area of application, however, has prompted some anthropologists
to apply a similar method in the analysis of sign-to-denotatum relationships,
i.e., of ‘semantics’ in Morris’s terms.⁸⁶ Lastly, recent developments in the
formulation of behavior theory by experimental psychologists throw additional light
on the nature of sign behavior. In this formulation the contributions of Morris’s
work and of modern linguistics find added relevance and are seen in their relation
to the total problem of sign behavior.

83 Floyd G. Lounsbury.
84 C. K. Ogden and I. A. Richards, The meaning of meaning (London and New York,
[date illegible]).
85 Charles W. Morris, Foundations of the theory of signs, International Encyclopedia of
Unified Science 1. 2 (Chicago, 1938); also: Signs, language and behavior (New York, 1946).
86 E.g., see Ward H. Goodenough, Property, kin, and community on Truk, Yale
University Publications in Anthropology, No. 46, esp. pp. 103-110 (New Haven, 1951).
87 Leonard Bloomfield, Language, chapter 9 (New York, 1933).

As an approach to the consideration of the possibilities in the meaning of
‘meaning,’ let us consider the position of the currently dominant trend in
American linguistics. Bloomfield⁸⁷ defined the meaning of a linguistic form as “the
situation in which the speaker utters it and the response which it calls forth in
the hearer.” He continues, “The speaker’s situation and the hearer’s response are
closely co-ordinated, thanks to the circumstance that every one of us learns to
act indifferently as a speaker or as a hearer. In the causal sequence speaker’s
situation → speech → hearer’s response, the speaker’s situation, as the earlier
term, will usually present a simpler aspect than the hearer’s response; therefore
we usually discuss and define meanings in terms of a speaker’s stimulus.”
Bloomfield further distinguishes between the multitudinous unique total situations
which prompt us to utter any one linguistic form, and the common distinctive
semantic features which all of these situations share and which alone have
relevance to the linguistic problem. These latter he refers to as ‘distinctive
meaning.’⁸⁸
Having proceeded this far, Bloomfield paints a very discouraging picture of the 
linguist's possibilities for understanding and handling meaning. To describe 
adequately the speaker's situations and the hearer's responses would require 
unattainable knowledge in the natural sciences, sociology, physiology, and psychology.
Even if the external situations and the overt responses could be adequately
described and classified, the internal states of the speaker’s and hearer's
bodies would be imponderable variables. In his development of linguistic method,
however, Bloomfield points out that for the analysis of linguistic form a knowledge
of meaning is largely unnecessary. All that is relevant is to test for differential
meaning, i.e., whether two forms mean the same or different. This being
the case, and the linguist’s business being conceived as the analysis of linguistic
form, linguistics could proceed without further concern for theories of meaning.
The so-called American school of linguistics, receiving its orientation from Bloomfield,
has gone ahead on this basis, developing the procedures and refining the
theoretical underpinnings of the analysis of linguistic form, apart from and independent
of the analysis of meaning. Its only acknowledged dependence upon
meaning was the now classical question of ‘same or different.’ Even this apparently
simple question, however, cannot always be answered satisfactorily.
Here, however, instead of depending on a theory of meaning, linguists have
largely operated by rule of thumb. There have been attempts to show theoretically
at least that even this minimal dependence on meaning is unnecessary.
Although not all are agreed on this point, most are satisfied that to try to analyze 
the two problems of form and meaning simultaneously adds confusion rather 
than clarification. Some favor the analysis of each of the two systems independ- 
ently and only then the analysis of the relationships between them. The structures 
of the two systems are not isomorphic and the relationship between them is not 
one of simple matching. Others, however, tend to view the two systems as iso- 
morphic and attempt to define the linguistic units in such a way as to preserve this 
isomorphism. The units of form in such an analysis attain a complexity and ir- 
regularity, however, which betray their dual nature. Exponents of this latter 


88 Bloomfield also used the term ‘linguistic meaning’ for the distinctive semantic features,
but we have avoided introducing this label here, and have used his alternative term, for 
the reason that the phrase ‘linguistic meaning’ has recently come to be used in a quite 
different sense as noted below. 


position see in the analysis of form the whole problem of linguistics. Analysis of 
‘meaning’ is felt to be unnecessary for linguistics, but when achieved by some 
other discipline, would be expected to present parallels or correlates to the
linguistic analysis. Instead of looking for the meaning of a linguistic form in non-linguistic
situations and non-linguistic responses, a ‘meaning’ is sought rather in
the extended contexts of preceding linguistic situations and following linguistic 
responses. One writer, summarizing this position, puts it thus: 


“Could we perhaps do something further with ‘meaning’ inside linguistics? Yes, but
only on condition that we distinguish sharply between the inside and the outside. Let the
sociologists keep the outside practical meaning; then we can undertake to describe the pure
linguistic meaning. We can do it thus:
“Among permissible combinations of morphemes, some are commoner than others.
Thus there are conditional probabilities of occurrence of each morpheme in context with
others. . . .
“Now the linguist’s ‘meaning’ of a morpheme (or of a combination thereof, of any complexity
up to a complete utterance, which might even be a whole book or poem) is by definition
the set of conditional probabilities of its occurrence in context with all other morphemes—of
course without inquiry into the outside, practical, or sociologist’s meaning of
any of them. . . . So far we have done almost nothing with pure linguistic ‘meaning’ as so
defined, for the obvious reason that its mathematics is of the continuous sort, which we
are not accustomed to handling. . . . Still a beginning has been made on a structural
semantics by one linguist. . . . This work is very recent, and is the most exciting thing
that has happened in linguistics for quite a few years. . . .”⁸⁹
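The definition quoted above can be made concrete with a small sketch. Everything specific in it is an assumption made for illustration: the toy corpus, the segmentation into morphemes, and the choice of "occurring in the same utterance" as the context. The quoted passage does not prescribe any estimation procedure.

```python
from collections import Counter
from itertools import permutations

# Hypothetical toy corpus: each utterance is already segmented into morphemes.
corpus = [
    ["he", "did", "the", "job", "poor", "-ly"],
    ["he", "did", "a", "poor", "job"],
    ["she", "did", "the", "job", "quick", "-ly"],
    ["she", "sang", "quick", "-ly"],
]

def conditional_probabilities(utterances):
    """Estimate P(target occurs | context occurs), counting whole utterances."""
    occurs = Counter()     # number of utterances containing each morpheme
    co_occurs = Counter()  # number of utterances containing each ordered pair
    for utterance in utterances:
        present = set(utterance)
        occurs.update(present)
        co_occurs.update(permutations(present, 2))
    return {
        (target, context): count / occurs[context]
        for (target, context), count in co_occurs.items()
    }

probs = conditional_probabilities(corpus)

# In this sketch the 'linguistic meaning' of "-ly" is its row of the table:
# the probability of "-ly" given each morpheme it ever occurs with.
ly_profile = {ctx: p for (tgt, ctx), p in probs.items() if tgt == "-ly"}
```

Morphemes that never share an utterance with "-ly" (here "a") simply have no entry in `ly_profile`, which is equivalent to a conditional probability of zero.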


Perhaps we can illustrate this approach by means of a simple example. Con- 
sider the English suffix -ly. Generally this element is said to have two meanings 
(or it may be said to be two different morphemes): (1) the ‘adverbial meaning’ 
as in he did it poorly, and (2) the ‘adjectival meaning’ as in it’s a likely story or 
a goodly number. Let us consider only the first of these two, the adverbial suffix. 
Compare the sentences he did a poor job and he did the job poorly. The deter- 
minants of the occurrence of the segment -ly in the second sentence lie entirely 
with that sentence itself, that is, within the fairly immediate context. Such a 
segment is often called in contemporary linguistic terminology an ‘empty morph,’
inasmuch as it is devoid or empty of any non-linguistic situational meaning,
and has its occurrence determined entirely by features of the purely linguistic
context. Its ‘meaning,’ then, is only of the ‘linguistic’ sort, as the above-quoted
writer uses the term. For no other segment of this sentence, however, can the
occurrence be defined purely in terms of determinants lying within the sentence.
The approach described in the above quotation, instead of seeking meaning in
non-linguistic contextual determinants of occurrence, would seek to pin it down
in terms of determinants of occurrence in larger and larger purely linguistic
contexts. For the majority of lexical items such a definition of meaning, while
theoretically tenable, becomes methodologically useless in the [...]
linguistic text analysis. The ‘linguistic meaning’ of a given word may not be
89 Martin Joos, Description of language design, The Journal of the Acoustical Society of
America 22. 6. 708 (1950).


found in the linguistic context of that word in any given text or even in a very
large collection of texts in which that word occurs one or more times. The linguistic
contexts which suffice to determine the ‘meaning’ of a form may be of the
order of size comparable to the total of the past language experience of the
informant. While such a body of text is both unattainable and unanalyzable, the
linguistic meaning of a form may often be uniquely defined in terms of features
of a relatively limited amount of specially elicited linguistic context, such as
that elicited by the semantic differential (see section 7.2.2) or by various other
psychological or linguistic testing devices. 


7.2.1.2. Situational vs. behavioral meaning. The dichotomy between situational
and behavioral meanings is foreshadowed in Bloomfield’s definition of meaning
quoted earlier, where he includes “the situation in which the speaker utters
[a form] and the response which it calls forth in the hearer.” The dichotomy
applies as well to linguistic meaning (as described above) as it does to extralinguistic
meaning, for the total stimulus situation antecedent to any speech
event or fraction thereof may include not only extralinguistic but also linguistic
determining factors; and the total behavioral response may include linguistic
as well as extralinguistic behavior.
Morris’s three-way breakdown of sign relations provided a framework for 
defining three kinds of meaning. The relationship of sign to things signified 
Morris called ‘designation,’ and the study of such relationships he called ‘se- 
mantics.’ The relationship of sign to sign within a system of signs he called 
‘implication’ (signs ‘implicate’ other signs), and the study of such relationships
he called ‘syntactics.’ The relationship of signs to elicited behavior Morris 
called ‘expression’ (signs ‘express’ interpretants or behavioral responses), and 
the study of such relationships he called ‘pragmatics.’ Applying the term meaning
in its widest extent, to contexts of all sorts, there may thus be distinguished
three kinds of meaning: designative meaning (semantic), implicative meaning
(syntactic), and expressive meaning (pragmatic). The choice of labels, however, 
is not the most felicitous. Rather, we may refer to them as situational meaning, 
linguistic meaning, and behavioral meaning, respectively. 

This presents a trichotomy rather than the 2 × 2 division which would be
expected from the two dichotomies linguistic vs. extralinguistic and situational
vs. behavioral. The trichotomy results from lumping together the entire linguistic
portion of any behavior chain, and from disregarding the fact that a stimulus
situation may consist of both linguistic and nonlinguistic stimuli and that a
total response may contain both linguistic and nonlinguistic responses. Inasmuch
as linguistic analysts customarily isolate out only the linguistic portions of a
behavior chain for their study, there is a certain practical justification for this.
There are problems in the psychology of language, however, where it is relevant 
to distinguish, in respect to a given form, the antecedent conditions from the 
following developments within the linguistic stream. 

A recent trend in American structural linguistics has been to be concerned 
only with the so-called linguistic meaning. Lexicography is concerned with 
designative meaning, i.e., situational meaning. Psychologists are interested
principally in behavioral meanings. Anthropological studies are directed to 
both designative and behavioral meaning. 

7.2.1.3. Internal vs. external loci for ‘meaning.’ In section 2.2.2 of this volume
are described the ‘empty organism’ formulation and the ‘mediation’ formulation
of behavior theory. The inadequacy of such stringent behaviorist formulations
as those of the empty-organism type becomes particularly obvious when applied
to language behavior. As Bloomfield recognized when he spoke of the speaker’s
internal states, and as all who have approached meaning from the vantage
point of literature and the arts have recognized, intervening internal variables
must be allowed for. While not directly observable, they are yet not entirely
beyond the reach of psychological techniques of investigation. The justification
for a ‘mediational’ formulation within a still cautious behaviorist methodology
has been presented elsewhere. In any case, the internal mediation processes
must be referred to in our survey of the definitions of meaning. Some of the
diversity in definitions and uses of the term ‘meaning’ has been on the issue of
the locus assigned to ‘meaning.’ Some behaviorists would refer only to externally
observable situations and/or responses, while others are referring to internal
states of the organism, or more specifically to states in the mediation process.
Therefore this dichotomy must be admitted as another dimension of the ‘meaning’
field. The dichotomy internal vs. external applies to all of the types isolated
thus far. Behaviorists, both among psychologists and linguists, who resist the 
interpolation of mediational phases in their behavior formulations are forced to 
admit only ‘externals’ in their definitions of meaning. Some of those who favor 
the reckoning with mediational phases are quite as emphatic in having the term 
‘meaning’ apply only to mediational responses, even though these are not 
observable directly and may be inferred only by means of devious testing devices 
or from clinical data. Many definitions and uses of ‘meaning,’ on the other 
hand, are quite unclear in this aspect of their reference. 

7.2.1.4. Particular vs. general; totality vs. distinctive features. A further respect
in which definitions of meaning differ is in whether they have the term refer to
a single total immediate context, or to the class of many or of all total contexts,
or to the common features of all such contexts of the given language sign. Many
definitions, of course, are ambiguous on this point and pay no attention to these
distinctions. Some, however, do, as in Bloomfield’s ‘meaning’ (total immediate
[...] other dimensions previously [...] external [...] response connected with a
given sign; or of all such situations or [...]. One may be concerned at different
[...] possibilities in this trichotomy. Thus a linguist proceeds from a single linguistic
context of a form, to a collection of many linguistic contexts of the form, to the
abstraction of the common distinctive features of these [...]. An
anthropologist, in the study of a kinship system, for example, proceeds from a
collection of kinsmen and kin-types (the ‘genealogical method’ in anthropological
field work), to kin-classes (the native classification as given by native termi- 
nology), to the distinctive semantic features of the kin-classes (the product of 
anthropological analysis).

This trichotomy closely parallels Morris's distinctions between ‘denotatum,’
‘designatum,’ and ‘significatum.’ We need only modify the term ‘class of many 
or all total contexts’ so as to admit of potential as well as actual situations, or 
assumed or imagined as well as real situations, or rather to admit of a class as 
a ‘kind’ whether or not there exists a real representative of the class. Morris's
term ‘denotatum’ refers to real instances, if any. A sign may or may not have 
any denotata. (E.g., I may or may not have an ‘uncle.’ If I do, he, John Doe—if 
that be his name—is a denotatum of the language-sign ‘my uncle’ when I use it.) 
The term ‘designatum’ refers to a recognized kind or class which may or may 
not have any real or immediate representatives. (The description of what 
‘uncles’ are or may be is the designatum of the sign. Thus, in our kinship system, 
the designatum of ‘uncle’ is the class consisting of the types father’s brother, 
mother’s brother, father’s sister's husband, mother’s sister's husband, etc.) 
The ‘significatum,’ on the other hand, consists of the defining features of the 
designatum class. (In terms of components in the semantic dimensions which 
underlie the structure of our kinship system, the significatum of ‘uncle’ would
be the distinctive common semantic features of the class, viz., kinsman, male,
first-degree collaterality, and first or higher ascending generation.)
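The three terms can be told apart in a small sketch built on the ‘uncle’ example. The numeric coding of collaterality and generation, the short list of kin types, and the two named relatives are hypothetical and serve only as illustration. The feature ‘kinsman’ is left implicit, since every entry in the list is a kinsman.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class KinType:
    label: str
    sex: str         # "male" or "female"
    collateral: int  # 0 = lineal, 1 = first-degree collateral (assumed coding)
    generation: int  # +1 = first ascending, 0 = ego's own (assumed coding)

# Hypothetical inventory of kin types.
KIN_TYPES = [
    KinType("father", "male", 0, 1),
    KinType("father's brother", "male", 1, 1),
    KinType("mother's brother", "male", 1, 1),
    KinType("father's sister", "female", 1, 1),
    KinType("father's father's brother", "male", 1, 2),
    KinType("brother", "male", 1, 0),
]

def is_uncle(kin):
    """Significatum: the defining features shared by the whole class."""
    return kin.sex == "male" and kin.collateral == 1 and kin.generation >= 1

# Designatum: the class of kin types picked out by those features.
designatum = {kin.label for kin in KIN_TYPES if is_uncle(kin)}

# Denotata: real individuals, if any, who belong to the class.
relatives = {"John Doe": "mother's brother", "Richard Roe": "father"}
denotata = {name for name, kin in relatives.items() if kin in designatum}
```

If `relatives` were empty, the sign would still have a designatum and a significatum but no denotata, which is the case the text allows for.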

7.2.1.5. The dimensions of difference in definitions of meaning. The set of four 
dimensions of difference described above serves quite well as a framework within 
which to understand many of the definitions and uses of the term ‘meaning’ 
which we have encountered in linguistic, psychological, and philosophical 
writings. It is probably not adequate—i.e., probably does not recognize enough 
kinds of difference—to serve as a framework for classifying all of these definitions 
and uses. There is, furthermore, often the problem of interpreting just what a
given writer was aiming at. Most scientific writers, however, are sufficiently 
specific and clear so that this difficulty is at a minimum. In the references to 
meaning in literary criticism this problem is of greater magnitude but also of 
less importance to us at the moment. 

We may represent these dimensions, and the possibilities (components) in
each dimension, as follows:

I. S:R (stimuli vs. responses, or situational vs. behavioral)
II. E:I (external vs. internal)
III. N:L (nonlinguistic vs. linguistic contexts)
IV. T:ΣT:D (a total context vs. all total contexts vs. distinctive features
common to all total contexts).

In terms of components from these dimensions we may now represent, by way 
of example, a few of the better known conceptions of ‘meaning’: 
Bloomfield’s ‘meaning’: (S + R)-(E + I)-N-T

Bloomfield's ‘distinctive meaning’: S-(E + I)-N-D [Bloomfield mentions
only situations (S) in his discussion of ‘distinctive meaning,’ though he included
both situations and behavior in his definition of ‘meaning.’ Perhaps (N + L)
should be substituted for N in ‘distinctive meaning' above, depending on the 
interpretation of one of Bloomfield's remarks.] 

Joos's ‘linguistic meaning’: (S + R)-E-L-D 

Morris's ‘denotatum’: S-E-N-T

Morris’s ‘designatum’: S-E-N-ΣT

Morris's ‘significatum’: S-E-N-D

Behavioral mediational ‘meaning’ (Osgood): R-I-N-D

Substitution theory ‘meaning’ (1): S-E-N-(T, D?) 

Substitution theory ‘meaning’ (2): R-E-N-(T, D?). 
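The component formulas above lend themselves to a simple tabular encoding, sketched here. The label `ALL_T` is a hypothetical name for the ‘all total contexts’ component, a sum such as (S + R) is encoded as a set with two members, and the two substitution-theory entries are omitted because their fourth component is marked as uncertain.

```python
DIMENSIONS = ("I", "II", "III", "IV")

# One set of components per dimension, in the order I, II, III, IV.
conceptions = {
    "Bloomfield 'meaning'": ({"S", "R"}, {"E", "I"}, {"N"}, {"T"}),
    "Bloomfield 'distinctive meaning'": ({"S"}, {"E", "I"}, {"N"}, {"D"}),
    "Joos 'linguistic meaning'": ({"S", "R"}, {"E"}, {"L"}, {"D"}),
    "Morris 'denotatum'": ({"S"}, {"E"}, {"N"}, {"T"}),
    "Morris 'designatum'": ({"S"}, {"E"}, {"N"}, {"ALL_T"}),
    "Morris 'significatum'": ({"S"}, {"E"}, {"N"}, {"D"}),
    "Osgood mediational 'meaning'": ({"R"}, {"I"}, {"N"}, {"D"}),
}

def differing_dimensions(first, second):
    """List the dimensions on which two conceptions of 'meaning' differ."""
    pairs = zip(DIMENSIONS, conceptions[first], conceptions[second])
    return [dim for dim, a, b in pairs if a != b]
```

For instance, `differing_dimensions("Morris 'denotatum'", "Morris 'significatum'")` isolates dimension IV, which is the particular vs. general contrast discussed in 7.2.1.4.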


7.2.2. Measurement of Connotative Meaning 


The use of the semantic differential as a possible measuring instrument is 
suggested at several places in this report. For this reason, as well as for its
intrinsic interest in psycholinguistics, a brief description of this instrument and 
its possible applications is given here. The aspect of meaning to which this 
measuring device is assumed to provide an index is the distinctive psychological 
state of the communicator which occurs whenever a sign is either presented 
(decoding) or produced (encoding). The semantic differential is a combination
of association and scaling procedures. A sample of potential bi-polar associations 
to a particular concept is provided for the subject, his task being simply to 
indicate the direction of each association and its intensity on a 7-step scale. 
In other words, from the myriad of linguistic and non-linguistic behaviors 
mediated by semantic states, a small but carefully devised sample is selected. 

7.2.2.1. Logic of semantic differentiation. The label ‘semantic differential’ 
points quite accurately to its intended operation—a multivariate differentiation 
of concept meanings in terms of a limited number of semantic scales of known 
composition.⁹² The logic of the present instrument can be summarized as follows:
(1) The process of description or judgment can be conceived as the allocation of a
concept to an experiential continuum defined by a pair of polar terms. The content
of most complex verbal assertions, e.g., ‘Black bean soup is quite thick in consistency,’
can be reduced to the allocation of a concept to a scale:

BLACK BEAN SOUP    thick ___ : _X_ : ___ : ___ : ___ : ___ : ___ thin

The greater the intensity of strength of association, the more extreme the allocation
in one direction or the other. (2) Many different continua of judgment are
essentially equivalent and hence may be represented by a single dimension. Judgments
on a set of scales such as good-bad, fair-unfair, clean-dirty, kind-cruel,
noble-bestial, and so forth are very highly correlated and can be shown to represent
mainly a single, ‘evaluative’ factor. (3) A limited number of such continua,


90 Charles E. Osgood.
91 The experimental and theoretical background of this method is [...] nature and
measurement of meaning, Psychological Review [...] applications are described in a
subsequent mimeographed report.
92 The term is not to be confused with the General Semanticist’s structural differential,
which involves logical operations of a quite different sort.


representative of the dimensionality of meaningful judgments, can be used to define 
a semantic space within which the meaning of any concept can be specified. This 
indicates some variant of factor analysis as the basic methodology.

7.2.2.2. Factor analysis of meaning. A set of 20 familiar concepts (such as
LADY, SIN, FATHER, BOULDER, and RUSSIAN) was judged in randomized
order against 50 7-step scales defined by frequently used polar opposites by 100
college student subjects. The 50 × 50 correlational matrix obtained by correlating
the judgments on each scale with those on every other scale was factor analysed
by Thurstone’s centroid method. Four factors were extracted.


The first factor identified itself as evaluative by inspection of the scales which have high 
and pure loadings on it: good-bad, clean-dirty, tasty-distasteful, valuable-worthless, pleasant- 
unpleasant, and so on. The second factor identified itself fairly well as a potency variable: 
large-small, strong-weak, heavy-light, and thick-thin had high loadings on only this factor. 
The third factor appeared to be an activity variable: high and pure loadings were obtained 
for fast-slow, active-passive, and hot-cold. The fourth factor was small in magnitude and 
indefinite as to nature. Of the total variance in judgments available for apportionment 
(e.g., reliable variance), the three factors isolated account for about 60 per cent, of which 
more than half is evaluative. The remaining 40 per cent is presumably attributable to a large
number of specific (probably denotative) factors. 
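The step that precedes the factoring, correlating every scale with every other scale, can be sketched as follows. The data are synthetic and the three scales are hypothetical stand-ins for the fifty used in the study: two are constructed to share an evaluative component and the third is unrelated. The factor extraction itself (the centroid method) is not reproduced here.

```python
import random
from math import sqrt

def pearson(x, y):
    """Product-moment correlation between two equally long rating columns."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def correlation_matrix(judgments):
    """judgments: one row per judgment, one column per scale."""
    columns = list(zip(*judgments))
    return [[pearson(ci, cj) for cj in columns] for ci in columns]

random.seed(0)  # fixed seed so the synthetic example is repeatable
judgments = []
for _ in range(200):
    evaluation = random.randint(1, 7)
    # A second evaluative scale: the same judgment plus a little noise.
    noisy = min(7, max(1, evaluation + random.choice([-1, 0, 1])))
    size = random.randint(1, 7)  # unrelated, potency-like scale
    judgments.append((evaluation, noisy, size))

r = correlation_matrix(judgments)
```

In a matrix of this kind, scales belonging to the same factor show up as blocks of high intercorrelations, which is the pattern a factoring procedure then summarizes.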


7.2.2.3. Semantic profiles, distances, and structures. The factor analysis of
meaning is not an end in itself. Its purpose is to make possible the selection of a
minimum number of specific scales which, taken together, will give the maximum
coverage of the semantic space. Ideally, we should like to select one specific 
polar scale to represent each factor, this scale being maximally loaded on this 
factor and minimally on all other factors. In practice, of course, due to imperfect 
reliability, lack of ‘pure’ scales in the language and so forth, a small sample of 
scales representing each factor is used. 

When a group of subjects (1, 2, 3, ... m) rate a set of concepts (A, B, C,
... N) on the system of scales (a, b, c, ... n) which constitutes the semantic
differential, a cube of data is generated. Each cell in this cube represents, with 
a number from 1 to 7, the judgment of a particular concept against a particular 
scale by a particular subject. A single slice of the cube represents the complete 
data for a single subject—all of his judgments of a group of concepts on a group 
of scales. It is also possible to collapse the cube along the subject dimension, 
producing a single set of numbers (e.g., averages of subjects). The following 
operations can be applied to either individual or group (mean) data. 
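A minimal sketch of the cube of data, assuming a nested-list layout indexed as subject, then concept, then scale. The ratings are invented and the two helper names are hypothetical.

```python
from statistics import mean

# cube[subject][concept][scale] holds a rating from 1 to 7 (invented values).
cube = [
    [[1, 7], [2, 1]],  # subject 1: two concepts rated on two scales
    [[3, 5], [2, 3]],  # subject 2
]

def subject_slice(cube, subject):
    """One slice of the cube: all judgments made by a single subject."""
    return cube[subject]

def collapse_subjects(cube):
    """Collapse the subject dimension by averaging over subjects."""
    n_concepts = len(cube[0])
    n_scales = len(cube[0][0])
    return [
        [mean(subject[c][s] for subject in cube) for s in range(n_scales)]
        for c in range(n_concepts)
    ]

group_data = collapse_subjects(cube)
```

Both `subject_slice(cube, 0)` and `group_data` are concept-by-scale tables of the same shape, which is why the later operations apply equally to individual and group data.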


(1) Meaning of a concept. Table 1 represents a single slice of this cube, i.e., raw data for 
a single subject. For purposes of illustration, nine scales are shown, arranged in terms of 
loadings on the three factors already isolated. Low numbers refer to the polar terms to 
the left, high numbers to polar terms on the right. The profile of numbers in each column 
is one way of describing the meaning of that concept, within the limited coverage of the 
factors so far isolated. For this subject, WHITE ROSE BUDS are good, impotent, and 
passive; HERO is good, potent, and active; etc. These descriptions are admittedly gross, 
highlighting the connotative aspects of meaning and failing to catch many denotative 
aspects. A far more efficient way of representing the meaning of a concept is available, 
however, deriving from the mathematical logic of factor analysis. Given orthogonality of

TABLE 1

[Raw judgments (1 to 7) of a single subject for a set of concepts on nine scales. The
concept headings and cell values are not legible in this copy; only the scales survive.]

Factor   Scale   Left term (low)   Right term (high)
I        a       good              bad
         b       beautiful         ugly
         c       fresh             stale
II       d       strong            weak
         e       large             small
         f       loud              soft
III      g       active            passive
         h       tense             relaxed
         i       hot               cold


the factors (here, three in number), the meaning of any concept can be specified as a point
in the n-dimensional space defined by the factors (in this case, a solid, three-dimensional
space), this point being the intercept of the projections on each of the factors. Thus, the meaning
of WHITE ROSE BUDS can be defined as 1 7 7 (median positions on each of the three
factors in order), the meaning of HERO as 1 1 1, FATE as 5 4 6, and SLEEP as 2 5 7. Means
for a group of subjects, for each concept against each scale, can be computed and treated
in the same way, in which case the ‘cultural meaning’ of a concept is being specified. This
conception of meaning as a point in n-dimensional space (or a volume in the case of group
data where the variability is given) has both the advantage of economy in description and
of a mathematical rationale.
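The reduction of a nine-scale profile to a point in the three-factor space can be sketched as below, taking the median of the three scales that represent each factor. The individual ratings are invented; they are chosen only so that the medians reproduce the 1 7 7 quoted for WHITE ROSE BUDS. The scale names follow Table 1 as far as it is legible.

```python
from statistics import median

# Scales grouped by the factor they represent (evaluative, potency, activity).
FACTOR_SCALES = {
    "evaluative": ("good-bad", "beautiful-ugly", "fresh-stale"),
    "potency": ("strong-weak", "large-small", "loud-soft"),
    "activity": ("active-passive", "tense-relaxed", "hot-cold"),
}

def factor_point(profile):
    """Median position on each factor, in the order given in FACTOR_SCALES."""
    return tuple(
        median(profile[scale] for scale in scales)
        for scales in FACTOR_SCALES.values()
    )

# Invented ratings for one subject (1 = left-hand term, 7 = right-hand term).
white_rose_buds = {
    "good-bad": 1, "beautiful-ugly": 1, "fresh-stale": 2,
    "strong-weak": 7, "large-small": 6, "loud-soft": 7,
    "active-passive": 7, "tense-relaxed": 7, "hot-cold": 5,
}

point = factor_point(white_rose_buds)
```

Using group means instead of one subject's ratings as the input profile would give the ‘cultural meaning’ of the concept in the same three coordinates.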

(2) Difference in meaning. In semantic measurement we will often want to indicate the
degree of difference (or, conversely stated, similarity) in meaning, between two concepts
for the same individual or group, between two people or groups for the same concept, or
between two time points of measurement (e.g., change in the meaning of a concept for an
individual or group). The following formula is used here: $D = \sqrt{\sum d^2}$, where $d$ is the difference
between two concepts (or individuals) on a single scale. The relation of this measure to
the generalized distance formula in mathematics has the advantage of providing us with a
rationale quite compatible with the entire logic our methods have been following: under
certain specifiable conditions, D provides a direct index of the distance between two points
in the n-dimensional space defined by our factors. The chief conditions are that the factors
be orthogonal and that they be equally represented in the set of scales.
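As a brief sketch, the D measure is the ordinary Euclidean distance between two profiles. The function name is hypothetical. The example applies it to the three factor positions quoted earlier for HERO and SLEEP rather than to nine raw scale scores, so the result is not comparable with the entries of Table 2.

```python
from math import sqrt

def semantic_distance(profile_a, profile_b):
    """D = square root of the summed squared differences, scale by scale."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must be rated on the same scales")
    return sqrt(sum((a - b) ** 2 for a, b in zip(profile_a, profile_b)))

hero = (1, 1, 1)   # factor positions quoted in the text
sleep = (2, 5, 7)

d_hero_sleep = semantic_distance(hero, sleep)  # sqrt(1 + 16 + 36)
```

The same function serves all three comparisons named in the text, two concepts, two people, or two points in time, since each is a pair of profiles on the same scales.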

(3) Semantic structure. D also has the advantage of extreme computational simplicity.
Given a raw score matrix such as that shown as Table 1, one simply sums the squared differences
on each scale between each concept and every other concept, extracting the square
root of the sum. The complete operation on a small matrix like that shown for one individual
can be done with a small desk calculator and table of square roots in a few minutes. This
operation generates a D-matrix, such as that shown as Table 2 for these concepts. This
table gives the ‘distances’ of every concept from every other concept in equivalent units.
Since these distances are all relative to the same dimensions, they have the additional

TABLE 2

       A      B      C      D      E      F      G      H      I      J
A
B   13.4
C    2.05  12.77
D   12.77  12.04  13.27
E   12.41   8.30  12.12   7.14
F    4.90   9.38   4.12  11.53   9.17
G   13.78  13.64  14.11   8.61   7.35  12.57
H   11.66   4.24  11.36  10.15   8.60   8.00  12.33
I   13.08  12.61  13.49   1.41   7.14  11.87   2.24  11.18
J    9.27   5.10   9.49   9.85   8.25   6.2   11.75   3.46  10.54

advantage of ‘plottability’ within a space having the same dimensionality as the number 
of factors. In the present case, restriction to three factors yields a reasonably accurate
plot in three dimensions, e.g., a solid model which concisely represents all of these ‘dis- 
tances’ and provides an attractive way of demonstrating data. Although such pictorial 
representations are limited to three-factor systems, the mathematical model is good to
any number of dimensions. The smaller any distance in such a matrix, the more similar in 
meaning the concepts involved, e.g., D (HERO) and I (SUCCESS) above. 
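The construction of a D-matrix from a raw score matrix, and the search for the most similar pair of concepts, might be sketched as follows. The scores are invented (Table 1 is not legible in this copy), and the concept labels A to D here are hypothetical and do not correspond to those of Table 2.

```python
from itertools import combinations
from math import sqrt

# Invented raw scores: concept -> ratings on the same ordered set of scales.
scores = {
    "A": [1, 1, 2, 7, 6, 7],
    "B": [1, 2, 1, 1, 2, 1],
    "C": [5, 4, 5, 4, 4, 6],
    "D": [1, 2, 1, 2, 2, 1],
}

def d_matrix(scores):
    """Distance of every concept from every other concept (lower triangle)."""
    return {
        (a, b): sqrt(sum((x - y) ** 2 for x, y in zip(scores[a], scores[b])))
        for a, b in combinations(scores, 2)
    }

def most_similar(distances):
    """The pair of concepts separated by the smallest D."""
    return min(distances, key=distances.get)

distances = d_matrix(scores)
closest_pair = most_similar(distances)
```

With n concepts the matrix has n(n - 1)/2 entries, so ten concepts, as in Table 2, give 45 distances.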


The semantic differential is not a specific test form but a procedure. The 
scales used will differ from problem to problem, but are selected in terms of 
known factor composition. Whereas in the study of social attitudes scales such 
as fair-unfair, valuable-worthless, and clean-dirty may be chosen to represent the 
evaluative factor, in the study of aesthetic meanings equally evaluative scales 
such as pleasant-unpleasant, beautiful-ugly, and tasty-distasteful may be more 
appropriate. Nor is the method limited to either isolated stimuli or verbal 
stimuli; complex verbal signs can be studied as well as non-verbal signs (auditory 
patterns, cartoons, etc.). A number of applications have been made—to the 
study of attitude change, to changes in meaning during the course of psychotherapy,
to the measurement of identification, to the study of dream and political
symbolism, and even to some problems in aesthetics—and others are contem- 
plated. Here we are interested in possible applications to psycholinguistic 
problems. 

7.2.2.4. Application to some psycholinguistic problems. The problem of meaning 
is a central one in the area covered by psycholinguistics. To the extent that the
semantic differential provides a satisfactory index of even limited aspects of
meaning—chiefly connotative aspects at its present level of development—it
should be useful in a variety of psycholinguistic problems, some of which have
been indicated elsewhere in this report.

(1) Laws of word mixture. An interesting problem, for people in linguistics 
and the humanities as well as for psychologists, concerns how the meanings of 
combinations of signs relate to the meanings of the components. Are there 
analogies with the laws of color mixture? One law is that complementary colors 
(opposites) cancel each other toward neutral grey—will the combination of 
words of opposed meaning tend toward a meaningless result, for example, the 

meaning of A SUBTLE OAF? Another law of color mixture is that the hues of 
mixtures must always lie between those of primaries (e.g., red and green yield 
yellow, red and yellow an orange, etc.). Will the point in semantic space representing
A CAT-LIKE WRESTLER fall on a line between the points representing
WRESTLER and CAT-LIKE? In the color space, any mixture must fall on the 
line connecting the components—will this be true for ordinary word mixtures? 
Would STRONG POWER, perhaps, lie further away from the origin than either
of the components? From the color mixture analogy it also follows that any
mixture must be less saturated than its components. This is probably not true
for word mixture—a STURDY TREE would probably be further from the
origin than either of the components.

Enough illustrations have been given to indicate the general proposal here. 
We wish to determine if there are any general laws governing semantic combination.
The existence of a mathematical model (e.g., the semantic space about an
arbitrary origin defined by a set of factor coordinates in which any meaning is 
represented as a unique point) facilitates the statement of hypotheses and 
indicates the nature of the tests to be made. The general procedure would be 
to have the same subjects, at different times, differentiate various component 
words and combinations of these components, and then determine if any general 
statements can be made about the semantic results of combinations. There 
may be some indirect utility for linguistics here, for example on the problem of 
semantic units. Whereas the combinations POWERFUL MALE and MASCU- 
LINE POWER should fall on the line in semantic space defined by the locations 
of MALE (MASCULINE) and POWER (POWERFUL), and the combination 
DOGGY HEAT fall on the line defined by HEAT and DOG, this would not 
be true for the combination HOT DOG—indicating that this has become a
new semantic unit.
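Two of the hypotheses above can be stated as simple geometric tests, sketched here. The coordinates are invented and are taken relative to the origin of the semantic space; the tolerance and the function names are assumptions for illustration.

```python
from math import sqrt

def norm(vector):
    """Distance of a point from the origin of the semantic space."""
    return sqrt(sum(x * x for x in vector))

def lies_between(mixture, a, b, tolerance=0.25):
    """True if the mixture lies within tolerance of the segment joining a and b."""
    ab = [y - x for x, y in zip(a, b)]
    am = [m - x for x, m in zip(a, mixture)]
    length_sq = sum(c * c for c in ab)
    if length_sq == 0:
        return norm(am) <= tolerance
    # Position of the nearest point on the segment, clamped to its two ends.
    t = max(0.0, min(1.0, sum(p * q for p, q in zip(am, ab)) / length_sq))
    nearest = [x + t * c for x, c in zip(a, ab)]
    return norm([m - n for m, n in zip(mixture, nearest)]) <= tolerance

def more_extreme_than_components(mixture, a, b):
    """True if the mixture is further from the origin than either component."""
    return norm(mixture) > max(norm(a), norm(b))

# Invented coordinates (evaluation, potency, activity).
wrestler, cat_like, cat_like_wrestler = (0.5, 2.5, 1.0), (0.5, -0.5, 2.0), (0.5, 1.0, 1.5)
hot, dog, hot_dog = (0.0, 0.0, 2.0), (1.0, 1.0, 1.0), (2.5, -1.0, 0.0)
sturdy, tree, sturdy_tree = (1.0, 2.0, 0.0), (1.0, 1.0, 0.0), (1.5, 2.5, 0.0)
```

On these invented values the first test accepts the mixture for CAT-LIKE WRESTLER and rejects it for HOT DOG, and the second is satisfied by STURDY TREE; whether real judgments behave this way is the empirical question the text raises.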

(2) Quantitative study of opposition. Some 20 or more years ago C. K. Ogden
brought out a delightful little book on the nature of opposition. In it he analysed
on logical grounds various types of opposites. The semantic differential seems
to offer a quantitative way of approaching the same problem. Complete or
logical opposites should yield perfectly reciprocal profiles, i.e., determine points
in the semantic space which are at equal distances on opposite ends of a single
straight line through the origin. FRESH (good, active, and somewhat strong) and
STALE (bad, passive, and somewhat weak) might be an example. What might be
called psychological opposites would be polar with respect to one dimension but
fall at the same locus on others, i.e., define a line [...] displaced from the origin.
An example [...] GOD (good, strong, and active)
and DEVIL (bad, strong, and active). A [...]
contrasts, which are not really opposites at all. Examples would be [...]
and SAILOR, HAND and FOOT, DOG and CAT—in such cases the points in
semantic space are not in opposite directions of the space, in all probability,
but rather close together. They display strong linguistic contrast, in that they
constitute minimal pairs which have nearly identical contextual samplings
(i.e., occur in the same environments). This type of measurement would also 
permit definition and comparison of degrees of opposition among a group of 
possibilities (as found in standard dictionaries of similars and opposites). 
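
The three kinds of pairs distinguished above can be told apart mechanically once profiles are available. The sketch below is only an illustration in Python: the numeric scores, the tolerance `tol`, and the function name `classify_pair` are assumptions introduced here, with scores centred on the origin in the order evaluation, potency, activity. The concept paired with DEVIL is entered only by the description the text gives for it (good, strong, and active).

```python
import math

def classify_pair(a, b, tol=0.5):
    """Rough classification of a word pair from two factor profiles.

    a, b: tuples of factor scores centred on the origin (0 = neutral).
    tol:  hypothetical tolerance for treating two scores as equal.
    """
    # Reciprocal profiles: every score of b is the negative of the score of a.
    if all(abs(x + y) <= tol for x, y in zip(a, b)):
        return "logical opposites"
    polar = [i for i, (x, y) in enumerate(zip(a, b)) if x * y < 0 and abs(x + y) <= tol]
    same = [i for i, (x, y) in enumerate(zip(a, b)) if abs(x - y) <= tol]
    # Polar on exactly one dimension, same locus on all the others.
    if len(polar) == 1 and len(same) == len(a) - 1:
        return "psychological opposites"
    if math.dist(a, b) <= 2 * tol:
        return "contrasts (close together)"
    return "unclassified"

# Hypothetical scores consistent with the verbal descriptions in the text.
FRESH, STALE = (2.0, 1.0, 2.0), (-2.0, -1.0, -2.0)
GOOD_STRONG_ACTIVE, DEVIL = (2.0, 2.0, 2.0), (-2.0, 2.0, 2.0)
DOG, CAT = (1.0, 0.5, 1.0), (1.0, 0.0, 0.5)

for name, pair in [("FRESH/STALE", (FRESH, STALE)),
                   ("(good, strong, active)/DEVIL", (GOOD_STRONG_ACTIVE, DEVIL)),
                   ("DOG/CAT", (DOG, CAT))]:
    print(name, "->", classify_pair(*pair))
```

The distance between the two points, or the angle between their vectors, could serve in the same way as a graded index of degree of opposition.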

(3) Onomatopoeia. Another problem of interest to the linguist as well as the 
psychologist and student of aesthetics concerns the possible meanings associated 
with speech sounds. There seems to be almost universal agreement among 
linguists that sound symbolism is a myth, or better that it is always a function 
of particular associations with meaningful words in a given language. Thus the 
common association of /i/ with smallness and thinness is simply a cultural 
accident. This attitude doesn't jibe with the existing evidence, such as it is— 
the studies of Sapir, Stanley Newman, Fischer-Jørgensen, and a few others are 
all positive in trend, although inadequate in controls. The proposed investiga- 
tion would employ a complex analysis of variance design in which the same 
phones (initial, medial, and terminal) would be made to occur in various phonetic 
environments in nonsense syllables for English and some other language, say 
Spanish. These materials would be tape recorded by a trained phonetician; 
subjects would use the semantic differential to register the connotative meanings 
of each sound. Question 1. Do speakers of English show consistent deviations 
in meaning as a function of a particular phone, regardless of the total sound 
context (e.g., does /s/ tend to make a nonsense combination smaller, thinner, 
and more active regardless of context)? Question 2. Do listeners who speak an- 
other language show correlated effects, e.g., are the onomatopoeic effects com- 
mon cross-linguistically? Question 3. Can these onomatopoeic effects be related 
to general synesthetic phenomena, e.g., will higher pitch vowels tend to be 
smaller and brighter than low pitch vowels? 

(4) Cross-cultural generality of semantic factors. An underlying theoretical 
question is the generality of semantic factors cross-culturally and cross-linguis- 
tically. If it could be shown that essentially the same factors operate in the 
meaningful judgments of all people, regardless of culture, race, and language— 
that they all differentiate concepts in terms of evaluation, potency, activity, and 
so forth—then many new approaches to cross-cultural communication and 
understanding would be opened up. One could, for example, do a much better 
job of explaining his own nation’s values, interests and motives if he could choose 
concepts and qualifiers in the other language which he knew had corresponding 
significances. 

(5) A functional dictionary of connotative meaning. Almost daily professional 
people who deal with communication (teachers, writers, journalists, propa- 
gandists, advertisers, politicians, and so on) are faced with the problem of 
selecting words to convey their intentions to others. The ordinary dictionary 
provides little information on connotations, and Roget’s Thesaurus is not only 
indefinite in this respect but is also prone to projection on the part of the user 
(e.g., he selects in terms of his private meanings rather than those typical of his 
audience). The work done so far on the measurement of meanings of adjectives 
and nouns encourages us to believe that at some later time, when the factor 
system has been better stabilized, it will be feasible to construct a functional 
(e.g., representative of word-meanings as they are used by people) and quantized 
(e.g., presented in terms of quantitative units provided by the differential) 
dictionary of connotative meanings. 

The operations of semantic differentiation allow us to indicate the meaning 
of any verbal concept as a point in an n-dimensional space; this point can also 
be defined by a series of index numbers representing locations along the set of 
factors. Using a sample of subjects, carefully drawn to be representative of the 
population, the point (and the index numbers) would represent the mean loca- 
tion of the sample and another number would indicate the dispersion of the 
individual points about this mean (i.e., the variability in meaning of this concept). 
The concepts in this functional dictionary would be arranged in double classifi- 
cation: once in ordinary alphabetical arrangement (e.g., NOBLE: 134xxxx, 
indicating this word to be extremely favorable, somewhat potent, neither active 
nor passive, and so on for additional factors) and once according to location in 
semantic space (e.g., under 134xxxx in a distribution running the gamut from 
7777777 to 1111111 one would find NOBLE along with all other words having 
the same connotative meaning, and in their neighborhood would be found 
similar meanings). 

There are several ways in which such a functional dictionary could be used. 
For example, one could look up a particular noun, such as WARRIOR, and 
find a group of adjectives having closely similar connotative meaning, or one 
could find another group of adjectives similar in all respects except one, say 
evaluation (e.g., if WARRIOR were 322xxxx, one might look under 722xxxx 
and find words like vicious, savage, and barbaric). In other words, one could move 
in any desired direction from a given point in the space and find appropriate 
words. Wishing to choose an adjective which accurately represents for other 
people one’s own meaning for a concept, one could quickly differentiate his own 
meaning for the concept and then look into the functional dictionary under the 
index thus derived for words having appropriate connotation. An interesting 
derivative of this work could be study of semantic isoglosses (e.g., geographical 
boundaries across which the meanings of common words shift) in much the same 
manner that linguists have studied phonemic isoglosses. 
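
The double classification and the lookups just described can be sketched as a small data structure. This is a minimal illustration in Python, assuming only the first three factors (evaluation, potency, activity) and a 1 to 7 scale. The index numbers for NOBLE and WARRIOR and the placement of vicious, savage, and barbaric follow the examples in the text; the entries `bold` and `gentle`, all function names, and the `radius` parameter are hypothetical.

```python
from collections import defaultdict

# Alphabetical classification: word -> index numbers (evaluation, potency, activity).
ENTRIES = {
    "NOBLE": (1, 3, 4),
    "WARRIOR": (3, 2, 2),
    "vicious": (7, 2, 2),
    "savage": (7, 2, 2),
    "barbaric": (7, 2, 2),
    "bold": (3, 2, 2),      # invented entry
    "gentle": (2, 5, 5),    # invented entry
}

# Second classification: location in semantic space -> words found there.
BY_LOCATION = defaultdict(list)
for word, code in ENTRIES.items():
    BY_LOCATION[code].append(word)

def shifted(code, factor, value):
    """Index obtained by changing one factor and holding the others constant."""
    new = list(code)
    new[factor] = value
    return tuple(new)

def neighbors(code, radius=1):
    """Words whose index differs from `code` by at most `radius` on every factor."""
    return sorted(
        w for w, c in ENTRIES.items()
        if all(abs(x - y) <= radius for x, y in zip(c, code))
    )

warrior = ENTRIES["WARRIOR"]
unfavorable = sorted(BY_LOCATION[shifted(warrior, factor=0, value=7)])
similar = neighbors(warrior)
print("same potency and activity, unfavorable evaluation:", unfavorable)
print("similar in all respects:", similar)
```

A full dictionary would also store the dispersion of individual judgments about each mean location, which this sketch omits.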


7.3. Information Transmission by Language Messages 


Language is the chief ingredient of the cement which holds societies together 
over both space and time. One of the prime goals of the student of communica- 
tion is therefore an understanding of the way in which information (in the 
colloquial sense) is transmitted from one individual to another via language 
messages. Measurement of information transfer in this sense may make use of 
information theory. One may estimate the reduction in uncertainty of the be- 
havior of a human destination as a function of messages received from another 
human source. The reduction in uncertainty (or increase in predictability) 
could be expressed as so many ‘bits’ of information (in the mathematical sense). 
This approach is most applicable when the codes used by source and destination 
are the same. When source and destination use different and equally arbitrary 
codes, however (that is, when we are dealing with information transfer in trans- 
lation situations), new problems are introduced and information theory con- 
ceptions require some extension. 


7.3.1. Information Transmission without Code Translation 


In section 2.3. of this report, one bit of information was defined as the amount 
of information needed to specify one of two classes of equally probable events. 
There are at least two possible modifications of this definition which allow us to 
measure the amount of information transmission in a language message. These 
modifications are discussed below and their usefulness evaluated. 

7.3.1.1. Information transmitted by language signals. The first modification of 
the definition of the unit of information is as follows: If each of two equiprobable 
language stimuli unequivocally elicits a different response, then each of the pair of 
stimuli transmits one bit of information. From this definition it is easy to proceed 
to a general equation for measuring information transmission in terms of such 
units. Let s be any of a set of language stimuli, S, each of which elicits a response 
r which is one of a set of responses, R. Let $p(s)$ and $p(r)$ be the probabilities of 
any s or r, and let $p_s(r)$ be the appropriate conditional probability. Furthermore, 
let us assume that these probabilities are unchanged throughout the period of 
measurement. 

This entire set of definitions permits us to treat language message stimuli 
as a set of input events, the responses to these stimuli as output events, and the 
receivers of these messages as communication channels. Thus it is possible to 
use the conventional measurement of information transmission, $I$, which will 
give the average amount of information transmitted by the set S. Using the 
symbols introduced above, $I$ becomes $I = H(R) - H_S(R)$. If responses to the 
stimuli are independent of the stimuli, then $H(R) = H_S(R)$ so that $I = 0$. If 
only one response is made to each stimulus (i.e., if for every s, only one 
$p_s(r) = 1$), then $H_S(R) = 0$ and $I = H(R) = H(S)$. 
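
The two limiting cases just stated can be checked numerically. The following Python sketch computes the transmitted information as the response entropy minus the conditional response entropy; the command and response labels are invented for illustration, and the function names are assumptions of this sketch.

```python
import math

def entropy(probs):
    """Shannon entropy in bits of a collection of probabilities."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

def transmitted_information(p_s, p_r_given_s):
    """H(R) - H_S(R) for stimulus probabilities p_s[s] and conditional
    response probabilities p_r_given_s[s][r]."""
    responses = {r for row in p_r_given_s.values() for r in row}
    p_r = {r: sum(p_s[s] * p_r_given_s[s].get(r, 0.0) for s in p_s) for r in responses}
    h_r = entropy(p_r.values())
    h_r_given_s = sum(p_s[s] * entropy(p_r_given_s[s].values()) for s in p_s)
    return h_r - h_r_given_s

# Two equiprobable commands (hypothetical labels).
p_s = {"HALT": 0.5, "MARCH": 0.5}

# Each command unequivocally elicits a different response.
unequivocal = {"HALT": {"stop": 1.0}, "MARCH": {"step off": 1.0}}
# Responses are independent of the command given.
independent = {"HALT": {"stop": 0.5, "step off": 0.5},
               "MARCH": {"stop": 0.5, "step off": 0.5}}

i_unequivocal = transmitted_information(p_s, unequivocal)
i_independent = transmitted_information(p_s, independent)
print(i_unequivocal, i_independent)
```

In the first case each stimulus transmits one bit, in agreement with the definition of the unit; in the second case nothing is transmitted.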

An example of the kind of situation to which this sort of information measure- 
ment is appropriate is that of military drill in which language stimuli serve as 
stimuli for a set of immediately executed responses. In such situations, the 
language stimuli serve as signals in the same manner that traffic lights or the 
dials on a control panel serve as signals. However, many language messages do 
not serve as signal stimuli in this sense but rather serve to modify behavior to 
stimuli other than those in the message itself. Another modification of the defini- 
tion of the unit of information measurement which may be supplied in this kind 
of situation is given below. 

7.3.1.2. Information transmitted by conventional language messages. The second 
modification of the definition of the unit of information measurement is as 
follows: If the receivers of a message make two responses randomly to each of two 
equiprobable stimuli before message reception but make only one of these responses 
to the stimuli after message reception, then one bit of information has been trans- 
mitted by the message. Let S and R be classes of extra-message stimuli and re- 
sponses and let s and r be any members of these classes. Let $p(s)$ represent the 
probability of any s and assume that all $p(s)$ remain unchanged by transmission 
and reception of the message. Let $p_s(r)$ be the pre-message conditional prob- 
ability of a response r to a stimulus s and let $p'_s(r)$ be the corresponding post- 
message probability. 

(Footnote to the heading of section 7.3.1: Kellogg Wilson.) 

Using the entire set of definitions introduced above, and marking post-message 
quantities with a prime, our measure of $I$ becomes 

$$I = H_S(R) - H'_S(R) = -\sum_{s} p(s) \sum_{r} p_s(r) \log_2 p_s(r) + \sum_{s} p(s) \sum_{r} p'_s(r) \log_2 p'_s(r).$$

If the message has no effect on behavior and $H'_S(R) = H_S(R)$, then $I = 0$. If 
the message eliminates the entropy of behavior so that behavior is completely 
predictable and $H'_S(R) = 0$, then $I = H_S(R)$, the pre-message behavioral 
entropy. In general, $I$ gives the amount of the entropy change produced by the 
message where the unit of change is the amount of entropy reduction associated 
with the development of a perfect discrimination from a random discrimination 
between two equiprobable stimuli. 
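
A short numerical illustration of this second measure follows, written in Python. It takes the difference between the pre-message and post-message conditional entropies of responses; the stimulus and response labels and all probabilities are invented for the purpose, and the function names are assumptions of this sketch.

```python
import math

def conditional_entropy(p_s, p_r_given_s):
    """Conditional response entropy: -sum_s p(s) sum_r p_s(r) log2 p_s(r), in bits."""
    return -sum(
        p_s[s] * sum(p * math.log2(p) for p in p_r_given_s[s].values() if p > 0)
        for s in p_s
    )

def message_information(p_s, before, after):
    """Entropy change produced by a message: pre-message minus post-message entropy."""
    return conditional_entropy(p_s, before) - conditional_entropy(p_s, after)

# Two equiprobable extra-message stimuli (hypothetical labels).
p_s = {"s1": 0.5, "s2": 0.5}

# Before the message, receivers respond at random to both stimuli.
before = {"s1": {"r1": 0.5, "r2": 0.5}, "s2": {"r1": 0.5, "r2": 0.5}}
# After a fully effective message, each stimulus has a single response.
after_perfect = {"s1": {"r1": 1.0}, "s2": {"r2": 1.0}}
# After a partly effective message, some uncertainty remains.
after_partial = {"s1": {"r1": 0.9, "r2": 0.1}, "s2": {"r1": 0.1, "r2": 0.9}}

i_perfect = message_information(p_s, before, after_perfect)
i_partial = message_information(p_s, before, after_partial)
i_none = message_information(p_s, before, before)
print(i_perfect, round(i_partial, 3), i_none)
```

The perfect case yields exactly one bit, the unit defined above; the partial case yields a fraction of a bit, and a message without effect yields zero.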

This measure has two characteristics which should be noted. 

(a) $I$ is dependent on the characteristics of the receivers and on the situation 
in which the set of $p_s(r)$’s and $p'_s(r)$’s are estimated. 

(b) $I$ is dependent on the pre-message behavioral tendencies of the receivers. 

Unlike the previous measure of $I$, which considers the probabilities of the 
elements in the message, this measure treats the message as a sort of unitary 
deus ex machina which somehow modifies response tendencies. This sort of 
treatment allows us to avoid estimation of the probabilities of the units of the 
message but at best only permits us to postpone the identification of the message 
units which produce behavioral change and the means by which this change 
occurs. Unfortunately, our present knowledge permits us to only speculate 
concerning these units and the nature of their effects. 


7.3.2. Information Transmission with Code Translation 

Translation is usually regarded as a tool which is improved only by experience 
and good judgment on the part of translators. The development of information 
theory techniques and electronic computers has created increasing interest in 
both theoretical and practical problems of translation, and it may be worth- 
while to analyse translation as a psycholinguistic process (see also section 6.2). 
Information theory provides a general model in which the translator, whether 
human or machine, is treated as a channel, the foreign or ‘from’ language (FL) 
is the input, and the translation or ‘to’ language (TL) is the output. (For prob- 
lems of translation as currently viewed by some linguists, see now also Inter- 
national Journal of American Linguistics 20:4 [1954]). 

7.3.2.1. Need for addition to information model. Let us first imagine a 
machine translator between languages A and B. If whenever a given input 
state in A occurs (e.g., scanning of the German word Pferd) the machine reliably 
produces a specific output state in B (e.g., prints the English word house), 
conditional entropy would be zero and information transmission would appear 
to be perfect, except that it could be complete gibberish. What is accomplished 
here is actually a translation of the FL into some one of a near infinity of possible 
TL’s, but not English. We would like our machine translator to produce with 
equal consistency the output horse when Pferd occurs, and this is only a minimal 
requirement. This highlights a fundamental limitation of information theory: 
it has nothing to say about the correspondence between the states of two systems 
associated in a channel. The measured information transfer would be the same 
and maximal for a machine which translated Pferd/house and so forth at random 
as for a machine which translated Pferd/horse and so forth, as long as some 
specific output state was completely dependent upon each specific input state. 

(Footnote to the heading of section 7.3.2: Susan M. Ervin.) 

Figure 21 may help to make this situation clear, as well as the kind of exten- 
sion needed. Some SOURCE, perhaps the writer of a book or perhaps the reporter 
of certain NON-MESSAGE EVENTS, encodes in MESSAGE SYSTEM A 
(FL); the role of the translator, human or machine, is to decode these signals 
and then encode non-corresponding but equivalent signals in MESSAGE SYSTEM 
B (TL); this translated message may then be decoded by some DESTINATION 
and it may influence his behavior with respect to certain NON-MESSAGE 
EVENTS. The crux of the problem lies in the phrase, ‘non-corresponding but 
equivalent’ above. We must take as given the fact that the message events in 
two languages cannot be corresponding in any physical sense (except for oc- 
casional cognates and the like) but can be equivalent in the sense that they are 
associated with corresponding states in source and destination systems. 

The particular physical events which, in the message code, come to represent 
semantic states or non-message events are completely arbitrary; on the other 
hand, given the similarity of human organisms and the generality of learning 
principles, one would expect considerable correspondence in semantic states 
(e.g., the behaviors of both German and English speakers to the object HORSE 
may be similar and hence the representational processes associated with Pferd 
in the one case and horse in the other could be similar). In other words, equivalence 
of events in FL and TL is defined by correspondence of those events in source 
and destination which are associated with the FL and TL events respectively. 
Input Pferd is equivalent to output horse in translation because the semantic 
intention in the German source encoding Pferd corresponds closely to the se- 
mantic significance in the English receiver decoding horse. This analysis ad- 
mittedly throws the gates wide to the demon ‘meaning’ (with all its fuzziness 
and [word illegible in source]), but to rule it out seems also to rule out by definition the 
central problem. 

[Figure 21. Diagram not recoverable from the source. Surviving labels: MESSAGE SYSTEM, DESTINATION, NON-MESSAGE EVENTS; ‘potentially corresponding systems.’] 

The necessary addition to information theory measures can only be suggested 
here. The following notions may be useful in describing the translation process: 


Correspondence between any two systems is the proportion of the states in one that are 
identical with states of the other. 

Stability of a system (e.g., translator) refers to the reliability with which it transforms in- 
put states into predictable output states. 


$$S = H(O) - H_I(O)$$

This measure is equivalent to that of information transfer in section 7.3.1.1. and indicates 
the degree to which states in TL are predictable from states in FL, regardless of their 
translation equivalence. 

Translation-equivalent states of two message systems are those associated with corresponding 
states in sources (using one language) and destinations (using the other language). The 
rules for specifying such correspondences would, of course, need to be made explicit. 

Fidelity of a system (e.g., translator) is the validity with which it transforms input states 
into equivalent output states. 


$$F = \sum_{m,n} p(m, n = m)$$

where m is any one of the states of the FL and n is any one of the states 
of the TL. p(m, n = m) is simply the probability of a corresponding state of TL occurring 
with a given state, m, of FL. This is not an information theory measure and does not 
yield an estimate of information transfer in bits; rather, it is a simple percentage meas- 
ure—the proportion of all input events which result in equivalent output events, as these 
have been defined. 
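
The difference between stability and fidelity can be illustrated with the Pferd example of the preceding section. The Python sketch below is offered under stated assumptions: stability is read as output entropy minus the conditional entropy of outputs given inputs, fidelity as the proportion of input events whose output is a listed equivalent, and the equivalence table (including the added pair Haus/house) is invented for illustration.

```python
import math
from collections import Counter

def entropy(counts):
    """Shannon entropy in bits of a Counter of event frequencies."""
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values() if c)

def stability(pairs):
    """Output entropy minus conditional output entropy, from (input, output) pairs."""
    outputs = Counter(o for _, o in pairs)
    by_input = {}
    for i, o in pairs:
        by_input.setdefault(i, Counter())[o] += 1
    n = len(pairs)
    h_cond = sum(sum(c.values()) / n * entropy(c) for c in by_input.values())
    return entropy(outputs) - h_cond

def fidelity(pairs, equivalents):
    """Proportion of input events that result in an equivalent output event."""
    return sum(1 for i, o in pairs if o in equivalents.get(i, ())) / len(pairs)

# Hypothetical rules of translation equivalence.
EQUIVALENTS = {"Pferd": {"horse"}, "Haus": {"house"}}

# Two perfectly consistent machines, one faithful and one scrambled.
faithful = [("Pferd", "horse"), ("Haus", "house")] * 5
scrambled = [("Pferd", "house"), ("Haus", "horse")] * 5

for name, record in [("faithful", faithful), ("scrambled", scrambled)]:
    print(name, "S =", stability(record), "F =", fidelity(record, EQUIVALENTS))
```

Both machines are equally stable, since each output is fully predictable from its input; only the fidelity measure separates them.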


The specification of states of message systems which are correlated with states 
of encoders and decoders (problem of units), establishing criteria of stability, 
establishing criteria of fidelity, and the treatment of non-equivalent states are 
all problems which arise no matter what kind of analysis of the translation process 
one chooses and no matter whether one deals with human or machine translation. 
The following discussion attempts to highlight certain of these problems, but 
does not provide any definite answers to them. 


7.3.2.2. Specification of states of message systems. (1) The size of units. Since 
the establishment of units of equivalence must be done on the basis of meaning, 
the choices of units to be used in describing states of the systems cannot be 
arbitrary. There may be situations in which it would be appropriate to deal 
with morphemic units, or utterances or sequences of utterances as units. For 
example, the following two utterances have a correspondence of meaning even 
though the smaller units do not correspond: ‘The barn door is locked’ = ‘Pre- 
cautions have been taken.’ The size of the unit used in establishing correspondence 
may restrict the kind of operations possible in measuring stability and fidelity; 
the procedures one would use to establish meaning equivalence for two mor- 
phemes might not be the same as those used in measuring equivalence for two 
utterances. 

(Footnote to ‘notions’ in section 7.3.2.1: Terminology and definitions are from Osgood and Wilson, A general model of the com- 
munication process, unpublished mimeographed paper.) 

(2) Simultaneous states. As soon as one attempts to establish correspondence 
between two utterances it becomes clear that there may be complex interrela- 
tions between morpheme choice, order, grammatical patterns, stress, and pitch. 


For example, in the following sentences changes in English stress and intonation cor- 
respond to changes in French intonation and constructions. [The stress and intonation 
marks printed on the example sentences are not legible in the source.] 

a  Is patriotism a virtue?      a'    Le patriotisme est une vertu? 
                                a''   Le patriotisme est-ce une vertu? 
                                a'''  Est-ce que le patriotisme est une vertu? 
b  Is patriotism a virtue?      b'    Est-ce le patriotisme qui est une vertu? 
c  Is patriotism a virtue?      c'    Est-ce une vertu que le patriotisme? 
d  Is patriotism a virtue?      d'    Est-ce que le patriotisme est une vertu? 


English sentences a and b differ only in intonation patterns; in French the difference is 
one of grammatical construction. The difference between b and c is chiefly one of stress, 
but b' and c' differ again in grammatical construction. On the other hand, the French a''' 
and d' differ only in intonation, but the corresponding English a and d differ both in stress 
and intonation. It is possible to set up rules for this set of translations, but the description 
of states must simultaneously include stress (for English), intonation, and grammatical 
construction pattern if it is to handle the correspondences adequately. Some of the initial 
work on this general problem has already been done by those studying machine translation. 


7.3.2.3. Establishing criteria of stability. The stability measure could be applied 
to the same input at different points in time, or to input as translated by different 
translators—in other words, it can be used either as a measure of test-retest 
reliability or as a measure of intercoder reliability. 

In either case it is necessary to decide how large a class of events shall be 
considered the same event for purposes of scoring stability. For example, the 
same input event may result in the following output events: ‘crimson,’ ‘red,’ 
and ‘blue.’ These might be considered three different events, two, or perhaps 
even one. Even in a translation situation in which the alternative possible out- 
put events are arbitrarily limited there will be a problem of degrees and kinds 
of meaning difference between the possible output events. A stability measure 
which is equally sensitive to trivial and to gross differences may prove to be less 
useful than one which takes into account semantic relations between output 
events. Unfortunately, although linguistic analysis may help to classify some 
differences (for example, in grammatical structure of two utterances), it stops 
short of the semantic evaluation which is necessary in order to deal with degrees 
of difference. Some other methods are necessary in dealing with this problem. 
It would probably be advisable at this point to set up an experiment with limited 
output alternatives and develop some provisional classifications. It may be 
found that for this kind of work even a crude method of meaning measurement 
is adequate. 
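
How much the scoring of stability depends on the choice of event classes can be seen in a small computation. The Python sketch below uses an invented record of ten translations of one input event and three hypothetical classification schemes; it reports the residual entropy of the output events under each scheme, where a lower residual entropy means a higher stability score.

```python
import math
from collections import Counter

def output_entropy(outputs, classes):
    """Entropy in bits of the output events after mapping each output to its class.

    classes maps an output to the class it is scored as; outputs that are not
    listed count as their own class.
    """
    counts = Counter(classes.get(o, o) for o in outputs)
    n = len(outputs)
    return -sum(c / n * math.log2(c / n) for c in counts.values())

# Hypothetical record: ten translations of the same input event.
outputs = ["red"] * 5 + ["crimson"] * 3 + ["blue"] * 2

schemes = {
    "three events": {},
    "two events": {"crimson": "red"},
    "one event": {"crimson": "color", "red": "color", "blue": "color"},
}

residual = {name: output_entropy(outputs, mapping) for name, mapping in schemes.items()}
for name, h in residual.items():
    print(f"{name}: residual output entropy = {h:.3f} bits")
```

A scheme that weights classes by their semantic distance, rather than merging them outright, would be one way of taking degrees of difference into account.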


Among the methods which should be investigated further in this context are 
the following: 

(a) Rating by speakers. This is the simplest and possibly the most comprehensive method. 
It is not known how much agreement can be reached, or what the effects of various kinds 
of instructions to speakers asked to rate differences in meaning may be. 

(b) The semantic differential. This method has been applied to words in context and it 
may prove adaptable to the problems of meaning differences between utterances. Like the 
word association method, however, it uses a lexical item as the primary stimulus and may 
not be sufficiently sensitive to grammatical relations, for example. 

(c) Word associations. One method of establishing a measure would be to have subjects 
associate to words in a context, giving a series of associations to the same word. These 
associations could be listed in rank order of frequency, and the lists equated in length by 
omission of the least frequent items on the longer lists. A measure of likeness in lexical 
meaning of two items is the proportion of identical associations given to them. 
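
The word association measure in (c) is simple enough to state as a procedure. The following Python sketch ranks the associations to each item by frequency, trims the longer list to the length of the shorter, and returns the proportion of associations the two lists share. The association data are invented, and the handling of ties (first-given order) is an assumption of this sketch.

```python
from collections import Counter

def ranked(associations):
    """Distinct associations in descending order of frequency.

    Ties keep the order in which the associations were first given.
    """
    return [word for word, _ in Counter(associations).most_common()]

def association_likeness(assoc_a, assoc_b):
    """Proportion of identical associations after equating the list lengths."""
    a, b = ranked(assoc_a), ranked(assoc_b)
    k = min(len(a), len(b))
    if k == 0:
        return 0.0
    # Drop the least frequent items on the longer list, then count overlap.
    return len(set(a[:k]) & set(b[:k])) / k

# Hypothetical associations given by subjects to two output events.
to_red = ["blood", "fire", "blood", "rose", "fire", "blood", "stop"]
to_crimson = ["blood", "blood", "rose", "velvet"]

likeness = association_likeness(to_red, to_crimson)
print(f"likeness of 'red' and 'crimson': {likeness:.2f}")
```

Here 'stop', the least frequent item on the longer list, is omitted, and two of the three remaining associations are shared.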


7.3.2.4. Establishing criteria of fidelity. The crucial operation in establishing 
fidelity is actually demarking the equivalent units in input and output systems. 
Equivalent message events have been defined as those associated with identical 
(or, at least, highly similar) semantic states in encoder (FL) and decoder (TL). 
In order to establish such identity, there must be some common system to use 
for comparison; this might be called the yardstick system. In practice, the two 
yardstick systems used in translation are external referents and the mediating 
systems of bilinguals. A third system might consist of non-linguistic responses 
to the test events by monolinguals who speak the input and output languages. 


(a) The judgment of bilinguals. Asking a bilingual whether two utterances in different 
languages have the same meaning is a procedure which presents some of the same problems 
as the rating of meaning differences by speakers of one language. It is not clear what kind 
of meaning he is thinking of when he makes the judgment, nor is it known how much like- 
ness of meaning he considers necessary to justify a statement of identity. It may be that 
one translation conveys the emotional tone of the FL utterance more fully but that another 
is referentially more exact. Another difficulty is that it is not known how much the judgment 
of bilinguals regarding meanings may be distorted by the knowledge of two languages (see 
section 6.2). If, for example, it were shown that bilinguals and monolinguals typically have 
different semantic differential profiles for the same words, the judgment of bilinguals re- 
garding the connotative subtleties of many words would have to be questioned. 

(b) External definition by referents. Another method would be establishing common re- 
ferents through determining the range of objects and situations to which a term or set of 
terms can refer. This would show quite definitely when a term in one system has only par- 
tial referential overlap with one in another system, but there are at least three major 
drawbacks: It is necessarily laborious in any instance where many terms are involved. 
Furthermore, the method is most useful for objects and becomes less useful for abstractions 
and for emotional referents or complex referents. Finally, there are certain kinds of cor- 
respondence—such as certain propositional values or feelings about the proposition, which 
might be conveyed by style—which it would be almost impossible to demonstrate through 
this method. 

(c) The semantic differential. If a translatable form of the semantic differential could be 
obtained, using cross-culturally uniform dimensions, responses to the differential might 
be used as a test for equivalence of meaning. The differential would have certain advan- 
tages here. It could show whether items which are referentially equivalent according to 
ostensive definition have a shift in meaning in certain contexts. ‘Applesauce’ or ‘baloney’ 
have been given as examples of identifiable substances according to one usage, but in other 
contexts the terms take on entirely different meanings about synonymous with ‘nonsense.’ 
‘Applesauce’ according to the second meaning is not ‘sauce made of apples’ at all; the dif- 
ferential could show when a combination of morphemes gives rise to a meaning not predict- 
able from the component morphs and their rules of combination (see section 7.2.2.4 under 
‘word mixture’). The differential may also be sensitive to differences in meaning due to 
differences in the social situations or speakers associated with certain terms, although this 
fact remains to be demonstrated. For example, it might show that a ‘gat’ and ‘un pistolet’ 
are not equivalent semantically. 

(d) Responses to utterances. If utterances can be constructed so that some overt response 
is required to them, preferably non-linguistic, equivalence of response under comparable 
conditions could be considered a measure of meaning equivalence in the total utterance. 
This is a method which could be used to show differences in meaning arising from other 
sources than lack of fidelity in translation of lexical states alone. One of the difficulties in 
this procedure is that it requires the use of bilinguals so that response differences in the 
two languages are not due to culture differences. Using sufficiently large matched groups 
of subjects, one could test which of several versions of the TL produced responses most 
like those given to the FL utterance. 


7.3.2.5. Treatment of non-equivalent states. Assuming that one establishes 
equivalences between lexical elements, intonation, stress, and grammatical 
constructions there will still be certain non-equivalent states left over. These 
features of the TL which cannot be predicted from the FL, even when translation 
procedures have maximum fidelity, are of two kinds: One consists of states 
required by the rules of the code, such as the gender suffixes on Spanish adjectives. 
These states do not present a problem, because presumably they are completely 
predictable from knowledge of the equivalent states in the FL plus the rules of 
the TL code. Thus, if it is known that ‘girl’ corresponds to ‘muchacha’ it can 
be predicted that other forms bearing a specifiable relation to ‘muchacha’ will 
have feminine suffixes when Spanish is the TL. There are other kinds of states 
required by the code which are not entirely predictable from the rest of the 
utterance in some contexts. Such, for example, are English tense and number. 
These forms require some semantic information; they are not redundant, except 
insofar as one is dealing with agreement. If English is the TL, and one is using 
as an FL a language which does not specify number, the English code requires 
information which is not directly dependent on that supplied by the FL. If 
this information can be obtained from context by explicit rules there is no 
difficulty. However, it may have to be inserted by sheer guessing on the part of 
the translator. On the other hand, the FL may codify information which is not 
normally codified in the TL. 


Many instances of loss of information in translation probably arise from apparent 
stylistic awkwardness arising from an attempt to codify all the information conveyed by 
the FL. For instance, several Northwest Coast languages require specification of the source 
of information in making assertions. If this part of the input were translated into English 
output the result would not only be awkward but it would mean undue emphasis. Codifying 
of information in the TL which that language rarely codifies itself may be misleading. À 
translation from Korean which translated socjal status information fully might give a 


false impression of the intent or effect of the utterance. Certainly its effect on an English 




PSYCHOLINGUISTICS: A SURVEY OF THEORY AND RESEARCH PROBLEMS 191 


listener would not be the same as the effect of the input utterance on a Korean, which might 
be an indication in itself of distortion of meaning. 

One might consider these as rather precious points, but they are questions of judgment
which face any translator and hence suggest studies of factors in making such choices. Such
studies might vary the kind of instructions given translators, e.g., the kind of information
supplied to them about the intent of the speaker or the purpose of the translation or the
target audience. Since variation in the information contained in equivalent states may
lead to shifts in treatment of non-equivalent states, the message itself should also be
manipulated by the experimenter.


7.3.2.6. A note on back-translation. One of the most convenient techniques for 
pointing up lack of fidelity in a translation is retranslating the material into the
FL. Essentially this method repeats the translation process through a different
system. There is no reason, therefore, to expect that even a translation with
maximum fidelity can be retranslated to utterances identical with the initial
input. One reason for this change is that the probabilities of certain translations
may differ when a given language is input rather than output, due to differences
in frequency of usage of partially equivalent terms and differences in the variety
of referents for each term.


For example, assume in the following set of French and English terms that ‘box’ is used
for more terms than ‘boîte’; for many of the referents of ‘box,’ the French term would be
‘étui.’ In this system, the probability of occurrence of the term ‘box’ is therefore greater
than the probability of ‘boîte.’ Further, the term ‘portfolio’ has only one corresponding
French term, ‘serviette,’ but ‘serviette’ has three corresponding English terms. ‘Portfolio’
would occur less frequently than ‘serviette.’


[Figure 22. Input-output probabilities for translating the English terms ‘towel,’ ‘portfolio,’
‘case,’ and ‘box’ into the French terms ‘essuie-mains,’ ‘serviette,’ ‘étui,’ and ‘boîte,’ and
for back-translating these into English.]


The consequences on the back-translation can be seen from the input-output probabilities
in a minimum context arrangement. The highly frequent term ‘box’ is back-translated as
‘box’ 90 per cent of the times it occurs in FL, and in addition 10 per cent of the FL
occurrences of ‘case’ are back-translated as ‘box,’ resulting in an increase in frequency of
‘box.’ The infrequent ‘portfolio’ fares less well in back-translation and loses 60 per cent of
its frequency in FL. It appears in TL only 20 per cent of the times it occurred in the FL
input, in addition to being back-translated in place of 16 per cent of the occurrences of
‘towel’ and two per cent of the occurrences of ‘case.’
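
The arithmetic behind such frequency shifts can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the conditional probabilities below are hypothetical, chosen so that the three percentages quoted for ‘portfolio’ come out (20, 16, and 2 per cent), and the remaining values, like the names `FORWARD`, `BACKWARD`, `round_trip`, and `output_frequencies`, are invented for the example and are not taken from Figure 22.

```python
# Hypothetical conditional probabilities, for illustration only.
# FORWARD[e][f]  = P(French term f | English input term e)
# BACKWARD[f][e] = P(English output term e | French term f)
FORWARD = {
    "towel":     {"essuie-mains": 0.2, "serviette": 0.8},
    "portfolio": {"serviette": 1.0},
    "case":      {"étui": 0.9, "serviette": 0.1},
    "box":       {"boîte": 0.8, "étui": 0.2},
}
BACKWARD = {
    "essuie-mains": {"towel": 1.0},
    "serviette":    {"towel": 0.7, "portfolio": 0.2, "case": 0.1},
    "étui":         {"case": 0.9, "box": 0.1},
    "boîte":        {"box": 1.0},
}


def round_trip(forward, backward):
    """Return P(back-translated English term | original English term)."""
    table = {}
    for source, french_terms in forward.items():
        row = {}
        for french, p_forward in french_terms.items():
            for english, p_back in backward[french].items():
                # Sum over every French term that links the two English terms.
                row[english] = row.get(english, 0.0) + p_forward * p_back
        table[source] = row
    return table


def output_frequencies(input_freq, table):
    """Expected frequency of each English term after back-translation."""
    out = {term: 0.0 for term in input_freq}
    for source, row in table.items():
        for english, p in row.items():
            out[english] += input_freq[source] * p
    return out


# Assume each English term occurs 10 times in the original input.
INPUT_FREQ = {term: 10 for term in FORWARD}
TABLE = round_trip(FORWARD, BACKWARD)
OUTPUT_FREQ = output_frequencies(INPUT_FREQ, TABLE)

for term in INPUT_FREQ:
    print(f"{term:10s} input {INPUT_FREQ[term]:5.1f}  back-translated {OUTPUT_FREQ[term]:5.1f}")
```

With these made-up values ‘portfolio’ keeps about 3.8 of its 10 occurrences, a loss of roughly 60 per cent, while the total number of occurrences is unchanged; only its distribution over the terms shifts.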


Back-translations may reveal instances of only partial equivalence, but it is
necessary to have sufficient frequencies of occurrence to establish the probabilities.
In the above example, back-translation shifted the frequencies markedly
in three of the four terms, pointing to the possibility of only partial correspondence
in referents. Some guesses about the semantic relations between the terms
can be gathered by examining the probabilities within the matrices. If there are
traditional translation terms, back-translation cannot be used for revealing
non-equivalence, because it results in reciprocal biases in the two translation
processes. Thus, if ‘aimer’ were always translated as ‘love’ in English, and vice
versa, the partial equivalence between the two terms would never be revealed
in back-translation into French. Experimentation with back-translation using
frequency data thus may demonstrate its possibilities and limitations as a tool
for revealing partial or complete non-equivalence in translation.


7.4. Language, Cognition, and Culture⁹⁶


The relation of language to culture and cognition is of particular concern 
because of the determinative influence which language is supposed to exert on 
the other two variables. The writings of Benjamin Whorf and the others who 
have followed his example present this view in its clearest form, although the 
general thesis has a long tradition both on this continent with Franz Boas and
Edward Sapir and in Europe from Herder through von Humboldt to Ernst 
Cassirer, Leo Weisgerber, and Jost Trier. However, research in this area has 
been complicated by terminological and methodological confusion and failure 
to state hypotheses in testable form. Recently, this thesis has been studied 
critically by several groups of interested anthropologists, linguists, and psychologists.
The Wenner-Gren Foundation International Symposium on Anthropology⁹⁷
and the Conference of Anthropologists and Linguists⁹⁸ included the
Weltanschauung problem on their agenda, and a special Conference on
Ethnolinguistics⁹⁹ was held specifically to consider its ramifications and to develop a
program of research. The material presented at these meetings can neither be 
summarized nor adequately evaluated in the space available here, although it 
certainly influenced the discussions of the seminar.


7.4.1. Terminological Problems


A necessary prerequisite for research in this area is independent definitions 
of the critical terms. One common confusion, most typical of those who deal 
with the problem in ‘language vs. culture’ terms, is failure to distinguish lan- 
guage as a message system external to the particular user from language as 
states of its users, e.g., as meanings, attitudes and the like. We restrict the term 


96 Donald E. Walker, James J. Jenkins, and Thomas A. Sebeok.

97 A. L. Kroeber, ed., Anthropology today (Chicago, 1953).

98 C. Lévi-Strauss, et al., Results of the Conference of Anthropologists and Linguists,
Indiana University Publications in Anthropology and Linguistics, Memoir 8 (1953).

99 Conference on Ethnolinguistics, Chicago, March 23-27, 1953.




‘language’ to the former (messages) and use the term ‘cognition’ for the latter
(states of its users). Within this restriction of the term ‘language’ there is still
another confusion, between what has come to be called langue (the system or
codification) and parole (the particular manifestations or instances of language
behavior). This comes out clearly in statements such as “anything can be
expressed in any language, but the structure of a given language will favor certain
statements and hinder others.” For the most part (e.g., in Whorf’s work), the
question has centered about the relation of systems of codification (‘langue’) to
cognition and culture, and we accordingly restrict our usage of the term
‘language’ to mean codification. The term ‘culture’ seems too broad to be dealt
with successfully, at least at the research level. For our own purposes, therefore,
we substitute the term ‘behavior,’ by which we mean overt activities with 
respect to social and physical objects which may be to varying degrees shared 
with other members of a language community. 


If language were to be treated as somehow independent of culture, it would be necessary 
to develop experimental situations which are wholly ‘culture-free’ in which we could ob- 
serve behavior. However, the vast literature criticizing these attempts is a demonstration 
of the pitfalls which await the unwary here. The search for universally familiar or unfamiliar
test materials is only a beginning. Culturally conditioned differences in motivation,
personality, and so forth immediately interject themselves. The dilemma is made clear by
considering the two courses of action available: first, one may make an a priori judgment that
an experimental situation is culture-free—but the history of such attempts indicates that 
no two authorities ever agree. Secondly, one may try to demonstrate empirically that a 
situation is culture-free. This involves comparing across a large number of cultures and 
demonstrating lack of difference in the behavior elicited in this situation—but such experi- 
mental situations would be useless from that point on, since the ultimate objective is to 
study differences between cultures. In other words, it seemed to the seminar that the notion 
that ‘language is independent of culture’ is patently untestable.


We may rephrase the general issue here as follows: we wish to set up experimental
situations in which relations between codification, cognition, and behavior
can be studied. By codification we will mean all those aspects of speech behavior
which are forced upon the individual speaker by the rules of his language,
infringement of which results in defective communication. Codification must
be thought of as including the phonemic, morphemic, and syntactic structure of
a given language, as well as the lexicon. If I am to communicate about colors I
must employ the color-terms available in the language, and languages will
differ in how their color terminology ‘carves up’ the physical spectrum; I must
also follow the grammatical rules by which lexical items are compounded in the
language. By cognition we will mean those representational processes in language 
users whereby certain stimulus patterns (signs) come to stand for other stimulus 
patterns (significates). Cognitive processes will be functioning in such total 
activities as perceptual organization, recognition, retention, thinking, concept 
formation, and problem solving. By behavior we will mean nothing more than 
these total activities—of perceptual organization (e.g., sorting colored yarns 
into discrete piles), recognition (e.g., pointing to those color patches previously 




shown amongst a larger number), and so forth. One of the major hypotheses 
under test, of course, is that these performances will vary as a function of
codification of language.


7.4.2. Methodological Problems


The human infant enters this world at some place on its surface and finds
himself among adults who have certain stable ways of behaving, toward physical 
objects and in social situations and who use a language which has a certain 
codification. Gradually he learns how to behave like others do, i.e., learns the 
culture, and in doing so simultaneously develops elaborate systems of cognitions 
(ways of perceiving, meanings, associations, attitudes, and so forth), and also 
simultaneously he learns the language, associating auditory stimulus patterns 
with cognitions (decoding) and cognitions with vocal behaviors (encoding), 
both according to the rules of the code. Given this complete interweaving of 
culture, cognition, and language in the course of development, it is not surprising 
that social scientists later find it difficult to disentangle the threads. The greatest 
single pitfall in the way of research in this area is thus circularity of inference. 
“People in different cultures have different ‘world views’ because their languages 
differ (e.g., the Hopi have different time conceptions because their language 
uses a different tense system). How do we know their ‘world views’ differ? 
Why, because they use language differently!” To escape from this tautological 
trap, the independent and dependent variables, language structure and ‘world 
view,’ must be independently measured.


(1) Independent indices. It is possible to describe similarities and differences in codifica- 
tion of languages almost completely independent of the cognitions and behaviors of their 
users. This is the special technical job of descriptive linguists, and their only dependence 
upon the cognition-behavior of informants is solicitation of judgments of same or different 
in meaning. It is not so easy to distinguish cognitions from behavior. As a matter of fact, all 
the psychologist or anthropologist can observe is situations and behaviors—cognitive states 
are always matters of inference. However, there are reliable rules for making such infer- 
ences and it is possible to experimentally segregate the behavioral indices of cognitions from 
the behavioral performances that serve as the dependent variables in concrete research situations. 
This will be evident in the design of some of the experiments which follow. Of course, in 
many cases it is not necessary to make a sharp distinction between cognition and behavior; 
it may be sufficient to demonstrate that behavior-as-dependent-upon-cognition is influenced 
by codification. 

(2) Fallacy in translation methods. A method frequently used by Whorf and others to
‘prove’ that differences in ‘world view’ stem from differences in codification is to translate
some utterance from one language quite literally and in most general, abstract terms into
another, usually English. Thus we find what we would call a ‘dripping spring’ being
translated from Apache as ‘water, whiteness moves downward,’ and implications for ‘world
view’ or ways of perceiving are drawn from the differences in English. Lenneberg has
criticized this procedure on several grounds.¹⁰⁰ Indeed, if we were to translate the English
statement a/drip(p)/ing/spring into its most literal and abstract terms, we would come out
with something like “An instance of the general class, characterized by liquid falling in
small, natural segments, process on-going, eruption of water,” and most speakers of English
would fail to recognize their own ‘world view’! Another limitation of the translation
method is that it distorts the significance of metaphors, with which languages abound but
which have lost their literal significance entirely. What would the Apache ethnolinguist
conclude about the English ‘world view’ from literal translations of ‘in the face of,’
‘breakfast,’ ‘beforehand,’ and the like?

100 Language 29. 463-71 (1953).

(3) Intra-cultural and cross-cultural designs. The Weltanschauung problem seems to
require experimental designs which draw comparisons across cultures, and most of the
research (or perhaps better, observation) has been of this sort. But this is not necessarily
the case. As will be seen below, many of the specific hypotheses which derive from the
general thesis that language affects thought and culture can be as adequately tested within a
single language-culture community. If it can be shown that the way a single language
codifies continuous sensory experiences affects the behavior of its users with respect to these
sensory experiences, this certainly makes more feasible the broader notion that
between-culture/between-language differences in codification should affect behavior. And, as a
matter of fact, intra-cultural designs usually offer more stringent controls than
cross-cultural designs, because of the difficulty in the latter case of disentangling ‘cultural’
variables from codification per se.


7.4.3. Restatement of Hypotheses and Concrete Research Designs


In the most general sense, Whorf assumed that people in different cultures 
perceive the world differently and drew his data for this assertion chiefly from 
differences in language. What is asserted beyond this point is hard to come to 
grips with. Sometimes it seems to be implied that people see the world differently 
because they have different languages and sometimes it appears that people 
perceive the world differently and this is merely correlated with language differences.
Available evidence seems to justify the statement of correlation, but the causal 
direction is by no means clear. Obviously, what one is actually claiming is of the 
utmost importance in any serious attempt to deal with the Weltanschauung 
problem experimentally, and the following statements try to formulate an inter- 
related set of hypotheses in testable form. Illustrative experiments, both avail- 
able and proposed, are described under each hypothesis. 

7.4.3.1. Cognition and behavior. Although not directly involved in the Whorf
view, since nothing here is said about language per se, an assumption underlying
all of this work is that cognitive states are determinants of overt behavior;
or, more generally, that ways of perceiving and conceiving the world affect
behavior norms toward physical objects and in social situations (culture). 
The reverse relation must also be stated—that behavior influences cognitions— 
and this, in the more general form, is the notion that people in different cultures 
will view the world differently, quite apart from and beyond language factors 
per se. 

Hypothesis I. The cognitive states associated with stimuli influence the responses
made to these stimuli. The independent variable here is cognitive state, which 
must be indexed independently of the measurement of the overt behavior to 
test situations, which is the dependent variable. The type of cognitive state 
involved in a given experiment may be ‘way of perceiving,’ meaning, attitude, 
or so on, and the behavior observed may be solution of problems, simple bodily 
movement, or even ‘cultural response’ such as normative behavior to an eclipse 
or lack of rain. At the common-sense level, this hypothesis is trite. Obviously, 



how a man reacts to a situation—by fighting, by running, by spending his 
money, by praying—depends upon what the situation means to him; obviously, 
how a member of any culture responds to a Rorschach test depends upon how
he perceives it, upon what significance the shapes have to him. But this hypothesis
can have more subtle implications, as the following experiments show.


Cofer and his coworkers have shown that the ability to solve a problem may hinge on 
the presence of word associations. They experimented with the Maier ‘two-string problem’ 
in which two strings are suspended from the ceiling in such a manner that a subject cannot
reach both strings at once. The problem is to tie them together. One ‘insightful’ solution
is to tie something on one string, making a pendulum, start it swinging, hold the other
string, and catch the pendulum string at the end of its return arc. By the use of word
association techniques it was found that people making ‘pendulum solutions’ had the
association ‘rope-swing’ at fairly high strength. In a second experiment subjects were asked to
learn lists of paired words. One group learned ‘rope-swing’ as a pair in a list. Other groups
did not have these words paired, although the individual words were in the lists. At a later
time these groups were tested on the two-string problem, and the group which had learned
the appropriate word pair made the most pendulum solutions. (None of the subjects knew
there was a connection between the memorizing and the problem solving and none of them 
cited the word association as a reason for giving that particular solution.) In other studies 
Cofer has shown that it is also possible to inhibit correct solutions by having subjects learn 
misleading associations. 

Sets of semantic relations may also influence problem solving. It has been shown that 
certain descriptive dimensions are highly interrelated—words like ‘up’ and ‘top’ are asso- 
ciated with ‘light’ (in weight), ‘light’ (in color), and ‘small,’ while words like ‘down’ and 
‘bottom’ are associated with ‘heavy,’ ‘dark,’ and ‘large.’ Thus we may think of a set of 
covariant dimensions: ‘up-down,’ ‘top-bottom,’ ‘light-heavy,’ ‘light-dark,’ and
‘small-large.’ Solley¹⁰¹ has shown that congruence of these dimensions may facilitate problem
solving and incongruence may hinder problem solving. His experimental task was the classic
‘pyramid puzzle.’ The subject is presented with a board with three pegs in it. On one of
the pegs is a pyramid of disks of decreasing size. The problem is to transfer the pyramid to
one of the other pegs, moving only one disk at a time and never putting a larger disk on
top of a smaller one. If the task starts with an inverted pyramid, the rules are the same
except that a smaller disk may not be placed upon a larger one. Varying the color, weight, 
and size of the disks, Solley found that the most rapid solutions were made in the situation 
in which all the dimensions were congruent (i.e., the top disk was smallest, lightest in weight 
and in color, while the bottom disk was largest, heaviest and darkest). The most difficult 
situation was that in which the concordance of the verbal relations with the problem was 
least. As each dimension was altered, the problem became more difficult. 


These studies are intra-cultural, but the same or similar designs could be 
used cross-culturally. For example, could it be shown that the various types 
of solutions of the hanging-strings problem (pendulum, tying an extension on 
one string, elevating oneself by standing on some object, etc.) have significantly 
different frequencies for members of different cultures? Assuming that members 
of different cultures have somewhat different semantic structures (e.g., such
that light-dark is independent of up-down), would their performance
on Solley's pyramid problem differ? A number of designs of this sort are feasible. 

101 Unpublished research, University of Illinois.

Hypothesis II. The behavior initially elicited by stimuli influences the cognitive
states that come to be associated with signs of these stimuli. This is the converse of
Hypothesis I, and it raises the issue as to the origin and development of cognitive
processes. A large number of psychologists, despite their haggling over mechanism,
would agree that meanings, ways of perceiving, concepts, and the like
develop out of the matrix of behavior toward objects and situations. Even at
the sub-human level of the rat, many experiments demonstrate that the significance
of an originally neutral stimulus, e.g., a buzzer signal, can be modified by
association with food, shock, and the like and the total behaviors these stimulus-objects
elicit. Similarly, at the intra-cultural human level, we usually attribute
the fact that one individual reacts with submissiveness and another with hostility
to authority-symbols, for example, to differences in the meanings of such symbols
to them, these in turn being based on past experiences. At the cross-cultural
level, it is often said that some peoples view the world as hostile, some as friendly,
some divide it into things and actions, and some view it as all of one piece.
The present hypothesis would imply that differences in language structure are
not essential, that behavior-cognition relations may be sufficient.

7.4.3.2. Codification and cognition. Here we come most directly into the Whorf 
problem. In the broadest sense, we ask if the structure of a language has an 
influence upon the ‘world view’ (perception, thought, memory, and even
philosophy) of the people employing it. When the question is pared down to
researchable size, the answer clearly seems to be positive. But we must also 
ask the converse question: Does the ‘world view’ of a people influence the 
structure of their language? Whether one concludes ‘yes’ or ‘no’ here seems to 
depend upon what is included in the notion of ‘structure,’ as we shall see. We 
do not need to ask about relations between our third variable, behavior, and 
codification, since it would probably be agreed by everyone that the only way 
in which the structure of language can influence overt behavior (or ‘culture’) is 
via the mediation of cognitive states—and conversely, the only way in which 
behavior (or ‘culture’) can influence the structure of language is also via the
mediation of cognition.

Hypothesis III. The form of codification of the language used to talk about stimuli 
influences the cognitions associated with these stimuli. Here the independent vari- 
able is codification, as indexed usually by linguistic methods, and the dependent 
variable is cognitive states, as indexed by certain criterion behaviors. These 
criterion behaviors are usually some standard psychological measure of symbolic 
activity, such as recognition, recall or reproduction in the case of memory, 
sorting behaviors in the case of concept formation, scoring of ‘stories’ or
‘protocols’ obtained from TAT or Rorschach in the case of ‘ways of perceiving,’ and
so on. The codification variable may be relative availability of labels in the 
lexicon, presence or absence of certain grammatical characteristics, and so forth. 

(1) Intra-cultural approaches. A number of studies on the influence of labeling 
upon retention are available in the standard psychological literature. Language 
may function here as a marker to alter the memory of particular characteristics
of a stimulus or to differentially facilitate the remembering of certain aspects of
situations. An early study by Lehmann in 1889 showed that subjects could
distinguish more easily between shades of gray if they had heard numbers associated
with them. More typical was an experiment by Carmichael, Hogan, and Walter.
It was shown that the names given to diagrams affected the reproduction of the
diagrams when the subjects were later asked to draw them. For example, for a
stimulus figure of two circles connected by a straight line, subjects in one group
were told that it might help them to remember the figure if they remembered
the word ‘spectacles’ and another group was given the key word ‘dumbbells.’
When asked to draw the figures at a later date, the first group tended to distort
the figure by making the straight line much like the nosepiece of a pair of spectacles;
the second group tended to make the line thicker and the circles into
ellipses resembling dumbbells.

A more recent, and more directly relevant, experiment has been reported by 
Lenneberg. First a sample of English speakers was shown a considerable number
of color patches comparable in all respects but hue and asked to label them. It
was found that while certain patches yielded highly consistent labels (e.g.,
patches falling close to that part of the spectrum with which the familiar label
‘red’ is associated), other patches showed little agreement, many compound
descriptions (such as ‘a sort of yellowish green’), and blocking. Presumably if
latencies had been measured they would have shown a corresponding increase.
This step constitutes an empirical estimation of the way in which English 
lexical codification ‘carves up’ the stimulus dimension of wave-length of light. 
(We assume—and experimental evidence supports it—that this particular 
segmentalization of the spectrum is not forced by the human receptor apparatus.) 
The question, tested in the second step of the study, is this: will the codification 
of color terminology (here, labelability) affect cognitive processes? The cognitive 
process studied, with a different group of subjects, was retention as measured by 
recognition over a short interval. Shown a sample of 20 color patches, say, and 
then shown, after an interval, 40 patches in which the original 20 are included, 
will subjects tend to recognize readily labelable colors more correctly than less 
readily labelable ones? Statistically significant differences were obtained in 
favor of the hypothesis. This experiment samples only one of many possible
stimulus continua and only one of many possible cognitive processes; the method
is admirably suited to testing the general hypothesis.
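
The two steps of such a design can be sketched in Python as follows. This is a minimal sketch under stated assumptions: agreement is measured simply as the share of naming responses that match the most common label, the cutoff of 0.5 is arbitrary, the function names are hypothetical, and the toy data are invented for illustration and are not Lenneberg's results.

```python
from collections import Counter


def label_agreement(labels):
    """Proportion of naming responses that match the most common label."""
    counts = Counter(labels)
    return counts.most_common(1)[0][1] / len(labels)


def mean_recognition_by_agreement(naming, recognition, cutoff=0.5):
    """Mean recognition score for patches with high vs. low label agreement."""
    groups = {"high": [], "low": []}
    for patch, labels in naming.items():
        group = "high" if label_agreement(labels) >= cutoff else "low"
        groups[group].append(recognition[patch])
    # None marks a group that happens to contain no patches.
    return {g: (sum(v) / len(v) if v else None) for g, v in groups.items()}


# Invented toy data: step 1 (labels from four subjects per patch) and
# step 2 (hypothetical proportion of a second group recognizing each patch).
NAMING = {
    "patch_1": ["red", "red", "red", "red"],
    "patch_2": ["green", "yellowish green", "olive", "lime"],
}
RECOGNITION = {"patch_1": 1.0, "patch_2": 0.5}

SUMMARY = mean_recognition_by_agreement(NAMING, RECOGNITION)
print(SUMMARY)
```

A real analysis would use many patches and an appropriate significance test; the sketch only shows how the naming data and the recognition data are kept separate and then related.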


But what is the mechanism here? How does the availability of a culturally standardized
label (and associated mediation process) facilitate recall, perceptual isolation, common
classification in sorting and so on? An experiment by Lawrence on the enhancement of
discrimination of cues by rat subjects is relevant here.¹⁰² He found that forcing rats to
‘pay attention’ to a given set of cues (e.g., black vs. white walls), by making these the
discriminanda in a prior discrimination problem, markedly facilitated transfer to quite
different discrimination problems in which the same cues were involved. Rats could even reverse
a discrimination to such ‘attended’ cues better than to others. This seems quite analogous
to the role of language labels with humans: stimulus patterns that come to be associated
with distinctive labels, and acquire differential meaning on the basis of prior differential
reinforcement, later prove to be more distinctive, whether in retention, in categorizing,
in perception, or so forth. But what is the mechanism?

102 D. H. Lawrence, Acquired distinctiveness of cues: II. Selective association in a
constant stimulus situation, Journal of Experimental Psychology 40. 175-88 (1950).

The following type of experiment might get at the answer. A control group is shown a
set of gray patches of varying brightness and simply tested for recognition after a delay
interval. Various experimental groups would be given the same task, but with modifications
in the prior exposure designed to test the following hypotheses: (1) The facilitation may be
due to the association of some distinctive additional cue with the gray. A series of
non-speech sounds, such as patterns of notes, could be used here. (2) The facilitation may be
due to a self-produced movement by the subject. Subjects could be instructed to make
certain movements to each of the grays (preferably overt, although thinking about movements
may also prove to be relevant). (3) The facilitation may be due to the fact that a speech
sound is heard and can be reproduced to facilitate recognition. Nonsense syllables could
be used for this condition, although in a sense giving a name to the color would make the
syllable meaningful. (4) The facilitation may be due to complex discriminatory associations
with meaningful sounds. Names do not usually exist in isolation but indicate that the
object is not something else. For this variation meaningful words would be used. (5) The
facilitation may be due to the secondary reward value of making distinctions marked by
language. One could enhance the secondary reward value of the markers in any of the above
designs by associating them with differential reinforcement.


Traditional concept formation tests involve presentation of stimuli that can be grouped 
according to various categories or dimensions. Such a design can be illustrated concretely 
by reference to a well known concept formation testing device, the Vigotsky blocks. Twenty- 
two blocks differing in height, top surface area (width), shape, and color are to be divided 
into four classes, the ‘correct’ solution involving the intersection of two dichotomies: 
high-low and large-small. These classes are identified in information fed back to the sub- 
ject by nonsense syllables; thus the high-small blocks are called MUR, the low-small ones 
CEV, the high-large ones BIK, and the low-large ones LAG. These ‘names’ are irrelevant 
to the solution, but some subjects attempt to find clues to the basis of classification in them. 
Thus MUR has been interpreted as the French word for wall, while BIK and LAG become 
‘big’ and ‘large,’ respectively. This effect suggests the usefulness of varying the concept 
markers linguistically and determining the effect upon sorting behavior. 

The semantic implications of grammatical categories can also be studied. In some lan-
guages, for example, gender may be applied to inanimate objects. In French semantic choice
is necessary only in a few cases such as adjectives treated as nouns, e.g., le génévois, la
génévoise. It is clear, however, that whenever a sex distinction is reasonable, certain endings
are used with the feminine and others with the masculine. One might expect, then, that in 
the use of gender affixes with inanimate objects there might be some generalization from 
the meaning difference associated with gender when there is a semantic difference. Lists 
of randomly selected masculine and feminine nouns could be analyzed with the semantic 
differential. If there is any generalization between the various nouns in the same gender 
category, there should be some differences in the distribution of the profiles for the two 
groups. This difference should be in the same direction as the difference between masculine 
and feminine forms of adjectives and the profiles for ‘masculine’ and ‘feminine.’ 
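
The comparison proposed in the preceding paragraph can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not a procedure given in the text: the three scale names, every rating, and the helper names `mean_profile`, `difference`, and `direction_agreement` are hypothetical. Mean profiles are formed for the masculine and feminine noun lists, and the direction of their difference on each scale is compared with the direction of the difference between the profiles for the concepts ‘masculine’ and ‘feminine.’

```python
from statistics import mean

# Hypothetical seven-step scales (ratings 1..7); the names are illustrative only.
SCALES = ["good-bad", "strong-weak", "active-passive"]


def mean_profile(profiles):
    """Average a list of per-noun profiles, scale by scale."""
    return [mean(p[i] for p in profiles) for i in range(len(SCALES))]


def difference(a, b):
    """Scale-by-scale difference between two profiles."""
    return [x - y for x, y in zip(a, b)]


def direction_agreement(noun_diff, concept_diff):
    """Fraction of scales on which the two difference profiles share a sign.

    A zero difference on either profile is counted as no agreement.
    """
    same = sum(1 for n, c in zip(noun_diff, concept_diff) if n * c > 0)
    return same / len(noun_diff)


# Invented ratings, for illustration only.
masculine_nouns = [[3, 2, 3], [4, 3, 2], [3, 2, 2]]
feminine_nouns = [[3, 5, 4], [2, 5, 5], [3, 4, 4]]
concept_masculine = [3, 2, 2]
concept_feminine = [2, 5, 4]

noun_diff = difference(mean_profile(masculine_nouns), mean_profile(feminine_nouns))
concept_diff = difference(concept_masculine, concept_feminine)
agreement = direction_agreement(noun_diff, concept_diff)
```

Agreement near 1 would be in line with the generalization hypothesis, while agreement near one half would suggest no systematic relation. An actual study would also need a significance test on the two profile distributions, which this sketch omits.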


A considerable amount of observational evidence has been offered in the
literature in support of the general thesis that the lexical or structural char-
acteristics of language influence overt behavior directly. Typical is Whorf's
analysis of many hundreds of reports of circumstances surrounding the starting 
of fires. For example, a man reports to his insurance company that a certain gas 
drum is ‘empty’ and later tosses a burning cigarette into it, causing an explosion. 
Whorf argues that this behavior is a result of the meaning of ‘empty’ in English, 
leading its users to disregard the explosive vapor. As Lenneberg points out, 




200 PSYCHOLINGUISTICS: A SURVEY OF THEORY AND RESEARCH PROBLEMS




however, English is quite capable of yielding the sentence ‘This empty gas drum
is filled with explosive vapor.’ It seems much more likely that this is an
instance of the influence of cognitive states upon behavior, i.e., the burning
cigarette was carelessly thrown into the vapor-filled drum because the man
perceived it as containing nothing inflammable. Much of the evidence of this
sort, if our argument is valid, really belongs under Hypothesis I above and has
nothing directly to do with the relation of codification to behavior.

(2) Cross-cultural approaches. Much of the anecdotal evidence on the Welt-
anschauung problem is cross-cultural in nature, but most of it runs into the
methodological complications discussed earlier in this section, particularly
circularity of inference and use of translation to prove differences in cognitive
states. Most of the techniques discussed under intra-cultural approaches are
applicable here, as are a number of additional methods which are briefly de-
scribed below.




The design developed by Lenneberg for studying the effects of color labeling upon percep-
tion and retention is readily adaptable to cross-cultural tests. Work with the Zuni is now
in progress. The question is to what extent recognition of colors can be shown to be influ-
enced by differences in codification of colors in different languages, with experimental
procedures held constant. The chief difficulty, of course, is to hold experimental procedures
constant. One usually finds that cultures exert subtle yet powerful effects upon per-
formance in experimental situations: attitudes toward testing situations, motivation,
what is assumed by the subject to be significant, desires to please or not please the investi-
gator, and so forth. A design much like Lenneberg’s could be used to get at the possible way
in which differences in the codification of the time continuum affect cognitive states. The
English language, for example, selects out certain intervals (the ‘second,’ the ‘minute,’
the ‘hour,’ and so on) and applies high-frequency labels to them. Could it be shown,
first intra-culturally, that English language users are more accurate in estimating intervals
that cluster about these labeled segments than in estimating intervals of a relatively
unlabelable magnitude? Could it then be shown cross-culturally that accuracy of interval
estimates (or other indices of temporal cognitions) is influenced by differences in codifica-
tion of the time continuum? There are many other dimensions of experience beyond color
and time that could be studied in this manner, of course.
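
The intra-cultural step of the time-estimation design could be scored along the following lines. This is a rough sketch only: the 10 per cent tolerance that defines ‘near a labeled interval,’ the trial data, and the names `near_label`, `relative_error`, and `mean_error_by_group` are all assumptions introduced for illustration and do not come from the text.

```python
from statistics import mean

# Labeled intervals in seconds: the 'second,' the 'minute,' and the 'hour.'
LABELED_ANCHORS = [1.0, 60.0, 3600.0]
# Assumption: an interval within 10% of an anchor counts as "near a label."
TOLERANCE = 0.10


def near_label(duration):
    """True if the duration lies within TOLERANCE of any labeled anchor."""
    return any(abs(duration - a) / a <= TOLERANCE for a in LABELED_ANCHORS)


def relative_error(true_duration, estimate):
    """Absolute estimation error as a proportion of the true duration."""
    return abs(estimate - true_duration) / true_duration


def mean_error_by_group(trials):
    """Mean relative error for labeled and unlabeled intervals separately."""
    groups = {"labeled": [], "unlabeled": []}
    for true_duration, estimate in trials:
        key = "labeled" if near_label(true_duration) else "unlabeled"
        groups[key].append(relative_error(true_duration, estimate))
    return {k: mean(v) for k, v in groups.items() if v}


# Invented (true, estimated) durations in seconds, for illustration only.
trials = [(60.0, 63.0), (58.0, 58.0), (23.0, 29.9), (140.0, 112.0)]
errors = mean_error_by_group(trials)
```

A smaller mean error in the labeled group than in the unlabeled group would be in the direction the question anticipates. The cross-cultural step would repeat the same scoring with anchors taken from the other language's time vocabulary.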

The phenomena above are researchable cross-culturally because physical and biological 
conditions (e.g., of color production and reception) are constant. Many other cultural 
‘objects’ are constant across two or more cultures despite language differences, and these 
should be equally researchable. The biological relations among individuals, for example
(father, mother, grandfather, cousin, etc., as defined biologically), are constant across
cultures, yet the ways kinship codifications assign relationships differ—could differences 
in perception, attitude, and the like be shown to vary on this basis? Many cultures share 
the 7-day week system, but the codifications differ. 

A different type of cross-cultural investigation would make use of the kind of communica-
tion situation devised by Carroll and described elsewhere in this section (7.1). Suppose that
two individuals who speak the same language are placed at tables separated by a screen
and both have before them the same set of objects (such as blocks and pegs of various shapes,
sizes, and colors). One individual acts as the encoder; given a certain arrangement of his
materials (by the experimenter), he must use his language to communicate this arrange-
ment to the other individual, who acts as the decoder. Questions by the decoder may either
be allowed or not. If a suitably varied set of arrangements and materials were devised
(involving semantic relations of order, distance, direction, color, size, shape, being under,


over, beside, and so forth), it should be possible to study the general question of whether
or not certain languages facilitate or hinder the communication of certain types of informa-
tion. Would the Navajo, for example, have particular difficulty with communicating the
order of events? Would the East Indian typically have trouble with relative distances be-
tween objects? Or, could it be found that all languages can be used equivalently to communi-
cate anything? In view of Lenneberg’s findings with respect to color, this last possibility
seems unlikely; presumably English speakers would have trouble communicating the selec-
tion of blocks or pegs of low color labelability.

Finally, the bilingual individual seems to provide an exceptionally favorable ground for
testing this hypothesis: cross-culturally and cross-linguistically within the same individual
in some cases, cross-linguistically but within the same individual bearing the same culture
in other cases. With the compound-type bilingual (cf. section 6.2) we are dealing with
two different language systems but (presumably) with a constant set of cognitions; with
the coordinate-type bilingual, on the other hand, we are dealing with two different language
systems and two somewhat different cognitive systems (and cultures). While language-
plus-cognition differences in the latter case may be expected to produce differences in
behavior in standard test situations, will language differences per se have any effect in the
former case? Several experimental procedures can be suggested here.

(1) Comparative use of the semantic differential. If the dimensions of the semantic differ- 
ential can be assumed to be even roughly translatable, a method of comparing responses 
of bilinguals to those of monolingual speakers could be developed. One would look for shifts 
in bilinguals toward an intermediate meaning on the scales in which there are differences
between monolinguals in the two languages. Under certain conditions (perhaps a certain
kind of social setting for the bilingual or translating experience) there may be inhibition
of this generalizing process, and bilingual scales may diverge even more sharply than do
monolingual scales on the dimensions for which differences are found in the two languages. 
The types of words used will be important variables as exemplified in the following cases: 

(a) Words which are alike in form class and ‘phonemic shape’ but are of varying degrees 
of closeness in meaning in the two languages. For the clearly unlike meanings one would 
expect minimum generalization, about as much as one would find between homonyms in 
the same language. E.g.: unlike—French and English sensible; more alike—French and 
English sympathy. 

(b) Words which are alike in form class, and are common translations, but differ in stimu-
lus similarity. One might expect more generalization than for words which include a differ-
ent form class as a homonym of similar meaning. E.g.: English sleep is both noun and verb;
French sommeil is only a noun.

(c) Words which have the same common translation word in the second language. One
might expect such words to show a shift towards greater similarity of profile due to the
common translation word. This hypothesis could be tested without the assumption of trans-
latability of the material. E.g.: English dream; French songe and rêve.

(2) Verbal responses to non-verbal stimuli. If two languages apply labels to de-
lineate classes in a domain differently, one would expect that bilingual boundaries would
shift, at least for continuous stimuli. For example, given the spectrum boundaries used by
English speakers for green and by French speakers for vert, one might expect a shift in
bilinguals. If the ambiguous areas in one language are fairly great with considerable un-
certainty about labeling and if the other language clearly includes a border area under one
of the labels, the second language’s boundaries might determine the range of physical
stimuli encompassed by the terms in each language. However, in a finely discriminated
area without much uncertainty in either language, some compromise boundary may result.
If English monolinguals divide the area between yellow and green approximately equally,
while the French monolinguals restrict jaune more narrowly compared to vert, the bilingual
might be expected to shift the English boundary toward the yellow end and achieve some
sort of compromise for both languages.


If one language makes finer discriminations of a given set of stimuli than the other lan-
guage does, bilinguals may try to approximate the differentiations of the first language by
elaborations in the second. In some cases a direct borrowing of terms may occur. For ex-
ample, the Zuni have a word which includes both yellow and orange according to English
labels. The younger Zuni who are more often bilingual have borrowed the English term
orange so that they make a differentiation similar to that made by English monolinguals.
Alternatively, without borrowing the term they might have tried to approximate it by
calling the colors identified as orange in English a combination of yellow and red or some
similar circumlocution. With emotions a similar process may take place. In Crow the same
term is used for situations where English would use surprise and fear, and it may be trans-
lated as either. However, one might expect bilinguals who are asked to compare emotional
situations in Crow to distinguish in some way between the situations because of the fact
that English requires it.

(3) The Thematic Apperception Test can also be used to explore the effects of language
variation upon response to constant stimuli for bilinguals. The latent content of TAT
protocols for both languages of the bilingual would be compared with those for matched
monolinguals. The data from such a design could be used to check variations in the bilin-
gual’s responses from one language to another. Questions: Does the compound-type bi-
lingual show any differences in what is perceived in the pictures as a function of the language
he is using? Does the coordinate-type bilingual show such differences? In what ways do
the perceptions of bilinguals differ from those of matched monolinguals? Under what con-
ditions does borrowing from one language to the other occur in bilinguals?¹⁰


Hypothesis IV. The cognitions associated with stimuli influence the form of
codification of the language used to talk about these stimuli. This reversal of the
usual Weltanschauung notion may strike one as absurd at first glance, but in
certain respects, at least, it is obviously valid.

One underlying proposition here is that perceptual processes of human or- 
ganisms—particularly principles of grouping, figure-ground, continuity, closure 
and constancy (cf., section 3.1)—determine what segments or patterns in the 
continuously variable physical environment will typically acquire labels. The 
detachability of leaves from trees, for example, makes it more probable that 
languages in general will have separate labels for ‘tree’ and ‘leaf’ than for ‘trunk’
as differentiated from ‘branch.’ Similarly ‘hand’ should be distinguished from
‘arm’ more regularly than upper forearm from lower forearm by separate labels.
Separate labels for boys vs. men should be more common than separate labels
for men of 30-40 vs. men of 40-50. Research on this proposition would require
independent assessment of the ‘discriminability’ of a wide sample of such physical
and social objects and situations (either directly through perception techniques
or indirectly through physical measurement on such objects in relation to
perception theory) and correlation of this set of facts with cross-cultural fre-
quency of language distinction by labeling.
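
The correlation called for here could be computed, for instance, as a rank correlation between discriminability scores and the proportion of sampled languages that mark each contrast with separate labels. The sketch below is illustrative only: the choice of Spearman's coefficient is an assumption rather than something the text specifies, the scores and proportions are invented, and the simple ranking helper assumes there are no tied values.

```python
def ranks(values):
    """Rank values from 1 (smallest) upward; assumes no ties, for simplicity."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result


def spearman(x, y):
    """Spearman rank correlation via the sum of squared rank differences."""
    n = len(x)
    d_squared = sum((a - b) ** 2 for a, b in zip(ranks(x), ranks(y)))
    return 1 - 6 * d_squared / (n * (n * n - 1))


# Invented values: (discriminability score, proportion of languages
# that give the two members of the contrast separate labels).
contrasts = {
    "tree / leaf": (0.90, 0.95),
    "hand / arm": (0.80, 0.85),
    "boy / man": (0.70, 0.90),
    "trunk / branch": (0.40, 0.30),
    "men 30-40 / men 40-50": (0.10, 0.02),
}

discriminability = [d for d, _ in contrasts.values()]
label_frequency = [f for _, f in contrasts.values()]
rho = spearman(discriminability, label_frequency)
```

A coefficient near +1 would be consistent with the proposition. The discriminability scores must be obtained independently of the language data, as the text requires, or the argument becomes circular.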

Another proposition here is that the lexicon of a language should be isomorphic,
so to speak, with the pattern of the culture of its users. If one group of people
lives mainly by fishing, their language lexicon should be appropriately expansive
and discriminative in this area: names for varieties and sub-varieties of fish,
names for types of boats and nets and hooks, and names for the many, many


10 Susan Ervin is presently collecting data from bilinguals with this hypothesis in mind. 




details of social organization that surround fishing. A group of people that has
an elaborate and complicated kinship and marriage system should have an
elaborate and complicated system of kinship and marriage terms. This
is almost trite, of course. However, it clearly fits the hypothesis
above. It is absurd to assume that people develop a fishing complex, for
example, because they happen to have a language with lots of terms relevant to
it. In this connection, Hoijer has described a correlation between the
Navajos’ grammatical preoccupation with movement and their nomadic life;
it would seem absurd to conclude that the Navajos took to a nomadic
life because their language happened to have this grammatical character-
istic.




